Unique ID: 2015070

Division: Programing and Coordination of Statistical Surveys Department
Issue Date: February 13th 2019
Last modified: February 22nd 2019
Collaborative

Big Data Sandbox web scraping project

Evaluation of technologies and methodologies for web scraped enterprise data

Collaboration in big data sandbox on web scraping on enterprise data. Evaluation of technologies and methodologies for web scraped enterprise data.

Project Objective:

Exploration, Scientific / research

Project Sources
Project Sources
Type Of Institution: National statistical office
Big Data Source: Web scraping data
Region: Europe & Central Asia
Country Area: Poland
Id Country Regional: country
Partnerships
Partnerships
Other Partners: Other
Partnership Comments: UNECE, National Statistical Offices
Accessing Data
Accessing Data
Data Access Rights: Broader access rights
Data Coverage
Data Coverage
Data Coverage: Only a portion of all data
Coverage Geo Pop: Whole country / high % of market
Cost Implication: Free
Data Quality
Data Quality
Quality Aspects Evaluated: Privacy and Security, Institutional/Business Environment
Validation Comments: Not yet.
Quality Framework Comments: Not yet.
Data Quality Concerns:
Methodology
Methodology
Methods Used: Other methods
Technologies
Technologies
Technologies: NoSQL database, Data mining tools, Data visualization tools, Hadoop Clusters
Other
Other
Income Level: High-income
Iso: PL
Timeframe To Produce Indicator: NA
Write Your Own Review
You're reviewing:Big Data Sandbox web scraping project
Your Rating