Artikel
Survey vs scraped data: Comparing time series properties of web and survey vacancy data
This paper studies the relationship between a vacancy population obtained from web crawling and vacancies in the economy inferred by a National Statistics Office (NSO) using a traditional method. We compare the time series properties of samples obtained between 2007 and 2014 by Statistics Netherlands and by a web scraping company. We find that the web and NSO vacancy data present similar time series properties, suggesting that both time series are generated by the same underlying phenomenon: the real number of new vacancies in the economy. We conclude that, in our case study, web-sourced data are able to capture aggregate economic activity in the labor market.
- Language
-
Englisch
- Bibliographic citation
-
Journal: IZA Journal of Labor Economics ; ISSN: 2193-8997 ; Volume: 8 ; Year: 2019 ; Issue: 4 ; Pages: 1-23 ; Warsaw: Sciendo
- Classification
-
Wirtschaft
Labor Demand
Labor Turnover; Vacancies; Layoffs
Single Equation Models; Single Variables: Time-Series Models; Dynamic Quantile Regressions; Dynamic Treatment Effect Models; Diffusion Processes
Data Collection and Data Estimation Methodology; Computer Programs: General
- Subject
-
web crawling
statistical inference
time series
vacancies
Labor demand
data collection
- Event
-
Geistige Schöpfung
- (who)
-
de Pedraza, Pablo
Visintin, Stefano
Tijdens, Kea Gartje
Kismihók, Gábor
- Event
-
Veröffentlichung
- (who)
-
Sciendo
- (where)
-
Warsaw
- (when)
-
2019
- DOI
-
doi:10.2478/izajole-2019-0004
- Handle
- Last update
-
10.03.2025, 11:42 AM CET
Data provider
ZBW - Deutsche Zentralbibliothek für Wirtschaftswissenschaften - Leibniz-Informationszentrum Wirtschaft. If you have any questions about the object, please contact the data provider.
Object type
- Artikel
Associated
- de Pedraza, Pablo
- Visintin, Stefano
- Tijdens, Kea Gartje
- Kismihók, Gábor
- Sciendo
Time of origin
- 2019