Artikel

Survey vs scraped data: Comparing time series properties of web and survey vacancy data

This paper studies the relationship between a vacancy population obtained from web crawling and vacancies in the economy inferred by a National Statistics Office (NSO) using a traditional method. We compare the time series properties of samples obtained between 2007 and 2014 by Statistics Netherlands and by a web scraping company. We find that the web and NSO vacancy data present similar time series properties, suggesting that both time series are generated by the same underlying phenomenon: the real number of new vacancies in the economy. We conclude that, in our case study, web-sourced data are able to capture aggregate economic activity in the labor market.

Language
Englisch

Bibliographic citation
Journal: IZA Journal of Labor Economics ; ISSN: 2193-8997 ; Volume: 8 ; Year: 2019 ; Issue: 4 ; Pages: 1-23 ; Warsaw: Sciendo

Classification
Wirtschaft
Labor Demand
Labor Turnover; Vacancies; Layoffs
Single Equation Models; Single Variables: Time-Series Models; Dynamic Quantile Regressions; Dynamic Treatment Effect Models; Diffusion Processes
Data Collection and Data Estimation Methodology; Computer Programs: General
Subject
web crawling
statistical inference
time series
vacancies
Labor demand
data collection

Event
Geistige Schöpfung
(who)
de Pedraza, Pablo
Visintin, Stefano
Tijdens, Kea Gartje
Kismihók, Gábor
Event
Veröffentlichung
(who)
Sciendo
(where)
Warsaw
(when)
2019

DOI
doi:10.2478/izajole-2019-0004
Handle
Last update
10.03.2025, 11:42 AM CET

Data provider

This object is provided by:
ZBW - Deutsche Zentralbibliothek für Wirtschaftswissenschaften - Leibniz-Informationszentrum Wirtschaft. If you have any questions about the object, please contact the data provider.

Object type

  • Artikel

Associated

  • de Pedraza, Pablo
  • Visintin, Stefano
  • Tijdens, Kea Gartje
  • Kismihók, Gábor
  • Sciendo

Time of origin

  • 2019

Other Objects (12)