Arbeitspapier
Finding needles in haystacks: Multiple-imputation record linkage using machine learning
This paper considers the problem of record linkage between a household-level survey and an establishment-level frame in the absence of unique identifiers. Linkage between frames in this setting is challenging because the distribution of employment across establishments is highly skewed. To address these difficulties, this paper develops a probabilistic record linkage methodology that combines machine learning (ML) with multiple imputation (MI). This ML-MI methodology is applied to link survey respondents in the Health and Retirement Study to their workplaces in the Census Business Register. The linked data reveal new evidence that non-sampling errors in household survey data are correlated with respondents' workplace characteristics.
- Language
-
Englisch
- Bibliographic citation
-
Series: Working Papers ; No. 22-11
- Classification
-
Wirtschaft
Estimation: General
Methodological Issues: General
Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
- Subject
-
Administrative data
machine learning
multiple imputation
probabilistic record linkage
survey data
- Event
-
Geistige Schöpfung
- (who)
-
Abowd, John M.
Abramowitz, Joelle
Levenstein, Margaret
McCue, Kristin
Patki, Dhiren
Raghunathan, Trivellore
Rodgers, Ann M.
Shapiro, Matthew D.
Nasi Wada
Zinsser, Dawn
- Event
-
Veröffentlichung
- (who)
-
Federal Reserve Bank of Boston
- (where)
-
Boston, MA
- (when)
-
2022
- DOI
-
doi:10.29412/res.wp.2022.11
- Handle
- Last update
-
10.03.2025, 11:44 AM CET
Data provider
ZBW - Deutsche Zentralbibliothek für Wirtschaftswissenschaften - Leibniz-Informationszentrum Wirtschaft. If you have any questions about the object, please contact the data provider.
Object type
- Arbeitspapier
Associated
- Abowd, John M.
- Abramowitz, Joelle
- Levenstein, Margaret
- McCue, Kristin
- Patki, Dhiren
- Raghunathan, Trivellore
- Rodgers, Ann M.
- Shapiro, Matthew D.
- Nasi Wada
- Zinsser, Dawn
- Federal Reserve Bank of Boston
Time of origin
- 2022