Arbeitspapier

How valid can data fusion be?

Data fusion techniques typically aim to achieve a complete data file from different sources which do not contain the same units. Traditionally, this is done on the basis of variables common to all files. It is well known that those approaches establish conditional independence of the specific variables given the common variables, although they may be conditionally dependent in reality. We discuss the objectives of data fusion in the light of their feasibility and distinguish four levels of validity that a fusion technique may achieve. For a rather general situation, we derive the feasible set of correlation matrices for the variables not jointly observed and suggest a new quality index for data fusion. Finally, we present a suitable and effcient multiple imputation procedure to make use of auxiliary information and to overcome the conditional independence assumption.

Language
Englisch

Bibliographic citation
Series: IAB-Discussion Paper ; No. 15/2006

Classification
Wirtschaft
Bayesian Analysis: General
Statistical Simulation Methods: General
Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
Subject
Daten
Datenaufbereitung
Datenqualität
Korrelation
Validität
angewandte Statistik
mathematische Statistik

Event
Geistige Schöpfung
(who)
Kiesl, Hans
Rässler, Susanne
Event
Veröffentlichung
(who)
Institut für Arbeitsmarkt- und Berufsforschung (IAB)
(where)
Nürnberg
(when)
2006

Handle
Last update
10.03.2025, 11:43 AM CET

Data provider

This object is provided by:
ZBW - Deutsche Zentralbibliothek für Wirtschaftswissenschaften - Leibniz-Informationszentrum Wirtschaft. If you have any questions about the object, please contact the data provider.

Object type

  • Arbeitspapier

Associated

  • Kiesl, Hans
  • Rässler, Susanne
  • Institut für Arbeitsmarkt- und Berufsforschung (IAB)

Time of origin

  • 2006

Other Objects (12)