Arbeitspapier

Neglected heterogeneity, Simpson's paradox, and the anatomy of least squares

When a sample combines data from two or more groups, multivariate regression yields a matrix-weighted average of the group-specific coefficient vectors. However, it is possible that the weighted average of a specific coefficient falls outside the range of the group-specific coefficients, and it may even have a different sign compared to both group-level coefficients, a manifestation of Simpson's paradox. The result of the combined regression is then prone to misinterpretation. The purpose of this paper is to raise awareness of this problem and to state conditions under which such non-convex weighting or sign reversal can arise, for a model with two regressors and two groups. Two illustrative examples, an investment equation estimated with panel data, and a cross-sectional earnings equation for men and women, highlight the relevance of these findings for applied work.

Language
Englisch

Bibliographic citation
Series: Working Paper ; No. 426

Classification
Wirtschaft
Single Equation Models; Single Variables: Cross-Sectional Models; Spatial Models; Treatment Effect Models; Quantile Regressions
Subject
Covariance-weighting
heterogeneity spillover
non-convex average
average treatment effect

Event
Geistige Schöpfung
(who)
Winkelmann, Rainer
Event
Veröffentlichung
(who)
University of Zurich, Department of Economics
(where)
Zurich
(when)
2023

DOI
doi:10.5167/uzh-229123
Handle
Last update
10.03.2025, 11:41 AM CET

Data provider

This object is provided by:
ZBW - Deutsche Zentralbibliothek für Wirtschaftswissenschaften - Leibniz-Informationszentrum Wirtschaft. If you have any questions about the object, please contact the data provider.

Object type

  • Arbeitspapier

Associated

  • Winkelmann, Rainer
  • University of Zurich, Department of Economics

Time of origin

  • 2023

Other Objects (12)