Sequential learning of regression models by penalized estimation

Abstract: When data arrive in a sequence of two or more datasets, modeling on the most recent dataset should take previous datasets into account. We specifically investigate a strategy for regression modeling when parameter estimates from previous data can be used as anchoring points, yet may not be available for all parameters, thus, covariance information cannot be reused. A procedure that updates through targeted penalized estimation, which shrinks the estimator toward a nonzero value, is presented. The parameter estimate from the previous data serves as this nonzero value when an update is sought from novel data. This naturally extends to a sequence of datasets with the same response, but potentially only partial overlap in covariates. The iteratively updated regression parameter estimator is shown to be asymptotically unbiased and consistent. The penalty parameter is chosen through constrained cross-validated log-likelihood optimization. The constraint bounds the amount of shrinkage of the updated estimator toward the current one from below. The bound aims to preserve the (updated) estimator’s goodness of fit on all-but-the-novel data. The proposed approach is compared to other regression modeling procedures. Finally, it is illustrated on an epidemiological study where the data arrive in batches with different covariate-availability and the model is refitted with the availability of a novel batch. Supplementary materials for this article are available online

Location
Deutsche Nationalbibliothek Frankfurt am Main
Extent
Online-Ressource
Language
Englisch
Notes
Journal of computational and graphical statistics. - 31, 3 (2022) , 877-886, ISSN: 1537-2715

Classification
Wirtschaft

Event
Veröffentlichung
(where)
Freiburg
(who)
Universität
(when)
2022
Creator
van Wieringen, Wessel N.
Binder, Harald

DOI
10.1080/10618600.2022.2035231
URN
urn:nbn:de:bsz:25-freidok-2266424
Rights
Open Access; Der Zugriff auf das Objekt ist unbeschränkt möglich.
Last update
03.01.2032, 1:49 PM CET

Data provider

This object is provided by:
Deutsche Nationalbibliothek. If you have any questions about the object, please contact the data provider.

Associated

Time of origin

  • 2022

Other Objects (12)