Working paper

Double/debiased machine learning for treatment and structural parameters

We revisit the classic semiparametric problem of inference on a low-dimensional parameter θ₀ in the presence of high-dimensional nuisance parameters η₀. We depart from the classical setting by allowing for η₀ to be so high-dimensional that the traditional assumptions, such as Donsker properties, that limit complexity of the parameter space for this object break down. To estimate η₀, we consider the use of statistical or machine learning (ML) methods which are particularly well-suited to estimation in modern, very high-dimensional cases. ML methods perform well by employing regularization to reduce variance and trading off regularization bias with overfitting in practice. However, both regularization bias and overfitting in estimating η₀ cause a heavy bias in estimators of θ₀ that are obtained by naively plugging ML estimators of η₀ into estimating equations for θ₀. This bias results in the naive estimator failing to be N^(-1/2)-consistent, where N is the sample size. We show that the impact of regularization bias and overfitting on estimation of the parameter of interest θ₀ can be removed by using two simple, yet critical, ingredients: (1) using Neyman-orthogonal moments/scores that have reduced sensitivity with respect to nuisance parameters to estimate θ₀, and (2) making use of cross-fitting which provides an efficient form of data-splitting. We call the resulting set of methods double or debiased ML (DML). We verify that DML delivers point estimators that concentrate in an N^(-1/2)-neighborhood of the true parameter values and are approximately unbiased and normally distributed, which allows construction of valid confidence statements.
The generic statistical theory of DML is elementary and simultaneously relies on only weak theoretical requirements which will admit the use of a broad array of modern ML methods for estimating the nuisance parameters such as random forests, lasso, ridge, deep neural nets, boosted trees, and various hybrids and ensembles of these methods. We illustrate the general theory by applying it to provide theoretical properties of DML applied to learn the main regression parameter in a partially linear regression model, DML applied to learn the coefficient on an endogenous variable in a partially linear instrumental variables model, DML applied to learn the average treatment effect and the average treatment effect on the treated under unconfoundedness, and DML applied to learn the local average treatment effect in an instrumental variables setting. In addition to these theoretical applications, we also illustrate the use of DML in three empirical examples.
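The two ingredients named in the abstract can be illustrated for the partially linear regression model. The following is a minimal sketch, not the authors' implementation. It assumes the model Y = D·θ₀ + g₀(X) + U with D = m₀(X) + V, uses the partialling-out score (one common Neyman-orthogonal choice for this model), and lets a plain ridge regression stand in for an arbitrary ML learner. The function names `ridge_fit_predict` and `dml_plr`, the simulated data, and all tuning values are hypothetical and chosen only for illustration.

```python
import numpy as np


def ridge_fit_predict(X_train, y_train, X_test, alpha=1.0):
    """Fit ridge regression (with intercept) on one sample, predict on another.

    Stand-in for any ML learner of a conditional mean (assumption of this sketch).
    """
    x_mean, y_mean = X_train.mean(axis=0), y_train.mean()
    Xc = X_train - x_mean
    coef = np.linalg.solve(Xc.T @ Xc + alpha * np.eye(Xc.shape[1]),
                           Xc.T @ (y_train - y_mean))
    return y_mean + (X_test - x_mean) @ coef


def dml_plr(y, d, X, n_folds=5, seed=0):
    """Cross-fitted estimate of theta_0 in Y = D*theta_0 + g_0(X) + U."""
    n = len(y)
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(n), n_folds)
    y_res = np.empty(n)
    d_res = np.empty(n)
    for test_idx in folds:
        # Cross-fitting: nuisance functions are learned on the other folds only.
        train = np.ones(n, dtype=bool)
        train[test_idx] = False
        y_res[test_idx] = y[test_idx] - ridge_fit_predict(X[train], y[train], X[test_idx])
        d_res[test_idx] = d[test_idx] - ridge_fit_predict(X[train], d[train], X[test_idx])
    # Orthogonal (partialling-out) score: regress Y-residuals on D-residuals.
    theta = np.sum(d_res * y_res) / np.sum(d_res * d_res)
    # Plug-in standard error based on the same score.
    psi = d_res * (y_res - theta * d_res)
    se = np.sqrt(np.mean(psi ** 2) / np.mean(d_res ** 2) ** 2 / n)
    return theta, se


# Simulated data with a known effect theta_0 = 0.5 (illustrative values only).
rng = np.random.default_rng(42)
n, p, theta0 = 2000, 20, 0.5
X = rng.normal(size=(n, p))
d = X @ np.full(p, 0.3) + rng.normal(size=n)
y = theta0 * d + X @ np.full(p, 0.5) + rng.normal(size=n)

theta_hat, se = dml_plr(y, d, X)
print(f"theta_hat = {theta_hat:.3f}, 95% CI half-width = {1.96 * se:.3f}")
```

In this sketch each observation's residuals come from nuisance fits that never saw that observation, which is what limits the overfitting bias, while residualizing both Y and D is what reduces sensitivity to errors in the nuisance estimates.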

Language: English

Published in: Series: cemmap working paper; No. CWP28/17

Classification: Economics

Subject: Causal analysis
Econometrics

Event: Intellectual creation

(who): Chernozhukov, Victor
Chetverikov, Denis
Demirer, Mert
Duflo, Esther
Hansen, Christian B.
Newey, Whitney K.
Robins, James

Event: Publication

(who): Centre for Microdata Methods and Practice (cemmap)

(where): London

(when): 2017

DOI: doi:10.1920/wp.cem.2017.2817

Handle: http://hdl.handle.net/10419/189736

Last updated: 10.03.2025, 11:42 CET

Data partner

This object is provided by:
ZBW - German National Library of Economics - Leibniz Information Centre for Economics. If you have questions about the object, please contact the data partner.


Object type

Working paper

Contributors

Chernozhukov, Victor
Chetverikov, Denis
Demirer, Mert
Duflo, Esther
Hansen, Christian B.
Newey, Whitney K.
Robins, James
Centre for Microdata Methods and Practice (cemmap)

Created

2017

Similar objects (12)

Working paper

Double/de-biased machine learning using regularized Riesz representers

Working paper

Simultaneous inference for best linear predictor of the conditional average treatment effect and other structural functions

Working paper

Double machine learning for treatment and causal parameters

Working paper

High dimensional and inference methods on structural an treatment effects

Working paper

Generic machine learning inference on heterogenous treatment effects in randomized experiments

Working paper

Estimation of treatment effects with high-dimensional controls

Working paper

Inference on average treatment effects in aggregate panel data settings

Working paper

Inference on treatment effects after selection amongst high-dimensional controls

Working paper

Inference on causal and structural parameters using many moment inequalities

Working paper

Exact and robust conformal inference methods for predictive machine learning with dependent data

Working paper

Posterior inference in curved exponential families under increasing dimensions

Working paper

On the computational complexity of MCMC-based estimators in large samples


