Research

Constrained Learning for Causal Inference and Semiparametric Statistics

Tiffany Cai*, Yuri Fonseca*, Kaiwen Hou, Hongseok Namkoong (* denotes co-first authorship)

Paper link

Forthcoming

Poster at CODE@MIT 2024 and ACIC 2024

Summary In challenging settings with limited overlap between treatment and control groups, causal estimators with desirable asymptotic properties require ad hoc adjustments to produce stable estimates. In contrast, simple plug-in estimators produce stable estimates but lack important asymptotic properties. We propose a new estimation framework based on constrained optimization that combines the best of both worlds, and we demonstrate its superior performance across settings, including ones with text covariates.

Posterior Sampling via Autoregressive Generation

Kelly Wang Zhang*, Tiffany Cai*, Hongseok Namkoong, Daniel Russo (* denotes co-first authorship)

Paper link

Poster at NeurIPS 2024 Workshop: Bayesian Decision-Making and Uncertainty; talk at the 2024 Economics and AI+ML Meeting

Summary We propose a scalable solution to the problem of decision-making under uncertainty in a meta-bandit setting by using a calibrated generative model to impute a sequence of missing (e.g., future) rewards. Our proposed method is a principled implementation of Thompson (a.k.a. posterior) sampling. We prove that decision-making performance is controlled by the log loss of the generative model, and we demonstrate the method on a news recommendation setting with text covariates.
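As a rough illustration of the imputation idea only, and not the paper's implementation, the sketch below runs it on a toy Bernoulli bandit. It assumes a Beta-Bernoulli posterior predictive as a stand-in for the learned generative model; the names `impute_mean` and `choose_arm`, the uniform prior, and the number of imputed rewards are all hypothetical choices made for this example.

```python
import random


def impute_mean(observed, n_future, rng, prior_a=1.0, prior_b=1.0):
    """Autoregressively generate n_future binary rewards, then average everything.

    Assumption: a Beta(prior_a, prior_b)-Bernoulli posterior predictive plays
    the role of the generative model.
    """
    if n_future < 1:
        raise ValueError("n_future must be at least 1")
    successes, total = sum(observed), len(observed)
    for _ in range(n_future):
        # Predictive probability of a reward of 1 given every reward so far,
        # including the ones generated in earlier iterations.
        p = (prior_a + successes) / (prior_a + prior_b + total)
        successes += 1 if rng.random() < p else 0
        total += 1
    # Mean over observed and imputed rewards for this arm.
    return successes / total


def choose_arm(history, n_future, rng):
    """Pick the arm whose observed-plus-imputed mean reward is largest."""
    means = [impute_mean(obs, n_future, rng) for obs in history]
    return max(range(len(means)), key=means.__getitem__)
```

For example, `choose_arm([[1, 0, 1], [0, 0]], n_future=500, rng=random.Random(0))` returns the index of the arm to play next. In this toy model, as `n_future` grows the imputed mean behaves like a draw from the Beta posterior over the arm's success rate, which is why acting greedily on it resembles Thompson sampling.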

Diagnosing Model Performance Under Distribution Shift

Tiffany Cai, Steve Yadlowsky, Hongseok Namkoong

Paper link, GitHub link

Under revision at Operations Research; presented at FORC 2023, INFORMS 2023

Summary When your model performs worse out of distribution, should you use a domain adaptation method, or do you need to collect more data? If the latter, from where should you collect more data? We propose a new diagnostic using causal inference methods to attribute changes in performance to shifts in the covariate distribution (X shifts) and shifts in the conditional distribution of outcomes given covariates (Y|X shifts). We demonstrate its utility in settings with tabular and image data.
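To make the attribution idea concrete, here is a deliberately simplified sketch, not the paper's estimator. It assumes a discrete covariate with full overlap between the two samples and uses a plain telescoping decomposition; the function names `group_stats` and `decompose_shift` are hypothetical.

```python
from collections import defaultdict


def group_stats(xs, losses):
    """Return {covariate value: (proportion, mean loss)} for one sample."""
    totals, counts = defaultdict(float), defaultdict(int)
    for x, loss in zip(xs, losses):
        totals[x] += loss
        counts[x] += 1
    n = len(xs)
    return {x: (counts[x] / n, totals[x] / counts[x]) for x in counts}


def decompose_shift(src_x, src_loss, tgt_x, tgt_loss):
    """Split the change in average loss into X-shift and Y|X-shift terms.

    Assumption: every covariate value appears in both samples (full overlap).
    """
    src, tgt = group_stats(src_x, src_loss), group_stats(tgt_x, tgt_loss)
    if set(src) != set(tgt):
        raise ValueError("this sketch requires the same covariate values in both samples")
    # Change caused by the covariate mix moving, holding per-group loss at source levels.
    x_shift = sum((tgt[x][0] - src[x][0]) * src[x][1] for x in src)
    # Change caused by per-group loss moving, weighted by the target covariate mix.
    y_given_x_shift = sum(tgt[x][0] * (tgt[x][1] - src[x][1]) for x in src)
    return x_shift, y_given_x_shift
```

The two terms sum exactly to the difference between the target and source average losses. In this toy version, a large X-shift term means the covariate mix changed while per-group performance held, and a large Y|X-shift term means performance changed within covariate groups.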

Tutorial: Modeling and Exploiting Data Heterogeneity under Distribution Shifts

Jiashuo Liu, Tiffany Cai, Peng Cui, Hongseok Namkoong

Tutorial link

Presented at NeurIPS 2023