Academia.eduAcademia.edu

Machine Learning for Causal Inference

description12 papers
group20 followers
lightbulbAbout this topic
Machine Learning for Causal Inference is an interdisciplinary field that combines machine learning techniques with causal inference methodologies to identify and estimate causal relationships from observational data, enabling researchers to draw conclusions about the effects of interventions or treatments while accounting for confounding variables.
lightbulbAbout this topic
Machine Learning for Causal Inference is an interdisciplinary field that combines machine learning techniques with causal inference methodologies to identify and estimate causal relationships from observational data, enabling researchers to draw conclusions about the effects of interventions or treatments while accounting for confounding variables.

Key research themes

1. How can machine learning methods be integrated with causal discovery algorithms to improve causal model identification and estimation from observational data?

This theme investigates the development and application of machine learning (ML) approaches to enhance causal discovery from purely observational data, addressing challenges of small samples, complex high-dimensional data, and model misspecifications. It is crucial because traditional causal discovery methods often rely on restrictive assumptions or experiments that are infeasible, and ML offers novel tools to deal with these limitations by learning flexible, data-driven representations and causal structures that can generalize beyond mere associations.

Key finding: Proposes SLdisco, a supervised ML-based causal discovery algorithm trained on simulated data to jointly learn mapping from observational data to causal equivalence classes. SLdisco outperforms existing sequential, conditional... Read more
Key finding: Reviews constraint-based and structural equation model (SEM) approaches to causal discovery from i.i.d and time series data, emphasizing use of conditional independence tests and structural constraints for identifiability.... Read more
Key finding: Surveys methods for learning causal networks represented as graphs from data, focusing on the challenges of soundness, completeness, and scalability. Introduces techniques that incorporate learning heuristics and evaluation... Read more
Key finding: Extends causal discovery to relational data by introducing relational causal models (RCM) and the RCD-Light algorithm, a constraint-based supervised method that learns causal structures considering adjacency-faithfulness and... Read more

2. What are the methodological advances and applications of causal machine learning in high-dimensional and heterogeneous healthcare data?

This theme explores how causal machine learning (CML) methods address unique challenges of healthcare data — such as multi-modal, high-dimensional, temporal, and confounded observational datasets — to estimate individualized treatment effects and enable actionable, personalized decision-making. It matters because causal predictions, unlike association-based ML, allow clinical decision-support systems (CDSs) to predict responses to interventions robustly, improving precision medicine and overcoming issues like out-of-distribution generalization.

Key finding: Synthesizes three main CML directions: causal representation learning, causal discovery, and causal reasoning, focusing on their application in healthcare. Demonstrates that CML can handle high-dimensional and unstructured... Read more
Key finding: Develops a fully data-driven causal model of Alzheimer's disease biomarker trajectories derived from large multi-center biomarker datasets. The approach integrates causal discovery, sensitivity analysis, and patient-level... Read more
Key finding: Proposes a novel framework combining model-based estimation with offline unlabeled datasets to improve causal estimation in dynamic systems under model misspecification and dataset shift. Through theoretical analysis and... Read more
by David HASON RUDD and 
1 more
Key finding: Integrates recursive feature elimination, ensemble neural networks, and Bayesian networks to perform causal discovery and prediction in a high-dimensional financial dataset predicting customer churn. Demonstrates identifying... Read more

3. How can causal inference validate models and identify causal direction using observational and two-variable data, particularly under constraints like latent variables or unmeasured confounders?

This theme examines approaches focused on resolving causal directionality and validating causal models from observational data, especially when randomized experiments or multi-variable graph-based methods are unavailable or infeasible. Key issues addressed include overcoming the limitations of conditional independence methods in bivariate settings, utilizing independence of cause and mechanism postulates, and employing influence functions for model validation. This is critical for ensuring robustness and interpretability of causal claims in observational studies.

Key finding: Develops a novel validation procedure using influence functions to estimate the estimation error of causal inference models without access to counterfactual data, enabling cross-validation-like model selection in... Read more
Key finding: Bridges two causal inference paradigms—conditional independence-based graph methods and independence of cause and mechanism methods—by theoretically showing how latent instrumental variables manifest indirectly in causal... Read more
Key finding: Proposes simple criteria based on the principle that predicting effect from cause should be algorithmically simpler than the inverse (cause from effect), operationalized via complexity measures related to minimum description... Read more
Key finding: Establishes that bounding individualized causal effects (probabilities of causation) is not identifiable from population experimental or observational data alone. Demonstrates that incorporating structural assumptions via... Read more

All papers in Machine Learning for Causal Inference

Traditional spatiotemporal data analysis often relies on predictive models that overlook causal relationships, making it difficult to identify true drivers and formulate effective interventions. To bridge this gap, we review causal... more
“Causal Computation Theory” introduces a comprehensive framework for structuring computation around causal reasoning, enabling intelligent systems to move beyond pattern recognition into structured, adaptive decision-making under... more
Effective customer relationship management (CRM) techniques are essential in today's business environments for companies looking to maximize client interactions and increase revenue. This paper addresses customer churn, a significant... more
This is one of the challenges that new and fast-growing econometric literature is beginning to tackle in addressing causal inference problems with machine learning methods. Yet, empirical economics still has not really made use of the... more
Customer churn describes terminating a relationship with a business or reducing customer engagement over a specific period. Customer acquisition cost can be five to six times that of customer retention, hence investing in customers with... more
We propose counterfactual reasoning through probabilistic logic twin networks (PLTNs) to prevent collisions in self-driving cars. The basis of a PLTNs is a causal Bayesian network (cBN ) partially learned from simulated self-driving car... more
by David HASON RUDD and 
1 more
Customer churn describes terminating a relationship with a business or reducing customer engagement over a specific period. Customer acquisition cost can be five to six times that of customer retention, hence investing in customers with... more
We apply causal machine learning algorithms to assess the causal effect of a marketing intervention, namely a coupon campaign, on the sales of a retailer. Besides assessing the average impacts of different types of coupons, we also... more
At the heart of causal structure learning from observational data lies a deceivingly simple question: given two statistically dependent random variables, which one has a causal effect on the other? This is impossible to answer using... more
At the heart of causal structure learning from observational data lies a deceivingly simple question: given two statistically dependent random variables, which one has a causal effect on the other? This is impossible to answer using... more
At the heart of causal structure learning from observational data lies a deceivingly simple question: given two statistically dependent random variables, which one has a causal effect on the other? This is impossible to answer using... more
We consider evaluating the causal effects of dynamic treatments, i.e. of multiple treatment sequences in various periods, based on double machine learning to control for observed, time-varying covariates in a data-driven way under a... more
At the heart of causal structure learning from observational data lies a deceivingly simple question: given two statistically dependent random variables, which one has a causal effect on the other? This is impossible to answer using... more
Download research papers for free!