Machine Learning for Causal Inference

12 papers

20 followers

About this topic

Machine Learning for Causal Inference is an interdisciplinary field that combines machine learning techniques with causal inference methodologies to identify and estimate causal relationships from observational data, enabling researchers to draw conclusions about the effects of interventions or treatments while accounting for confounding variables.

Key research themes

1. How can machine learning methods be integrated with causal discovery algorithms to improve causal model identification and estimation from observational data?

This theme investigates the development and application of machine learning (ML) approaches to enhance causal discovery from purely observational data, addressing challenges of small samples, complex high-dimensional data, and model misspecifications. It is crucial because traditional causal discovery methods often rely on restrictive assumptions or experiments that are infeasible, and ML offers novel tools to deal with these limitations by learning flexible, data-driven representations and causal structures that can generalize beyond mere associations.

Causal discovery for observational sciences using supervised machine learning

by Peter Spirtes

2024, arXiv (Cornell University)

Key finding: Proposes SLdisco, a supervised ML-based causal discovery algorithm trained on simulated data to jointly learn mapping from observational data to causal equivalence classes. SLdisco outperforms existing sequential, conditional...

Causal discovery and inference: concepts and recent methodological advances

by Peter Spirtes

2022, Applied informatics

Key finding: Reviews constraint-based and structural equation model (SEM) approaches to causal discovery from i.i.d and time series data, emphasizing use of conditional independence tests and structural constraints for identifiability....
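
The conditional independence tests that constraint-based discovery relies on can be illustrated with a small sketch. It is not taken from the reviewed paper: it assumes linear-Gaussian data and uses a partial-correlation test with Fisher's z-transform, which is one common choice among several. The function names (`partial_corr`, `fisher_z_pvalue`) and the simulated chain X -> Y -> Z are hypothetical and chosen only for illustration.

```python
import math
import numpy as np

def partial_corr(a, b, cond):
    """Correlation between a and b after linearly regressing out cond."""
    design = np.column_stack([np.ones(len(a)), cond])
    res_a = a - design @ np.linalg.lstsq(design, a, rcond=None)[0]
    res_b = b - design @ np.linalg.lstsq(design, b, rcond=None)[0]
    return float(res_a @ res_b / math.sqrt((res_a @ res_a) * (res_b @ res_b)))

def fisher_z_pvalue(r, n, k):
    """Two-sided p-value for H0: the (partial) correlation is zero.

    n is the sample size and k the number of conditioning variables.
    """
    r = max(min(r, 0.999999), -0.999999)      # keep atanh finite
    stat = math.sqrt(n - k - 3) * abs(math.atanh(r))
    return math.erfc(stat / math.sqrt(2.0))   # equals 2 * (1 - Phi(stat))

# Simulated chain X -> Y -> Z (an assumption made for this illustration).
rng = np.random.default_rng(0)
n = 2000
x = rng.normal(size=n)
y = 0.8 * x + rng.normal(size=n)
z = 0.8 * y + rng.normal(size=n)

r_marginal = float(np.corrcoef(x, z)[0, 1])
r_given_y = partial_corr(x, z, y)
p_marginal = fisher_z_pvalue(r_marginal, n, k=0)
p_given_y = fisher_z_pvalue(r_given_y, n, k=1)

print(f"corr(X, Z)     = {r_marginal:.3f}, p = {p_marginal:.3g}")
print(f"corr(X, Z | Y) = {r_given_y:.3f}, p = {p_given_y:.3g}")
```

In this simulated chain, X and Z are marginally dependent but independent given Y, which is the kind of pattern a constraint-based algorithm uses to drop the direct X - Z edge.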

Learning Causal Networks from Data: A Survey and a New Algorithm for Recovering Possibilistic Causal Networks

by Ulises Cortés

2015, Ai Communications

Key finding: Surveys methods for learning causal networks represented as graphs from data, focusing on the challenges of soundness, completeness, and scalability. Introduces techniques that incorporate learning heuristics and evaluation...

On Learning Causal Models from Relational Data

by Vasant G Honavar

2016, Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence

Key finding: Extends causal discovery to relational data by introducing relational causal models (RCM) and the RCD-Light algorithm, a constraint-based supervised method that learns causal structures considering adjacency-faithfulness and...

2. What are the methodological advances and applications of causal machine learning in high-dimensional and heterogeneous healthcare data?

This theme explores how causal machine learning (CML) methods address unique challenges of healthcare data — such as multi-modal, high-dimensional, temporal, and confounded observational datasets — to estimate individualized treatment effects and enable actionable, personalized decision-making. It matters because causal predictions, unlike association-based ML, allow clinical decision-support systems (CDSs) to predict responses to interventions robustly, improving precision medicine and overcoming issues like out-of-distribution generalization.

Causal machine learning for healthcare and precision medicine

by Pedro Mata Sanchez

2023, Royal Society Open Science

Key finding: Synthesizes three main CML directions: causal representation learning, causal discovery, and causal reasoning, focusing on their application in healthcare. Demonstrates that CML can handle high-dimensional and unstructured...

Data-driven causal model discovery and personalized prediction in Alzheimer's disease

by Haoyang Zheng

2023, npj Digital Medicine

Key finding: Develops a fully data-driven causal model of Alzheimer's disease biomarker trajectories derived from large multi-center biomarker datasets. The approach integrates causal discovery, sensitivity analysis, and patient-level...

Enhancing Causal Estimation through Unlabeled Offline Data

by Ron Teichner

2024, 2022 7th International Conference on Frontiers of Signal Processing (ICFSP)

Key finding: Proposes a novel framework combining model-based estimation with offline unlabeled datasets to improve causal estimation in dynamic systems under model misspecification and dataset shift. Through theoretical analysis and...

Improved Churn Causal Analysis Through Restrained High-Dimensional Feature Space Effects in Financial Institutions

by David HASON RUDD and

2023, Human-Centric Intelligent Systems

Key finding: Integrates recursive feature elimination, ensemble neural networks, and Bayesian networks to perform causal discovery and prediction in a high-dimensional financial dataset predicting customer churn. Demonstrates identifying...

3. How can causal inference validate models and identify causal direction using observational and two-variable data, particularly under constraints like latent variables or unmeasured confounders?

This theme examines approaches focused on resolving causal directionality and validating causal models from observational data, especially when randomized experiments or multi-variable graph-based methods are unavailable or infeasible. Key issues addressed include overcoming the limitations of conditional independence methods in bivariate settings, utilizing independence of cause and mechanism postulates, and employing influence functions for model validation. This is critical for ensuring robustness and interpretability of causal claims in observational studies.

Validating Causal Inference Models via Influence Functions

by Ahmed Alaa

2024

Key finding: Develops a novel validation procedure using influence functions to estimate the estimation error of causal inference models without access to counterfactual data, enabling cross-validation-like model selection in...

The Role of Instrumental Variables in Causal Inference Based on Independence of Cause and Mechanism

by Pierre-Henri Wuillemin

2022, Entropy

Key finding: Bridges two causal inference paradigms, conditional independence-based graph methods and independence of cause and mechanism methods, by theoretically showing how latent instrumental variables manifest indirectly in causal...

Inferring Causal Direction from Observational Data: A Complexity Approach

by NIKOS NIKOLAOU

2022

Key finding: Proposes simple criteria based on the principle that predicting effect from cause should be algorithmically simpler than the inverse (cause from effect), operationalized via complexity measures related to minimum description...

Causes of Effects: Learning Individual Responses from Population Data

by Scott Mueller

2024, Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence

Key finding: Establishes that bounding individualized causal effects (probabilities of causation) is not identifiable from population experimental or observational data alone. Demonstrates that incorporating structural assumptions via...
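
As background on why population data typically constrain probabilities of causation to an interval rather than a single value, consider the classical bounds on the probability of necessity and sufficiency (PNS) for a binary treatment and a binary outcome. Given only the experimental quantities P(y | do(x)) and P(y | do(x')), PNS lies between max(0, P(y | do(x)) - P(y | do(x'))) and min(P(y | do(x)), 1 - P(y | do(x'))). The sketch below computes that interval. It illustrates the general idea and is not the paper's own method; the function name `pns_bounds` and the example probabilities are hypothetical.

```python
def pns_bounds(p_y_do_x, p_y_do_not_x):
    """Bounds on PNS from experimental data alone (binary treatment and outcome).

    p_y_do_x     : P(y | do(x)),  outcome probability under treatment
    p_y_do_not_x : P(y | do(x')), outcome probability under no treatment
    """
    for p in (p_y_do_x, p_y_do_not_x):
        if not 0.0 <= p <= 1.0:
            raise ValueError("probabilities must lie in [0, 1]")
    lower = max(0.0, p_y_do_x - p_y_do_not_x)
    upper = min(p_y_do_x, 1.0 - p_y_do_not_x)
    return lower, upper

# Hypothetical trial results: 70% respond when treated, 20% when untreated.
lower, upper = pns_bounds(0.7, 0.2)
print(f"PNS lies in [{lower:.2f}, {upper:.2f}]")
```

The width of the interval is what additional assumptions or combined data sources aim to reduce.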

All papers in Machine Learning for Causal Inference

Applying Causal Machine Learning to Spatiotemporal Data Analysis: An Investigation of Opportunities and Challenges

by christian mulomba

2025, IEEE Access

Traditional spatiotemporal data analysis often relies on predictive models that overlook causal relationships, making it difficult to identify true drivers and formulate effective interventions. To bridge this gap, we review causal machine learning (CML) techniques for spatiotemporal data, aiming to provide robust insights into their unique advantages. Our literature review reveals that fewer than 1% of studies in major databases explicitly integrate CML with spatiotemporal analysis. After rigorous screening, we analyze 51 relevant papers, categorizing their contributions into four key areas (totaling 62 methodological approaches due to multi-category papers): 1) causal effect discovery and estimation (32 approaches), 2) prediction accuracy enhancement (19), 3) pattern recognition limitations (10), and 4) interpretability (1). This distribution highlights a critical research gap, particularly in interpretability and comprehensive frameworks. We further examine unique challenges in spatiotemporal data, such as spatial autocorrelation and temporal dependencies, that complicate causal inference but also present opportunities for innovation. Promising approaches include the synergy of spatiotemporal Granger causality and structural equation modeling with spatial lags, which capture complex interdependencies while preserving interpretability. Future directions include developing interpretable causal models, advancing real-time causal inference in dynamic environments, and addressing computational challenges (scalability, efficiency, and complexity-interpretability trade-offs). We also discuss ethical considerations, such as bias mitigation in causal discovery and societal implications of spatiotemporal causal inference. By synthesizing challenges and opportunities, this work advances the application of CML in spatiotemporal analysis, with implications for climate science, economics, epidemiology, and urban planning.
Index terms: Causal machine learning, spatiotemporal data analysis, synergy methods, ethics.

Causal Computation Theory

by WUN J I A SYU

2025

“Causal Computation Theory” introduces a comprehensive framework for structuring computation around causal reasoning, enabling intelligent systems to move beyond pattern recognition into structured, adaptive decision-making under...

ENHANCING CUSTOMER RELATIONSHIP MANAGEMENT WITH ARTIFICIAL INTELLIGENCE AND DEEP LEARNING: A CASE STUDY ANALYSIS

by Mohan Reddy Sareddy

2025, International Journal of Management Research and Reviews

Effective customer relationship management (CRM) techniques are essential in today's business environments for companies looking to maximize client interactions and increase revenue. This paper addresses customer churn, a significant...

Econometric advances in causal inference: The machine learning revolution

by Imran Uddin

2025, GSC Advanced Research and Reviews

This is one of the challenges that new and fast-growing econometric literature is beginning to tackle in addressing causal inference problems with machine learning methods. Yet, empirical economics still has not really made use of the...

Improved Churn Causal Analysis Through Restrained High-Dimensional Feature Space Effects in Financial Institutions

by David Hason

2025, Human-Centric Intelligent Systems

Customer churn describes terminating a relationship with a business or reducing customer engagement over a specific period. Customer acquisition cost can be five to six times that of customer retention, hence investing in customers with...

Preventing Collisions in Self-driving Cars using Probabilistic Logic Counterfactual Reasoning

by Héctor H Avilés Arriaga

2024, Workshop on Causal Discovery (CaDis) 2024

We propose counterfactual reasoning through probabilistic logic twin networks (PLTNs) to prevent collisions in self-driving cars. The basis of a PLTN is a causal Bayesian network (cBN) partially learned from simulated self-driving car...

Fig. 1. Race-like environment considered in this study for our self-driving car (in bright red).

Fig. 2. Predefined locations for other vehicles around the self-driving car (dashed red lines indicate the space the self-driving car occupies on each lane). When the vehicle is on the left (resp. right) lane, only the locations Northwest, Northeast, East and Southeast (resp. Northeast, Northwest, West and Southwest) are meaningful.

Fig. 3. Causal Bayesian network (cBN) proposed in this work.

Table 1. Number of actions in state-action pairs labeled as potential collisions and non-potential collisions in the integrated dataset.

Table 2. Number of unique state-action pairs labeled as non-potential collisions and potential collisions (“safe” and “unsafe”, respectively).

3.2 Learning of the causal Bayesian networks

Table 3. Number of unique state-action pairs as a function of the action used for training and testing each cBN.

are used for training and testing) does not represent a major concern in this setup. This is because the core of the evaluation focuses on comparing conditional probabilities under interventions for a counterfactual model, rather than studying the generalization capabilities of PLTNs to previously unknown data. Despite this, we consider constructing cBNs using training sets of varying sizes for comparison purposes. To achieve this, we randomly selected 1%, 50%, and 100% of the training dataset for structural and parameter learning of the cBNs. Table 3 summarizes the number of unique state-action pairs used for training and testing for each driving action and cBN.

Table 4. Number of groups with a unique minimum probability value and the number of ties for first place (ranging from 2 to 5) across all groups and counterfactual models.

4 Evaluation and results

Table 5. Number of actions selected as the optimal intervention in the three counterfactual models.

Table 6. State-action pairs with 5 tied actions. The first two pairs, from top to bottom, correspond to the counterfactual model trained with 1% of the data, and the last one is the same for the models trained with 50% and 100% of the data (all actions other than the observed are safe).

Improved Churn Causal Analysis Through Restrained High-Dimensional Feature Space Effects in Financial Institutions

by David HASON RUDD and

2023, Human-Centric Intelligent Systems

How causal machine learning can leverage marketing strategies: Assessing and improving the performance of a coupon campaign

by Martin Huber

2022

We apply causal machine learning algorithms to assess the causal effect of a marketing intervention, namely a coupon campaign, on the sales of a retailer. Besides assessing the average impacts of different types of coupons, we also...

Inferring Causal Direction from Observational Data: A Complexity Approach

by NIKOS NIKOLAOU

2022

At the heart of causal structure learning from observational data lies a deceivingly simple question: given two statistically dependent random variables, which one has a causal effect on the other? This is impossible to answer using...

Inferring Causal Direction from Observational Data: A Complexity Approach

by NIKOS NIKOLAOU

2022, Machine Learning for Pharma and Healthcare Applications ECML PKDD 2020 Workshop (PharML 2020)

Inferring Causal Direction from Observational Data: A Complexity Approach

by Nikolaos (Nikos) Nikolaou

2022

Evaluating (weighted) dynamic treatment effects by double machine learning

by Martin Huber

2020

We consider evaluating the causal effects of dynamic treatments, i.e. of multiple treatment sequences in various periods, based on double machine learning to control for observed, time-varying covariates in a data-driven way under a...
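
The core idea of double machine learning can be sketched in its simplest form, a single-period partially linear model with cross-fitting. This is a simplified illustration and not the dynamic-treatment estimator of the paper: it assumes one continuous treatment, a constant effect, and ridge regression as the nuisance learner. All names (`ridge_fit_predict`, `dml_plr`) and the simulated data are hypothetical.

```python
import numpy as np

def ridge_fit_predict(X_train, y_train, X_test, lam=1.0):
    """Fit ridge regression (unpenalized intercept) and predict on X_test."""
    A = np.column_stack([np.ones(len(X_train)), X_train])
    B = np.column_stack([np.ones(len(X_test)), X_test])
    penalty = lam * np.eye(A.shape[1])
    penalty[0, 0] = 0.0                       # do not shrink the intercept
    beta = np.linalg.solve(A.T @ A + penalty, A.T @ y_train)
    return B @ beta

def dml_plr(y, d, X, n_folds=5, seed=0):
    """Cross-fitted residual-on-residual estimate of the treatment effect."""
    n = len(y)
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(n), n_folds)
    y_res = np.empty(n)
    d_res = np.empty(n)
    for test_idx in folds:
        train = np.ones(n, dtype=bool)
        train[test_idx] = False
        # Nuisance predictions come from models fitted on the other folds.
        y_res[test_idx] = y[test_idx] - ridge_fit_predict(X[train], y[train], X[test_idx])
        d_res[test_idx] = d[test_idx] - ridge_fit_predict(X[train], d[train], X[test_idx])
    return float(d_res @ y_res / (d_res @ d_res))

# Simulated confounded data with a true effect of 2.0 (illustrative only).
rng = np.random.default_rng(1)
n, p = 4000, 10
X = rng.normal(size=(n, p))
d = X[:, 0] + 0.5 * X[:, 1] + rng.normal(size=n)
y = 2.0 * d + X[:, 0] - X[:, 2] + rng.normal(size=n)

theta_naive = float(np.cov(d, y)[0, 1] / np.var(d, ddof=1))  # ignores X
theta_dml = dml_plr(y, d, X)
print(f"naive slope: {theta_naive:.3f}, DML estimate: {theta_dml:.3f}")
```

Because the covariate X[:, 0] drives both treatment and outcome, the naive slope is biased upward, while the cross-fitted estimate should land close to the simulated effect. In practice the ridge learner would be replaced by whichever flexible learner suits the data.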

Inferring Causal Direction from Observational Data: A Complexity Approach

by Nikolaos (Nikos) Nikolaou

2020, Machine Learning for Pharma and Healthcare Applications ECML PKDD 2020 Workshop (PharML 2020)
