Key research themes
1. How can machine learning methods be integrated with causal discovery algorithms to improve causal model identification and estimation from observational data?
This theme investigates the development and application of machine learning (ML) approaches to enhance causal discovery from purely observational data, addressing challenges of small samples, complex high-dimensional data, and model misspecifications. It is crucial because traditional causal discovery methods often rely on restrictive assumptions or experiments that are infeasible, and ML offers novel tools to deal with these limitations by learning flexible, data-driven representations and causal structures that can generalize beyond mere associations.
2. What are the methodological advances and applications of causal machine learning in high-dimensional and heterogeneous healthcare data?
This theme explores how causal machine learning (CML) methods address unique challenges of healthcare data — such as multi-modal, high-dimensional, temporal, and confounded observational datasets — to estimate individualized treatment effects and enable actionable, personalized decision-making. It matters because causal predictions, unlike association-based ML, allow clinical decision-support systems (CDSs) to predict responses to interventions robustly, improving precision medicine and overcoming issues like out-of-distribution generalization.
3. How can causal inference validate models and identify causal direction using observational and two-variable data, particularly under constraints like latent variables or unmeasured confounders?
This theme examines approaches focused on resolving causal directionality and validating causal models from observational data, especially when randomized experiments or multi-variable graph-based methods are unavailable or infeasible. Key issues addressed include overcoming the limitations of conditional independence methods in bivariate settings, utilizing independence of cause and mechanism postulates, and employing influence functions for model validation. This is critical for ensuring robustness and interpretability of causal claims in observational studies.