Academia.eduAcademia.edu

Outline

When are graphical causal models not good models?

2011, Causality in the Sciences

https://doi.org/10.1093/ACPROF:OSO/9780199574131.003.0027

Abstract

The principle of Kolmogorov Minimal Sufficient Statistic (KMSS) states that a model should capture all regularities of the data. The conditional independencies following from the causal structure of the system are the regularities incorporated in a graphical causal model. We prove that for joint probability distributions, the KMSS is described by the Directed Acyclic Graph (DAG) of the minimal Bayesian network if this results in an incompressible description. We prove that a Bayesian network that is the KMSS is faithful. In that case it can be learned from observations and modularity is the most plausible hypothesis. From modularity follows the ability to predict the effect of interventions. On the other hand, if the minimal Bayesian network is compressible, and thus not the KMSS, the above implications cannot be guaranteed. When the non-minimality of the description is due to the compressibility of an individual Conditional Probability Distribution (CPD), the true causal model is an element of the set of minimal Bayesian networks and modularity is still plausible. Faithfulness cannot be guaranteed though. When the concatenation of the descriptions of the CPDs is compressible, the true causal model is not necessarily an element of the set of minimal Bayesian networks. Also modularity may become implausible. This suggests that either there is a kind of meta-mechanism governing some of the mechanisms or either a single mechanism responsible for setting the state of multiple variables.

References (30)

  1. Craig Boutilier, Nir Friedman, Moises Goldszmidt, and Daphne Koller. Context-specific indepen- dence in Bayesian networks. In Uncertainty in Artificial Intelligence, pages 115-123, 1996.
  2. Nancy Cartwright. What is wrong with Bayes nets? The Monist, pages 242-264, 2001.
  3. G.F. Cooper and E. Herskovits. A Bayesian method for the induction of probabilistic networks from data. Machine Learning, 9:309-347, 1992.
  4. Thomas M. Cover and Joy A. Thomas. Elements of Information Theory. John Wiley & Sons, Inc., 1991.
  5. D.A. Freedman and P. Humphreys. Are there algorithms that discover causal structure? Synthese, 121:2954, 1999.
  6. N. Friedman and M. Goldszmidt. Learning bayesian networks with local structure. In In Proc. of the 12th Conf. on Uncertainty in Artificial Intelligence, 1996.
  7. Péter Gács, J. Tromp, and Paul M. B. Vitányi. Algorithmic statistics. IEEE Trans. Inform. Theory, 47(6):2443-2463, 2001.
  8. P. Grünwald. The Minimum Description Length Principle and Reasoning under Uncertainty, ILLC Dissertation series 1998-03. PhD thesis, University of Amsterdam, 1998.
  9. Daniel M. Hausman and James Woodward. Independence, invariance and the causal Markov con- dition. Bristish Journal For the Philosophy Of Science, 50(4):521-583, 1999.
  10. D. Heckerman, D. Geiger, and D.M. Chickering. Learning Bayesian networks: the combination of knowledge and statistical data. Technical Report MSR-TR-94-09, Microsoft Research, 1994.
  11. E.H. Herskovits. Computer-Based Probabilistic Network Construction. PhD thesis, Medical infor- mation sciences, Stanford University, CA, 1991.
  12. G. Hesslow. Discussion: Two notes on the probabilistic approach to causality. Philosophy of Science, 43:290-2, 1976.
  13. Daphne Koller and Avi Pfeffer. Object-oriented Bayesian networks. In Proceedings of the Thir- teenth Conference on Uncertainty in Artificial Intelligence (UAI-97), pages 302-313, 1997.
  14. Kevin B. Korb and Erik Nyberg. The power of intervention. Minds and Machines, 16(3):289-302, 2006.
  15. Jan Lemeire. Learning Causal Models of Multivariate Systems and the Value of it for the Perfor- mance Modeling of Computer Programs. PhD thesis, Vrije Universiteit Brussel, 2007.
  16. Ming Li and Paul M. B. Vitányi. An Introduction to Kolmogorov Complexity and Its Applications. Springer Verlag, 1997.
  17. Anders L. Madsen and Bruce D'Ambrosio. A factorized representation of independence of causal influence and lazy propagation. Int. J. Uncertain. Fuzziness Knowl.-Based Syst., 8(2):151-165, 2000.
  18. Judea Pearl. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. San Mateo, CA, Morgan Kaufman Publishers, 1988.
  19. Judea Pearl. Causality. Models, Reasoning, and Inference. Cambridge University Press, 2000.
  20. J. Rissanen. Modeling by shortest data description. Automatica, 14:465-471, 1978.
  21. P. Spirtes, C. Meek, and T. Richardson. Causal inference in the presence of latent variables and selection bias. In Proc. of the 11th Conference on Uncertainty in Artificial Intelligence, ed P. Besnard and S. Hanks, pages 499-506. Morgan Kaufmann, 1995.
  22. Peter Spirtes, Clark Glymour, and Richard Scheines. Causation, Prediction, and Search. Springer Verlag, 2nd edition, 1993.
  23. Wolfgang Spohn. Bayesian nets are all there is to causal dependence. In In Stochastic Causality, Maria Carla Galaviotti, Eds. CSLI Lecture Notes, 2001.
  24. Milan Studeny. On non-graphical description of models of conditional independence structure. In HSSS Workshop on Stochastic Systems for Individual Behaviours, Louvain la Neuve, Belgium, January 2001.
  25. J. Suzuki. Learning Bayesian belief networks based on the MDL principle: An efficient algorithm using the branch and bound technique. In Procs of the International Conf. on Machine Learning, Bally, Italy, 1996.
  26. Jin Tian and Judea Pearl. A general identification condition for causal effects. In AAAI/IAAI, pages 567-573, 2002.
  27. Paul M. B. Vitányi. Meaningful information. In Prosenjit Bose and Pat Morin, editors, ISAAC, volume 2518 of Lecture Notes in Computer Science, pages 588-599. Springer, 2002.
  28. Chris S. Wallace and David L. Dowe. An information measure for classification. Computer Journal, 11(2):185-194, 1968.
  29. Jon Williamson. Bayesian Nets And Causality: Philosophical And Computational Foundations. Oxford University Press, 2005.
  30. Yang Xiang, S. K. Wong, and N. Cercone. Critical remarks on single link search in learning belief networks. In Proceedings of the 12th Annual Conference on Uncertainty in Artificial Intelligence (UAI-96), pages 564-571, San Francisco, CA, 1996. Morgan Kaufmann Publishers.