Academia.eduAcademia.edu

Outline

First-order Bayesian logic

2005

Abstract

Uncertainty is a fundamental and irreducible aspect of our knowledge about the world. Until recently, classical first-order logic has reigned as the de facto standard logical foundation for artificial intelligence. The lack of a built-in, semantically grounded capability for reasoning under uncertainty renders classical first-order logic inadequate for many important classes of problems. General-purpose languages are beginning to emerge for which the fundamental logical basis is probability. Increasingly expressive probabilistic languages demand a theoretical foundation that fully integrates classical first-order logic and probability. In first-order Bayesian logic (FOBL), probability distributions are defined over interpretations of classical first-order axiom systems. Predicates and functions of a classical first-order theory correspond to a random variables in the corresponding first-order Bayesian theory. This is a natural correspondence, given that random variables are formalized in mathematical statistics as measurable functions on a probability space. A formal system called Multi-Entity Bayesian Networks (MEBN) is presented for composing distributions on interpretations by instantiating and combining parameterized fragments of directed graphical models. A construction is given of a MEBN theory that assigns a non-zero probability to any satisfiable sentence in classical first-order logic. By conditioning this distribution on consistent sets of sentences, FOBL can represent a probability distribution over interpretations of any finitely axiomatizable first-order theory, as well as over interpretations of infinite axiom sets when a limiting distribution exists. FOBL is inherently open, having the ability to incorporate new axioms into existing theories, and to modify probabilities in the light of evidence. Bayesian inference provides both a proof theory for combining prior knowledge with observations, and a learning theory for refining a representation as evidence accrues. The results of this paper provide a logical foundation for the rapidly evolving literature on first-order Bayesian knowledge representation, and point the way toward Bayesian languages suitable for generalpurpose knowledge representation and computing. Because FOBL contains classical first-order logic as a deterministic subset, it is a natural candidate as a universal representation for integrating domain ontologies expressed in languages based on classical first-order logic or subsets thereof.

References (81)

  1. Alghamdi, G., Laskey, K.B., Wright, E., Barbara, D., and Chang, K.-C., 2005. "Modeling Insider Behavior Using Multi-Entity Bayesian Networks." 10th Annual Command and Control Research and Technology Symposium.
  2. Bacchus, F., 1990. "Representing and Reasoning with Probabilistic Knowledge: A Logical Approach to Probabilities." Boston, MA, MIT Press.
  3. Bacchus, F., Grove, A., Halpern, J.Y., and Koller, D., 1997. "From statistical knowledge bases to degrees of belief." Artificial Intelligence, Vol. 87: 75-143
  4. Bangsø, O., Langseth, H., and Nielsen, T., 2001. "Structural Learning in Object Oriented Domains." FLAIRS.
  5. Bangsø, O. and Wuillemin, P.H., 2000. Object Oriented Bayesian Networks: A Framework for Topdown Specification of Large Bayesian Networks and Repetitive Structures. Technical Report CIT-87.2-00-obphw1. Aalborg: Department of Computer Science, Aalborg University Billingsley, P., 1995. Probability and Measure. New York, NY: Wiley.
  6. Binford, T. and Levitt, T.S., 2003. "Evidential reasoning for object recognition." IEEE Transactions on Pattern Analysis and Machine Intelligence, 25(7), pp. 837-51.
  7. Boutilier, C., Dean, T., and Hanks, S., 1999. "Decision-Theoretic Planning: Structural Assumptions and Computational Leverage." Journal of Artificial Intelligence Research, 11, pp. 1-94.
  8. Brachman, R.J., Fikes, R.E., and Levesque, H.J., 1983. "KRYPTON: A Functional Approach to Knowledge Representation." IEEE Computer Society, 16(10), pp. 67-73.
  9. Buntine, W.L., 1994. "Operations for Learning with Graphical Models." Journal of Artificial Intelligence Research, 2, pp. 159-225.
  10. Charniak, E. and Goldman, R.P., 1993. "A Bayesian Model of Plan Recognition." Artificial Intelligence, 64, pp. 53-79.
  11. Costa, P., 2005. Bayesian Semantics for the Semantic Web. Doctoral Dissertation, Fairfax, VA: School of Information Technology and Engineering, George Mason University. http://hdl.handle.net/1920/455.
  12. Costa, P., Laskey, K.B., Fung, F., Pool, M., Takikawa, M., and Wright, E., 2005. "MEBN Logic: A Key Enabler for Network-Centric Warfare." 10th Annual Command and Control Research and Technology Symposium.
  13. Cowell, R.G., 1999. Probabilistic Networks and Expert Systems. Berlin: Springer-Verlag.
  14. d'Ambrosio, B., 1991. "Local expression languages for probabilistic dependency." Uncertainty in Artificial Intelligence: Proceedings of the Seventh Conference, San Mateo, California, Morgan Kaufmann.
  15. D'Ambrosio, B., 1999. "Inference in Bayesian Networks." AI Magazine, 20(2), pp. 21-36.
  16. d'Ambrosio, B., Takikawa, M., Fitzgerald, J., Upper, D., and Mahoney, S.M., 2001. "Security situation assessment and response evaluation (SSARE)." DARPA Information Survivability Conference & Exposition II, IEEE Computer Society.
  17. Davis, E., 1990. Representations of Commonsense Knowledge. San Mateo, California: Morgan Kaufmann.
  18. Dawid, A.P., 1984. "Statistical Theory, the Prequential Approach." Journal of the Royal Statistical Society, 147, pp. 278-92.
  19. de Finetti, B., 1974-75. Theory of Probability: A Critical Introductory Treatment. New York: Wiley.
  20. De Raedt, L. and Kersting, K., 2003. "Probabilistic Logic Learning." ACM-SIGKDD Explorations: Special Issue on Multi-Relational Data Mining, 5(1), pp. 31-48.
  21. DeGroot, M.H. and Schervish, M.J., 2002. Probability and Statistics. Boston, Massachusetts: Addison Wesley. 2/27/06
  22. Dybowski, R., Laskey, K.B., Myers, J.W., and Parsons, S., 2003. "Introduction to the Special Issue on the Fusion of Domain Knowledge with Data for Decision Support." Journal of Machine Learning Research, 4(July), pp. 293-94.
  23. Elliott, R.J., Aggoun, L., and Moore, J.B., 1995. Hidden Markov Models: Estimation and Control. Berlin: Springer-Verlag.
  24. Enderton, H.B., 2001. A Mathematical Introduction to Logic: Harcourt Academic Press.
  25. Frege, G., 1967. Begriffsschrift. Cambridge, MA: Harvard University Press.
  26. Fung, F., Laskey, K.B., Pool, M., Takikawa, M., and Wright, E., 2005. "PLASMA: Combining Predicate Logic and Probability for Information Fusion and Decision Support." AAAI Spring Symposium on Decision Support in a Changing World.
  27. Geiger, D. and Heckerman, D., 1991. "Advances in Probabilistic Reasoning." Uncertainty in Artificial Intelligence: Proceedings of the Seventh Conference, San Mateo, CA, Morgan Kaufmann Publishers.
  28. Genesereth, M., R. and Nilsson, N.J., 1987. Logical Foundations of Artificial Intelligence. San Mateo, California: Morgan Kaufmann Publishers.
  29. Getoor, L., Friedman, N., Koller, D., and Pfeffer, A., 2001. "Learning Probabilistic Relational Models," in Relational Data Mining. Saso Dzeroski and Nada Lavrac (ed.), Berlin: Springer- Verlag.
  30. Getoor, L., Koller, D., Taskar, B., and Friedman, N., 2000. "Learning Probabilistic Relational Models with Structural Uncertainty." ICML-2000 Workshop on Attribute-Value and Relational Learning:Crossing the Boundaries, Standford, California.
  31. Ghahramani, Z., 1998. "Learning Dynamic Bayesian Networks," in Adaptive Processing of Sequences and Data Structures: Lecture Notes in Artificial Intelligence. C.L. Giles and M. Gori (eds.), Berlin: Springer-Verlag, pp. 168-97.
  32. Gilks, W., Thomas, A., and Spiegelhalter, D.J., 1994. "A language and program for complex Bayesian modeling." The Statistician, 43, pp. 169-78.
  33. Glesner, S. and Koller, D., 1995. "Constructing Flexible Dynamic Belief Networks from First- Order Probabilistic Knowledge Bases." ECSQARU, pp. 217-26.
  34. Grenander, U., 1996. Elements of Pattern Theory. Baltimore, MD: Johns Hopkins University Press.
  35. Gruber, T.R., 1993. "A Translation Approach to Portable Ontology Specifications." Knowledge Acquisition, 5(2), pp. 199-220.
  36. Halpern, J.Y., 1991. "An Analysis of First-Order Logics of Probability." Artificial Intelligence, 46(May), pp. 311-50.
  37. Heckerman, D., Geiger, D., and Chickering, D.M., 1995. "Learning Bayesian Networks: The Combination of Knowledge and Statistical Data." Machine Learning, (20), pp. 197-243.
  38. Heckerman, D., Meek, C., and Koller, D., 2004. Probabilistic Models for Relational Data. MSR- TR-2004-30. Redmond, WA: Microsoft Corporation
  39. Howson, C. and Urbach, P., 1993. Scientific Reasoning: The Bayesian Approach. Chicago, IL: Open Court.
  40. IET, 2004. "Quddity*Suite Technical Guide." Arlington, VA: Information Extraction and Transport, Inc.
  41. Jaeger, M., 1998. "Reasoning About Infinite Random Structures with Relational Bayesian Networks." Proceedings of the 6th International Conference (KR '98).
  42. Jaeger, M., 2001. "Complex Probabilistic Modeling with Recursive Relational Bayesian Networks." Annals of Mathematics and Artificial Intelligence, 32, pp. 179-220.
  43. Jaynes, E.T., 2003. Probability Theory: The Logic of Science. Cambridge, UK: Cambridge University Press.
  44. Jensen, F.V., 2001. Bayesian Networks and Decision Graphs. Berlin: Springer-Verlag. 2/27/06
  45. Jensen, F.V., Chamberlain, B., Nordahl, T., and Jensen, F., 1990. "Analysis in HUGIN of Data Conflict." Uncertainty in Artificial Intelligence: Proceedings of the Sixth Conference, New York, NY, Elsevier.
  46. Kersting, K. and De Raedt, L., 2001. "Adaptive Bayesian Logic Programs." Proceedings of the Eleventh International Conference on Inductive Logic Programming (ILP 2001), Springer- Verlag.
  47. Koller, D. and Pfeffer, A., 1997. "Object-Oriented Bayesian Networks." Uncertainty in Artificial Intelligence: Proceedings of the Thirteenth Conference, San Francisco, CA, Morgan Kaufmann.
  48. Langseth, H. and Nielsen, T., 2003. "Fusion of Domain Knowledge with Data for Structured Learning in Object-Oriented Domains." Journal of Machine Learning Research, 4, pp. 339- 68.
  49. Laskey, K.B., 1991. "Conflict and Surprise: Heuristics for Model Revision." Uncertainty in Artificial Intelligence: Proceedings of the Seventh Conference, San Mateo, CA, Morgan Kaufmann.
  50. Laskey, K.B., 2006. MEBN: A Logic for Open-World Probabilistic Reasoning. C4I Center Technical Report C4I06-01. Fairfax, VA: George Mason University Laskey, K.B. and Costa, P., 2005. "Of Klingons and Starships: Bayesian Logic for the 23rd Century." Uncertainty in Artificial Intelligence: Proceedings of the Twenty-first Conference, Arlington, VA, AUAI Press.
  51. Laskey, K.B., D'Ambrosio, B., Levitt, T.S., and Mahoney, S.M., 2000. "Limited Rationality in Action: Decision Support for Military Situation Assessment." Minds and Machines, Vol. 10: 53-77
  52. Laskey, K.B. and Mahoney, S.M., 1997. "Network Fragments: Representing Knowledge for Constructing Probabilistic Models." Uncertainty in Artificial Intelligence: Proceedings of the Thirteenth Conference, San Mateo, CA, Morgan Kaufmann.
  53. Laskey, K.B., Mahoney, S.M., and Wright, E., 2001. "Hypothesis Management in Situation- Specific Network Construction." Uncertainty in Artificial Intelligence: Proceedings of the Seventeenth Conference, San Mateo, CA, Morgan Kaufman.
  54. Lauritzen, S., 1996. Graphical Models. Oxford: Oxford Science Publications.
  55. Levitt, T.S., Winter, C.L., Turner, C., J., Chestek, R.A., Ettinger, G.J., and Sayre, S.M., 1995. "Bayesian Inference-Based Fusion of Radar Imagery, Military Forces and Tactical Terrain Models in the Image Exploitation System/Balanced Technology Initiative." International Journal of Human-Computer Studies, 42.
  56. Lin, Y. and Druzdzel, M.J., 1997. "Computational Advantages of Relevance Reasoning in Bayesian Belief Networks." Uncertainty in Artificial Intelligence: Proceedings of the Thirteenth Conference, San Francisco, CA, Morgan Kaufmann.
  57. Mahoney, S.M., 1999. Network Fragments. Faifax, VA: School of Information Technology and Engineering, George Mason University.
  58. Mahoney, S.M. and Laskey, K.B., 1998. "Constructing Situation Specific Networks." Uncertainty in Artificial Intelligence: Proceedings of the Fourteenth Conference, San Mateo, CA, Morgan Kaufmann.
  59. Mahoney, S.M. and Laskey, K.B., 1999. "Representing and Combining Partially Specified Conditional Probability Tables." Uncertainty in Artificial Intelligence: Proceedings of the Fifteenth Conference, San Mateo, CA, Morgan Kaufmann.
  60. Milch, B., Marthi, B., Russell, S., Sontag, D., Ong, D.L., and Kolobov, A., 2005. "Blog: Probabilistic Models with Unknown Objects." Proceedings of the Nineteenth Joint Conference on Artificial Intelligence.
  61. Murphy, K., 1998. Dynamic Bayesian Networks: Representation, Inference and Learning. Berkeley, CA: Computer Science Division, University of California. 2/27/06
  62. Natarajan, S., Tadepalli, P., Altendorf, E., Dietterich, T.G., Fern, A., and Restificar, A., 2005. "Learning First-Order Probabilistic Models with Combining Rules." Proceedings of the 22nd International Conference on Machine Learning.
  63. Neapolitan, R.E., 2003. Learning Bayesian Networks. New York: Prentice Hall.
  64. Ngo, L. and Haddawy, P., 1997. "Answering Queries from Context-Sensitive Probabilistic Knowledge Bases." Theoretical Computer Science, 171, pp. 147-77.
  65. Oakes, D., 1986. "Self-Calibrating Priors Do Not Exist." Journal of the American Statistical Association, 80(390), pp. 339.
  66. Pearl, J., 1988. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. San Mateo, CA: Morgan Kaufmann.
  67. Peirce, C.S., 1885. "On the Algebra of Logic." American Journal of Mathematics, 7, pp. 180-202.
  68. Pfeffer, A., 2000. Probabilistic Reasoning for Complex Systems. Stanford, CA, Stanford University.
  69. Pfeffer, A., 2001. "IBAL: An Integrated Bayesian Agent Language." Joint Conference on Artificial Intelligence (IJCAI).
  70. Poole, D., 1993. "Probabilistic Horn Abduction and Bayesian Networks." Artificial Intelligence, 64(1), pp. 81-129.
  71. Poole, D., 2003. "First-Order Probabilistic Inference." Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence.
  72. Russell, S. and Norvig, P., 2002. Artificial Intelligence: A Modern Approach. Upper Saddle River, NJ: Prentice-Hall.
  73. Sato, T., 1998. "Modeling Scientific Theories as PRISM Programs." ECAI98 Workshop on Machine Discovery.
  74. Savage, L.J., 1954. The Foundations of Statistics. New York: Wiley.
  75. Sowa, J.F., 2000. "Knowledge Representation: Logical, Philosophical and Computational Foundations," Brooks-Cole Publishers.
  76. Spiegelhalter, D.J., Thomas, A., and Best, N., 1996. "Computation on Graphical Models." Bayesian Statistics, 5, pp. 407-25.
  77. Stoll, R.P., 1963. Set Theory and Logic. New York: Dover Publications Inc.
  78. Stone, L.D., Barlow, C.A., and Corwin, T.L., 1999. Bayesian Multiple Target Tracking. Boston, MA: Artech House.
  79. Tarski, A., 1944. "The Semantical Concept of Truth and the Foundations of Semantics." Philosophy and Phenomenological Research, 4.
  80. Wellman, M.P., Breese, J.S., and Goldman, R.P., 1992. "From knowledge bases to decision models." The Knoweldge Engineering Review, 7(1), pp. 35-53.
  81. Whittaker, J., 1990. Graphical Models in Applied Multivariate Statistics. Chichester: John Wiley & Sons.