First-order Bayesian logic
2005
Abstract
Uncertainty is a fundamental and irreducible aspect of our knowledge about the world. Until recently, classical first-order logic has reigned as the de facto standard logical foundation for artificial intelligence. The lack of a built-in, semantically grounded capability for reasoning under uncertainty renders classical first-order logic inadequate for many important classes of problems. General-purpose languages are beginning to emerge for which the fundamental logical basis is probability. Increasingly expressive probabilistic languages demand a theoretical foundation that fully integrates classical first-order logic and probability. In first-order Bayesian logic (FOBL), probability distributions are defined over interpretations of classical first-order axiom systems. Predicates and functions of a classical first-order theory correspond to a random variables in the corresponding first-order Bayesian theory. This is a natural correspondence, given that random variables are formalized in mathematical statistics as measurable functions on a probability space. A formal system called Multi-Entity Bayesian Networks (MEBN) is presented for composing distributions on interpretations by instantiating and combining parameterized fragments of directed graphical models. A construction is given of a MEBN theory that assigns a non-zero probability to any satisfiable sentence in classical first-order logic. By conditioning this distribution on consistent sets of sentences, FOBL can represent a probability distribution over interpretations of any finitely axiomatizable first-order theory, as well as over interpretations of infinite axiom sets when a limiting distribution exists. FOBL is inherently open, having the ability to incorporate new axioms into existing theories, and to modify probabilities in the light of evidence. Bayesian inference provides both a proof theory for combining prior knowledge with observations, and a learning theory for refining a representation as evidence accrues. The results of this paper provide a logical foundation for the rapidly evolving literature on first-order Bayesian knowledge representation, and point the way toward Bayesian languages suitable for generalpurpose knowledge representation and computing. Because FOBL contains classical first-order logic as a deterministic subset, it is a natural candidate as a universal representation for integrating domain ontologies expressed in languages based on classical first-order logic or subsets thereof.
References (81)
- Alghamdi, G., Laskey, K.B., Wright, E., Barbara, D., and Chang, K.-C., 2005. "Modeling Insider Behavior Using Multi-Entity Bayesian Networks." 10th Annual Command and Control Research and Technology Symposium.
- Bacchus, F., 1990. "Representing and Reasoning with Probabilistic Knowledge: A Logical Approach to Probabilities." Boston, MA, MIT Press.
- Bacchus, F., Grove, A., Halpern, J.Y., and Koller, D., 1997. "From statistical knowledge bases to degrees of belief." Artificial Intelligence, Vol. 87: 75-143
- Bangsø, O., Langseth, H., and Nielsen, T., 2001. "Structural Learning in Object Oriented Domains." FLAIRS.
- Bangsø, O. and Wuillemin, P.H., 2000. Object Oriented Bayesian Networks: A Framework for Topdown Specification of Large Bayesian Networks and Repetitive Structures. Technical Report CIT-87.2-00-obphw1. Aalborg: Department of Computer Science, Aalborg University Billingsley, P., 1995. Probability and Measure. New York, NY: Wiley.
- Binford, T. and Levitt, T.S., 2003. "Evidential reasoning for object recognition." IEEE Transactions on Pattern Analysis and Machine Intelligence, 25(7), pp. 837-51.
- Boutilier, C., Dean, T., and Hanks, S., 1999. "Decision-Theoretic Planning: Structural Assumptions and Computational Leverage." Journal of Artificial Intelligence Research, 11, pp. 1-94.
- Brachman, R.J., Fikes, R.E., and Levesque, H.J., 1983. "KRYPTON: A Functional Approach to Knowledge Representation." IEEE Computer Society, 16(10), pp. 67-73.
- Buntine, W.L., 1994. "Operations for Learning with Graphical Models." Journal of Artificial Intelligence Research, 2, pp. 159-225.
- Charniak, E. and Goldman, R.P., 1993. "A Bayesian Model of Plan Recognition." Artificial Intelligence, 64, pp. 53-79.
- Costa, P., 2005. Bayesian Semantics for the Semantic Web. Doctoral Dissertation, Fairfax, VA: School of Information Technology and Engineering, George Mason University. http://hdl.handle.net/1920/455.
- Costa, P., Laskey, K.B., Fung, F., Pool, M., Takikawa, M., and Wright, E., 2005. "MEBN Logic: A Key Enabler for Network-Centric Warfare." 10th Annual Command and Control Research and Technology Symposium.
- Cowell, R.G., 1999. Probabilistic Networks and Expert Systems. Berlin: Springer-Verlag.
- d'Ambrosio, B., 1991. "Local expression languages for probabilistic dependency." Uncertainty in Artificial Intelligence: Proceedings of the Seventh Conference, San Mateo, California, Morgan Kaufmann.
- D'Ambrosio, B., 1999. "Inference in Bayesian Networks." AI Magazine, 20(2), pp. 21-36.
- d'Ambrosio, B., Takikawa, M., Fitzgerald, J., Upper, D., and Mahoney, S.M., 2001. "Security situation assessment and response evaluation (SSARE)." DARPA Information Survivability Conference & Exposition II, IEEE Computer Society.
- Davis, E., 1990. Representations of Commonsense Knowledge. San Mateo, California: Morgan Kaufmann.
- Dawid, A.P., 1984. "Statistical Theory, the Prequential Approach." Journal of the Royal Statistical Society, 147, pp. 278-92.
- de Finetti, B., 1974-75. Theory of Probability: A Critical Introductory Treatment. New York: Wiley.
- De Raedt, L. and Kersting, K., 2003. "Probabilistic Logic Learning." ACM-SIGKDD Explorations: Special Issue on Multi-Relational Data Mining, 5(1), pp. 31-48.
- DeGroot, M.H. and Schervish, M.J., 2002. Probability and Statistics. Boston, Massachusetts: Addison Wesley. 2/27/06
- Dybowski, R., Laskey, K.B., Myers, J.W., and Parsons, S., 2003. "Introduction to the Special Issue on the Fusion of Domain Knowledge with Data for Decision Support." Journal of Machine Learning Research, 4(July), pp. 293-94.
- Elliott, R.J., Aggoun, L., and Moore, J.B., 1995. Hidden Markov Models: Estimation and Control. Berlin: Springer-Verlag.
- Enderton, H.B., 2001. A Mathematical Introduction to Logic: Harcourt Academic Press.
- Frege, G., 1967. Begriffsschrift. Cambridge, MA: Harvard University Press.
- Fung, F., Laskey, K.B., Pool, M., Takikawa, M., and Wright, E., 2005. "PLASMA: Combining Predicate Logic and Probability for Information Fusion and Decision Support." AAAI Spring Symposium on Decision Support in a Changing World.
- Geiger, D. and Heckerman, D., 1991. "Advances in Probabilistic Reasoning." Uncertainty in Artificial Intelligence: Proceedings of the Seventh Conference, San Mateo, CA, Morgan Kaufmann Publishers.
- Genesereth, M., R. and Nilsson, N.J., 1987. Logical Foundations of Artificial Intelligence. San Mateo, California: Morgan Kaufmann Publishers.
- Getoor, L., Friedman, N., Koller, D., and Pfeffer, A., 2001. "Learning Probabilistic Relational Models," in Relational Data Mining. Saso Dzeroski and Nada Lavrac (ed.), Berlin: Springer- Verlag.
- Getoor, L., Koller, D., Taskar, B., and Friedman, N., 2000. "Learning Probabilistic Relational Models with Structural Uncertainty." ICML-2000 Workshop on Attribute-Value and Relational Learning:Crossing the Boundaries, Standford, California.
- Ghahramani, Z., 1998. "Learning Dynamic Bayesian Networks," in Adaptive Processing of Sequences and Data Structures: Lecture Notes in Artificial Intelligence. C.L. Giles and M. Gori (eds.), Berlin: Springer-Verlag, pp. 168-97.
- Gilks, W., Thomas, A., and Spiegelhalter, D.J., 1994. "A language and program for complex Bayesian modeling." The Statistician, 43, pp. 169-78.
- Glesner, S. and Koller, D., 1995. "Constructing Flexible Dynamic Belief Networks from First- Order Probabilistic Knowledge Bases." ECSQARU, pp. 217-26.
- Grenander, U., 1996. Elements of Pattern Theory. Baltimore, MD: Johns Hopkins University Press.
- Gruber, T.R., 1993. "A Translation Approach to Portable Ontology Specifications." Knowledge Acquisition, 5(2), pp. 199-220.
- Halpern, J.Y., 1991. "An Analysis of First-Order Logics of Probability." Artificial Intelligence, 46(May), pp. 311-50.
- Heckerman, D., Geiger, D., and Chickering, D.M., 1995. "Learning Bayesian Networks: The Combination of Knowledge and Statistical Data." Machine Learning, (20), pp. 197-243.
- Heckerman, D., Meek, C., and Koller, D., 2004. Probabilistic Models for Relational Data. MSR- TR-2004-30. Redmond, WA: Microsoft Corporation
- Howson, C. and Urbach, P., 1993. Scientific Reasoning: The Bayesian Approach. Chicago, IL: Open Court.
- IET, 2004. "Quddity*Suite Technical Guide." Arlington, VA: Information Extraction and Transport, Inc.
- Jaeger, M., 1998. "Reasoning About Infinite Random Structures with Relational Bayesian Networks." Proceedings of the 6th International Conference (KR '98).
- Jaeger, M., 2001. "Complex Probabilistic Modeling with Recursive Relational Bayesian Networks." Annals of Mathematics and Artificial Intelligence, 32, pp. 179-220.
- Jaynes, E.T., 2003. Probability Theory: The Logic of Science. Cambridge, UK: Cambridge University Press.
- Jensen, F.V., 2001. Bayesian Networks and Decision Graphs. Berlin: Springer-Verlag. 2/27/06
- Jensen, F.V., Chamberlain, B., Nordahl, T., and Jensen, F., 1990. "Analysis in HUGIN of Data Conflict." Uncertainty in Artificial Intelligence: Proceedings of the Sixth Conference, New York, NY, Elsevier.
- Kersting, K. and De Raedt, L., 2001. "Adaptive Bayesian Logic Programs." Proceedings of the Eleventh International Conference on Inductive Logic Programming (ILP 2001), Springer- Verlag.
- Koller, D. and Pfeffer, A., 1997. "Object-Oriented Bayesian Networks." Uncertainty in Artificial Intelligence: Proceedings of the Thirteenth Conference, San Francisco, CA, Morgan Kaufmann.
- Langseth, H. and Nielsen, T., 2003. "Fusion of Domain Knowledge with Data for Structured Learning in Object-Oriented Domains." Journal of Machine Learning Research, 4, pp. 339- 68.
- Laskey, K.B., 1991. "Conflict and Surprise: Heuristics for Model Revision." Uncertainty in Artificial Intelligence: Proceedings of the Seventh Conference, San Mateo, CA, Morgan Kaufmann.
- Laskey, K.B., 2006. MEBN: A Logic for Open-World Probabilistic Reasoning. C4I Center Technical Report C4I06-01. Fairfax, VA: George Mason University Laskey, K.B. and Costa, P., 2005. "Of Klingons and Starships: Bayesian Logic for the 23rd Century." Uncertainty in Artificial Intelligence: Proceedings of the Twenty-first Conference, Arlington, VA, AUAI Press.
- Laskey, K.B., D'Ambrosio, B., Levitt, T.S., and Mahoney, S.M., 2000. "Limited Rationality in Action: Decision Support for Military Situation Assessment." Minds and Machines, Vol. 10: 53-77
- Laskey, K.B. and Mahoney, S.M., 1997. "Network Fragments: Representing Knowledge for Constructing Probabilistic Models." Uncertainty in Artificial Intelligence: Proceedings of the Thirteenth Conference, San Mateo, CA, Morgan Kaufmann.
- Laskey, K.B., Mahoney, S.M., and Wright, E., 2001. "Hypothesis Management in Situation- Specific Network Construction." Uncertainty in Artificial Intelligence: Proceedings of the Seventeenth Conference, San Mateo, CA, Morgan Kaufman.
- Lauritzen, S., 1996. Graphical Models. Oxford: Oxford Science Publications.
- Levitt, T.S., Winter, C.L., Turner, C., J., Chestek, R.A., Ettinger, G.J., and Sayre, S.M., 1995. "Bayesian Inference-Based Fusion of Radar Imagery, Military Forces and Tactical Terrain Models in the Image Exploitation System/Balanced Technology Initiative." International Journal of Human-Computer Studies, 42.
- Lin, Y. and Druzdzel, M.J., 1997. "Computational Advantages of Relevance Reasoning in Bayesian Belief Networks." Uncertainty in Artificial Intelligence: Proceedings of the Thirteenth Conference, San Francisco, CA, Morgan Kaufmann.
- Mahoney, S.M., 1999. Network Fragments. Faifax, VA: School of Information Technology and Engineering, George Mason University.
- Mahoney, S.M. and Laskey, K.B., 1998. "Constructing Situation Specific Networks." Uncertainty in Artificial Intelligence: Proceedings of the Fourteenth Conference, San Mateo, CA, Morgan Kaufmann.
- Mahoney, S.M. and Laskey, K.B., 1999. "Representing and Combining Partially Specified Conditional Probability Tables." Uncertainty in Artificial Intelligence: Proceedings of the Fifteenth Conference, San Mateo, CA, Morgan Kaufmann.
- Milch, B., Marthi, B., Russell, S., Sontag, D., Ong, D.L., and Kolobov, A., 2005. "Blog: Probabilistic Models with Unknown Objects." Proceedings of the Nineteenth Joint Conference on Artificial Intelligence.
- Murphy, K., 1998. Dynamic Bayesian Networks: Representation, Inference and Learning. Berkeley, CA: Computer Science Division, University of California. 2/27/06
- Natarajan, S., Tadepalli, P., Altendorf, E., Dietterich, T.G., Fern, A., and Restificar, A., 2005. "Learning First-Order Probabilistic Models with Combining Rules." Proceedings of the 22nd International Conference on Machine Learning.
- Neapolitan, R.E., 2003. Learning Bayesian Networks. New York: Prentice Hall.
- Ngo, L. and Haddawy, P., 1997. "Answering Queries from Context-Sensitive Probabilistic Knowledge Bases." Theoretical Computer Science, 171, pp. 147-77.
- Oakes, D., 1986. "Self-Calibrating Priors Do Not Exist." Journal of the American Statistical Association, 80(390), pp. 339.
- Pearl, J., 1988. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. San Mateo, CA: Morgan Kaufmann.
- Peirce, C.S., 1885. "On the Algebra of Logic." American Journal of Mathematics, 7, pp. 180-202.
- Pfeffer, A., 2000. Probabilistic Reasoning for Complex Systems. Stanford, CA, Stanford University.
- Pfeffer, A., 2001. "IBAL: An Integrated Bayesian Agent Language." Joint Conference on Artificial Intelligence (IJCAI).
- Poole, D., 1993. "Probabilistic Horn Abduction and Bayesian Networks." Artificial Intelligence, 64(1), pp. 81-129.
- Poole, D., 2003. "First-Order Probabilistic Inference." Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence.
- Russell, S. and Norvig, P., 2002. Artificial Intelligence: A Modern Approach. Upper Saddle River, NJ: Prentice-Hall.
- Sato, T., 1998. "Modeling Scientific Theories as PRISM Programs." ECAI98 Workshop on Machine Discovery.
- Savage, L.J., 1954. The Foundations of Statistics. New York: Wiley.
- Sowa, J.F., 2000. "Knowledge Representation: Logical, Philosophical and Computational Foundations," Brooks-Cole Publishers.
- Spiegelhalter, D.J., Thomas, A., and Best, N., 1996. "Computation on Graphical Models." Bayesian Statistics, 5, pp. 407-25.
- Stoll, R.P., 1963. Set Theory and Logic. New York: Dover Publications Inc.
- Stone, L.D., Barlow, C.A., and Corwin, T.L., 1999. Bayesian Multiple Target Tracking. Boston, MA: Artech House.
- Tarski, A., 1944. "The Semantical Concept of Truth and the Foundations of Semantics." Philosophy and Phenomenological Research, 4.
- Wellman, M.P., Breese, J.S., and Goldman, R.P., 1992. "From knowledge bases to decision models." The Knoweldge Engineering Review, 7(1), pp. 35-53.
- Whittaker, J., 1990. Graphical Models in Applied Multivariate Statistics. Chichester: John Wiley & Sons.