A Semantic Question Answering Framework for Large Data Sets
2016, Open J. Semantic Web
Abstract
Traditionally, the task of answering natural language questions has involved a keyword-based document retrieval step, followed by in-depth processing of candidate answer documents and paragraphs. This post-processing uses semantics to various degrees. In this article, we describe a purely semantic question answering (QA) framework for large document collections. Our high-precision approach transforms the semantic knowledge extracted from natural language texts into a language-agnostic RDF representation and indexes it into a scalable triplestore. In order to facilitate easy access to the information stored in the RDF semantic index, a user's natural language questions are translated into SPARQL queries that return precise answers back to the user. The robustness of this framework is ensured by the natural language reasoning performed on the RDF store, by the query relaxation procedures, and the answer ranking techniques. The improvements in performance over a regular free text s...
References (34)
- M. Balakrishna and D. Moldovan, "Automatic Building of Semantically Rich Domain Models from Unstructured Data," in Proceedings of the Twenty-Sixth International Florida Artificial Intelligence Research Society Conference (FLAIRS 2013), St. Pete Beach, Florida, May 22-24, 2013.
- M. Balakrishna, D. Moldovan, M. Tatu, and M. Olteanu, "Semi-Automatic Domain Ontology Creation from Text Resources," in Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010), Valletta, Malta, May 17-23, 2010.
- E. Blanco and D. I. Moldovan, "Unsupervised Learning of Semantic Relation Composition," in Proceedings of Human Language Technology, 2011, pp. 1456-1465.
- A. Bouziane, D. Bouchiha, N. Doumi, and M. Malki, "Question Answering Systems: Survey and Trends," Procedia Computer Science, vol. 73, pp. 366 -375, 2015.
- D. Damljanovic, M. Agatonovic, and H. Cunningham, "FREyA: An Interactive Way of Querying Linked Data Using Natural Language," in Proceedings of 8th Extended Semantic Web Conference, 2012, pp. 125-138.
- H. T. Dang, D. Kelly, and J. Lin, "Overview of the TREC 2007 Question Answering Track," in Proceedings of The Sixteenth Text REtrieval Conference, 2008.
- T. Erekhinskaya and D. Moldovan, "Lexical Chains on WordNet and Extensions," in Proceedings of the Twenty-Sixth International Florida Artificial Intelligence Research Society Conference (FLAIRS 2013), C. Boonthum-Denecke and G. M. Youngblood, Eds. AAAI Press, 2013.
- C. Fellbaum, Ed., WordNet: An Electronic Lexical Database. Cambridge, MA: MIT Press, 1998.
- IARPA, "Knowledge Discovery and Dissemination (KDD)," https://www.iarpa.gov/index.php/ research-programs/kdd, accessed: June 16, 2016.
- E. Kaufmann, "Talking to the Semantic Web? Natural Language Query Interfaces for Casual End-users," Ph.D. dissertation, University of Zurich, February 2009. [Online]. Available: http: //www.ifi.uzh.ch/pax/web/uploads/pdf/publication/ 1202/Dissertation Esther Kaufmann.pdf
- T. Khot, N. Balasubramanian, E. Gribkoff, A. Sabharwal, P. Clark, and O. Etzioni, "Markov Logic Networks for Natural Language Question Answering," Computing Research Repository (CoRR), vol. abs/1507.03045, 2015.
- K. Liu, J. Zhao, S. He, and Y. Zhang, "Question Answering over Knowledge Bases," IEEE Intelligent Systems, vol. 30, no. 5, pp. 26-35, 2015.
- V. Lopez, M. Fernndez, E. Motta, and N. Stieler, "PowerAqua: Supporting users in querying and exploring the Semantic Web." Semantic Web, vol. 3, no. 3, pp. 249-265, 2012.
- V. Lopez, V. Uren, E. Motta, and M. Pasin, "AquaLog: An Ontology-driven Question
- Answering System for Organizational Semantic Intranets," Web Semantics: Science, Services and Agents on the World Wide Web, vol. 5, no. 2, 2007.
- V. Lopez, V. Uren, M. Sabou, and E. Motta, "Is Question Answering Fit for the Semantic Web?: A Survey," Semantic Web, vol. 2, no. 2, pp. 125-155, Apr. 2011.
- D. Moldovan, M. Bowden, and M. Tatu, "A Temporally-Enhanced PowerAnswer in TREC 2006," in Proceedings of Text REtrieval Conference, 2006.
- D. Moldovan, C. Clark, and M. Bowden, "Lymba's PowerAnswer 4 in TREC 2007," in Proceedings of Text REtrieval Conference, 2007.
- Oracle, "RDF Semantic Graph Prerequisites, and Advanced Performance and Scalability for Semantic Web Applications," 2014.
- A.-M. Popescu, O. Etzioni, and H. Kautz, "Towards a Theory of Natural Language Interfaces to Databases," in Proceedings of 2003 International Conference on Intelligent User Interfaces (IUI'03), 2003, pp. 149-157.
- RDF Working Group, "Resource Description Framework (RDF)," http://www.w3.org/RDF/, 2014.
- M. Richardson and P. Domingos, "Markov Logic Networks," Mach. Learn., vol. 62, no. 1-2, pp. 107- 136, Feb. 2006.
- M. Tatu, M. Balakrishna, S. Werner,
- T. Erekhinskaya, and D. Moldovan, "Automatic Extraction of Actionable Knowledge," in Proceedings of IEEE Tenth International Conference on Semantic Computing, 2016.
- M. Tatu, S. Werner, M. Balakrishna, T. Erekhinskaya, and D. Moldovan, "Semantic Question Answering on Big Data," in Proceedings of International Workshop on Semantic Big Data (SBD 2016), 2016.
- The Apache Software Foundation, "Apache Jena - A free and open source Java framework for building Semantic Web and Linked Data applications," http: //jena.apache.org/, accessed: June 16, 2016.
- C. Unger, L. Bühmann, J. Lehmann, A.-C. Ngonga Ngomo, D. Gerber, and P. Cimiano, "Template-based Question Answering over RDF Data," in Proceedings of WWW '12, 2012, pp. 639- 648.
- C. Unger, C. Forascu, V. Lopez, A.-C. N. Ngomo, E. Cabrio, P. Cimiano, and S. Walter, "Question Answering over Linked Data (QALD- 5)," in Cross-Language Evaluation Forum CLEF (Working Notes), ser. CEUR Workshop Proceedings, L. Cappellato, N. Ferro, G. J. F. Jones, and E. SanJuan, Eds., vol. 1391, 2015.
- E. M. Voorhees, "The TREC-8 Question Answering Track Report," in Proceedings of the Eighth Text REtrieval Conference, 1999, pp. 77-82.
- W3C, "Oracle OWLPrime," www.w3.org/2007/ OWL/wiki/OracleOwlPrime, Jan 24, 2008.
- W3C, "SPARQL Query Language for RDF," http: //www.w3.org/TR/rdf-sparql-query/, January 15, 2008.
- "WordNet RDF," http://wordnet-rdf.princeton. edu/, November 7, 2013.
- M. Yahya, K. Berberich, S. Elbassuoni, M. Ramanath, V. Tresp, and G. Weikum, "Natural Language Questions for the Web of Data," in Proceedings of the 2012 Conference on Empirical Methods on Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL '12), 2012, pp. 379-390.
- X. Yao, J. Berant, and B. Van Durme, "Freebase QA: Information Extraction or Semantic Parsing?" in Proceedings of ACL Workshop on Semantic Parsing, 2014.