Computer-aided Ontology Development: an integrated environment
2010
Abstract
In this paper we introduce CODA (Computer-aided Ontology Development Architecture), an architecture and a framework for the semi-automatic development of ontologies through the analysis of heterogeneous information sources. Its design was motivated by the observation that several fields of research have made interesting contributions towards augmenting and enriching ontology content, but that they lack a common perspective and a systematic approach. While specific architectures and frameworks have been defined in the context of Natural Language Processing, the field is not yet mature enough to offer systems able to reuse the extracted information for ontology enrichment: several examples exist, but they do not comply with any leading model or architecture. The objective of CODA is to acknowledge and improve existing frameworks in order to cover these gaps, by providing: a conceptual systematization of the data extracted from unstructured information to enrich ontology content, an architecture defining the components that take part in such a scenario, and a framework supporting all of the above through standard implementations. This paper provides a first overview of the whole picture and introduces UIMAST, an extension for the Knowledge Management and Acquisition platform Semantic Turkey that implements CODA principles by allowing the reuse of components developed inside the UIMA framework to drive the semi-automatic acquisition of knowledge from Web content.