An Ontology based document management

Ján Hreňo

2008

Abstract

In this article, an approach to the problem of associating documents with a knowledge base is demonstrated in a real-world application. It is based on a combination of annotating documents with concepts from a knowledge base and grouping documents into clusters. Our knowledge base is an ontology provided by a dedicated ontology server.
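The combination described in the abstract, concept annotation followed by clustering, might be sketched roughly as follows. This is a minimal illustration only, not the system presented in the article: the toy `ONTOLOGY` dictionary stands in for the ontology server, and the keyword matching, the Jaccard similarity measure, the threshold, and all function names are assumptions made for the example.

```python
import re

# Hypothetical toy ontology: concept name -> trigger terms. This is an
# assumption; in the article the ontology comes from a dedicated server.
ONTOLOGY = {
    "Budget": {"budget", "tax", "finance"},
    "Transport": {"road", "bus", "traffic"},
    "Education": {"school", "teacher", "pupil"},
}


def annotate(text, ontology=ONTOLOGY):
    """Return the set of concepts whose trigger terms occur in the text."""
    words = set(re.findall(r"[a-z]+", text.lower()))
    return {concept for concept, terms in ontology.items() if words & terms}


def jaccard(a, b):
    """Jaccard similarity of two concept sets (0.0 when both are empty)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def cluster(annotations, threshold=0.5):
    """Greedy grouping: a document joins the first cluster whose first
    member has a sufficiently similar concept set, else starts a new one."""
    clusters = []
    for doc_id, concepts in annotations.items():
        for group in clusters:
            if jaccard(concepts, annotations[group[0]]) >= threshold:
                group.append(doc_id)
                break
        else:
            clusters.append([doc_id])
    return clusters


docs = {
    "d1": "The city budget raises the local tax.",
    "d2": "New tax rules affect municipal finance.",
    "d3": "A bus line will ease traffic on the main road.",
}
annotations = {doc_id: annotate(text) for doc_id, text in docs.items()}
groups = cluster(annotations)
print(groups)
```

In this sketch the two documents annotated with the hypothetical `Budget` concept end up in one cluster and the transport document in another. A real deployment would replace the keyword lookup with queries to the ontology server and would likely use a more robust clustering method.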

Peter Robinson

ADHO, 2018

Explains and presents the three-part ontology of documents, acts of communication and texts developed by the author, proposing that a text is an instance of an act of communication inscribed in a document. Accordingly, all texts have a double aspect: they are acts of communication, and they are present in documents. Each aspect may be represented as a tree, with each tree independent of the other. Text may therefore be conceived as a collection of leaves, with each leaf present on both the document and act of communication trees. The talk describes briefly how this model is implemented in the Textual Communities system.

An Infrastructure for Managing Semantic Documents

Ricardo Falbo

2010

Software documentation is an important means for stakeholders to collaborate in the software development context. However, several works point out that gathering relevant information from different documents can be so wearing that the people involved may tend not to do it. Combining ontologies and documents by adding semantic annotations to documents can help diminish the burden of gathering information later on. However, this approach also adds an overhead in documentation, namely the time spent on document annotation. To overcome some of these obstacles, we developed an infrastructure for managing semantic documents that combines semantic annotation of document templates, versioning of data extracted from semantic documents, and notification of interested people when extracted data has changed.

Ontology-enablement of a system for semantic annotation of digital documents

Kalliopi Zervanou, Babis Theodoulidis

2004

Abstract. We describe the recent enhancement of the CAFETIERE formalism (Conceptual Annotation of Facts, Events, Terms, Individual Entities and RElations) with the ability to link natural language words and phrases in textual documents with instances and classes from a language-enabled ontology. The language-enabled ontology is one with an index from one or more natural language expressions to each concept (as in WordNet). In an information extraction application.

Digital Documents as Data Carriers and a Method of Data Management Guaranteeing the Unambiguity of the Recorded Information: Ontology-Oriented Data Management and Document Databases

Jarosław Żeliński

Emerging Challenges, Solutions, and Best Practices for Digital Enterprise Transformation, 2021

This study presents a method, proven in practice, for storing data organized in digital documents. The method has none of the disadvantages of the relational model used for data organization, such as the loss of data context and the complications caused by the lack of data redundancy. It can be used to organize data into documents (digital and paper) as classified aggregates and to classify data. The study also describes a new metamodel for the data structure, which assumes that documents, being data structures, form compact aggregates classified as objects or event descriptions, thus always assigning them a specific and unambiguous context. Furthermore, the study presents a method for designing documents as context aggregates that offsets the disadvantages of the relational model and ensures efficient information management. The work also contains practical examples of the application of the described method.

Hybrid KM: Integrating Documents, Knowledge Bases, Databases, and the Web

Doug Skuce

Knowledge is a critical resource but we still do not have many new ideas on how to manage it. Most (online) knowledge is currently kept in conventional documents that are hard to structure, classify, browse, search, and even find. Organizations struggle with masses of such documents in hundreds of formats. Classical AI has largely ignored this real and serious problem, and while information retrieval research has tackled some of the problems, it is totally at odds with how AI tries to deal with knowledge problems. Cooperative work systems such as the Web and Lotus Notes are beginning to tackle that aspect. Database systems can contribute much of the required functionality. Hence we seek to integrate functionality and ideas from these sources.

Knowledge-Based Semantic Information Indexing and Management Framework: Integration of Structured Knowledge and Information Management Systems

Journal of Computer and Knowledge Engineering

Journal of Computer and Knowledge Engineering, 2020

One of the most challenging aspects of developing information systems is the processing and management of large volumes of information. One way to overcome this problem is to implement efficient data indexing and classification systems. As large volumes of generated data consist of unstructured textual data, developing text processing, management and indexing frameworks can play an important role in providing users with accurate information according to their preferences. In this paper, a novel method of semantic information processing, management and indexing is introduced. The main goals of this study are to integrate structured knowledge of ontology and Knowledge Bases (KBs) in the core components of the method, to enrich the contents of the documents, to have a multi-level semantic network representation of textual resources, to introduce a hybrid weighting schema (salient score) and finally to propose a hybrid method of semantic similarity computation. The structured knowledge of ontology and KBs is integrated into all aspects of the proposed method. The obtained results indicate the accuracy and optimal performance of the proposed framework. They suggest that using knowledge-based models leads to higher performance and accuracy in identifying and classifying documents according to user preferences; however, if learning-based models are not provided with a sufficient amount of training data, they cannot yield satisfying results. The results also demonstrate that the complete integration of ontology and KBs in information systems can significantly contribute to a better representation of documents and evidently superior functionality of information processing, management and indexing systems.

Integrated document and knowledge management for the knowledge-based enterprise

Elisa Bertino

2000

The CONCERTO project is concerned with the creation and management of knowledge repositories. The distinctive approach is to maintain an association between the textual form in which knowledge is expressed in source documents, and an expressive narrative knowledge representation language that supports inference and query operations.

Towards a semantic web: ontology development based on the extraction of semantic concepts from digital documents

Rocío Abascal-Mena

2009

On the road to building the Semantic Web, an extension of the Web, we find the same problems, such as the difficulty of sharing and reusing knowledge. The aim of this article is to present the development of an ontology in the context of a digital library, based on the use of Natural Language Processing (NLP) tools. Our approach is based on the analysis of scientific documents and the use of a term-acquisition tool called Nomino. A corpus was processed by extracting noun phrases to be used with LIKES, a tool capable of identifying relationships between concepts. The final ontology was modeled using Protégé-2000. In this way, our ontology provides a comprehensive representation of scientific terms, and it is used to enhance users' requests.

Classification and Ontology Maintenance in Agent-Based Knowledge Management Frameworks: A Prototypical Approach

Clarissa Falge

2007

Being able to create views on the document space by grouping documents is a key functionality in intelligent document management for browsing and querying. Hierarchically grouped sets of documents can be viewed as simple, extensionally defined ontological concepts. In an example knowledge management system (KnowCat) developed at UAM, Madrid, we investigate how agents for the maintenance of this ontology (these document groupings) can be constructed. We discuss two examples: a classification agent and a maintenance agent that support users and administrators of the system in keeping the ontology tight and functional. The agents are developed for and tested on Spanish natural-language documents, which requires adapted NLP techniques.

Expert knowledge management based on ontology in a digital library

Carlos Leon

2010

The architecture of future digital libraries should allow any user to access available knowledge resources from anywhere, at any time, and in an efficient manner. Moreover, for the individual user, a great deal of useless information accompanies the substantial amount of useful information. The goal is to investigate how best to combine Artificial Intelligence and Semantic Web technologies for semantic searching across largely distributed and heterogeneous digital libraries. Artificial Intelligence and the Semantic Web have provided both new possibilities and challenges for automatic information processing in search engines. The major research tasks involved are to apply an appropriate infrastructure for the construction of a specific digital library system, to enrich metadata records with ontologies, and to enable semantic searching on such an intelligent system infrastructure. We study how to improve the efficiency of search methods over a distributed data space such as a digital library. This paper outlines the development of a Case-Based Reasoning prototype system, based on an ontology, for information retrieval in the Digital Library of the University of Seville. The results demonstrate that using an expert system and the ontology in the retrieval process enhances the effectiveness of information retrieval.

Miquel Casals

2007

Electronic Document Management Systems (EDMS) are Information Technology (IT) applications that have started to be used in the construction industry as a tool to reduce some of the problems generated by fragmentation. However, EDMS also have some limitations, most of them related to interoperability and information exchange between systems. To solve these problems, different projects, standards and initiatives based on information classification systems and ontologies are being developed, such as the ISO 12006 series, Industry Foundation Classes (IFC), Lexicon, e-Construct and e-Cognos, among others. This paper describes the development of an ontology for AEC/FM project documentation management, aimed at establishing a hierarchical structure of the different areas that make up the lifecycle of AEC/FM projects and a system of interrelationships between them, in which all the documentation created during a project is classified. Therefore, this ontology provides a context-relat...

A Semantic-Based Approach for the Management of Digital Documents

JUAN CAMILO OROZCO GIRALDO

2008 11th IEEE International Conference on Computational Science and Engineering - Workshops, 2008

The Semantic Web intends to transform information into knowledge. It is an extension of the current Web in which the semantics of information is defined through structured and legible metadata, making it possible for software agents to understand and retrieve this information. This paper presents the project "SABIOS", which proposes introducing emerging semantic technologies, combined with multi-agent systems and information retrieval techniques, to improve the insertion, cataloging, and retrieval of digital documents. The resulting system is composed of three modules: a knowledge module, a semantic search module, and a visualization and navigation module.

Ontology-Based Framework for Document Indexing

P. Maret

Proceedings of the 4th International Workshop on Pattern Recognition in Information Systems, 2004

The work presented in this paper addresses a project of the Computer Centre CIRTIL, which supported it. This company wants to preserve and capitalize on its knowledge and know-how about its production activities, especially concerning the technical incidents relating to software applications encountered during operation. Indeed, with a readily accessible document base, actors will be better able to solve problems. Our purpose is to focus on an ontology-based framework for indexing documents. The domain ontology OntoCIRTIL has a structure which supports a semantic model based on semantic links and inference mechanisms. In this paper, we present a new model called S 3, which makes it possible to model knowledge upstream and to index documents (or formalized knowledge) downstream. To illustrate partial results, this model is then applied to OntoCIRTIL.

An ontology-based knowledge management platform

Antonio Moreno

IIWeb, 2003

We describe the development of a knowledge management platform for web-enabled environments featuring intelligence and insight capabilities. The effort is the result of an FP5 project under the IST initiative involving 3 universities, a technology provider and 5 user companies. The main objective of the platform is to analyse, search and present information retrieved from the web (or any other type of document). This is achieved through the use of Multi-Agent Systems and ontologies. The automatic evolution of dynamic ontologies requires the action of a collection of agents to extract information and discover links using classification and learning techniques. These general-purpose agents will maintain a goal to periodically access the ontology and support search functions. Conceptually similar documents would get clustered into categories and information could then be retrieved by statistical approaches. Discovery of new knowledge would lead to modifications in the ontology by pruning irrelevant sections, refining its granularity and/or testing its consistency.

DOCUMENT INDEXING - PROVIDING A BASIS FOR SEMANTIC DOCUMENT ANNOTATION

Harald Sack, Clemens Beckstein

A document index is a concise, ordered compilation of the document's most important topics. It provides direct and fast access to the document parts related to the index information. Using structural knowledge of the document itself together with general knowledge about indexing, a two-layered Index Graph is defined and then mapped to an ontology representation. By defining suitable metrics, it is shown how the Index Graph can be utilised to augment semantic applications. We have developed a system that supports the author of a document in the process of index compilation. Other possible applications include document visualisation and semantic document annotation.
