Academia.edu

Distributed Heterogeneous Data Access

8 papers
0 followers
About this topic
Distributed Heterogeneous Data Access refers to the methodologies and technologies that enable the retrieval and manipulation of data stored across multiple, diverse systems and formats. This field addresses challenges related to data integration, interoperability, and efficient querying in environments where data is distributed across various locations and platforms.

Key research themes

1. How do federated database architectures manage distribution, heterogeneity, and autonomy in distributed heterogeneous data access?

This theme investigates architectural models and system design methodologies for integrating multiple autonomous and heterogeneous database systems into federated database systems (FDBS). It focuses on defining reference architectures, managing varying degrees of distribution, heterogeneity in data models and query languages, and autonomy of component databases, with emphasis on controlled cooperation and schema integration.

Key finding: This paper defines a reference architecture for federated database systems that manages distribution by enabling data to be horizontally or vertically partitioned over multiple potentially geographically dispersed databases,...
Key finding: This overview updates federated database concepts incorporating modern data management technologies including NoSQL stores and distributed file systems. It characterizes architecture components addressing heterogeneity at...
Key finding: Examines seven heterogeneous distributed database systems targeted for production use, analyzing their architectural features around schema integration, distributed query management, and transaction management. It emphasizes...
Key finding: Presents the architecture of the Federated Utah Research and Translational Health e-Repository (FURTHeR) Federated Query Engine (FQE) designed for on-the-fly federated querying of heterogeneous biomedical data sources. The...
Key finding: Describes the Carnot system architecture which provides a layered approach to heterogeneous database integration through mediated access, constraint specification among resources, and distributed computing actors to...
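
As a rough illustration of the horizontal partitioning mentioned in this theme, the following minimal sketch sends one query to several autonomous component databases and unions the results. It is not taken from any of the papers listed here: the `patients` table, the two in-memory SQLite sources, and the helper names `make_source` and `federated_query` are all assumptions made for the example, and a real federated system would also have to handle schema mapping, differing query languages, and partial failures.

```python
import sqlite3


def make_source(rows):
    """Create an in-memory SQLite database holding one horizontal partition.

    The table name and columns are hypothetical, chosen only for this sketch.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE patients (id INTEGER, site TEXT, age INTEGER)")
    conn.executemany("INSERT INTO patients VALUES (?, ?, ?)", rows)
    return conn


def federated_query(sources, sql, params=()):
    """Send the same query to every component database and union the rows."""
    results = []
    for conn in sources:
        results.extend(conn.execute(sql, params).fetchall())
    return results


# Two component databases, each owning a disjoint subset of the rows.
site_a = make_source([(1, "A", 34), (2, "A", 71)])
site_b = make_source([(3, "B", 65), (4, "B", 29)])

older = federated_query(
    [site_a, site_b],
    "SELECT id, site FROM patients WHERE age >= ?",
    (60,),
)
print(sorted(older))
```

Because both sources share one schema here, the union is trivial; the heterogeneity the theme describes arises precisely when they do not.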

2. What methodologies and technologies enable semantic integration and query processing over distributed heterogeneous data repositories?

This theme focuses on semantic integration techniques that employ ontologies, description logics, and data federation strategies to enable uniform querying and information retrieval over distributed, heterogeneous repositories. It studies how semantic relationships, inter-ontology mappings, and reasoning capabilities inform query rewriting, optimization, and incremental answer generation to address schema heterogeneity and vocabulary sharing challenges in multi-source, distributed environments.

Key finding: Proposes a query processing framework based on Description Logics (DL) using domain-specific ontologies to represent the information content of distributed and heterogeneous data repositories. The system employs semantic...
Key finding: Introduces the OBSERVER system that addresses vocabulary heterogeneity by supporting multiple pre-existing domain ontologies and managing semantic inter-ontology relationships such as synonyms, hyponyms, and hypernyms during...
Key finding: Develops a comprehensive evaluation framework for data federation systems which emphasizes the use of unified schemas (e.g., RDF/OWL ontologies or relational schemas) and federated query languages (e.g., SPARQL, SQL) enabling...
Key finding: Presents the XMedia mediator system that integrates heterogeneous data sources by representing data as XML documents and supporting uniform querying via XQuery over XML views. The system features an XML algebra extending...
Key finding: Proposes the DISCO distributed mediator architecture addressing fragile mediators by managing data source connections via data modeling, enabling transparent addition of new sources and accommodating differing query language...
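
To make the idea of inter-ontology relationships more concrete, the sketch below expands a query term using synonym and hyponym links before the query would be sent to another repository. It is a simplified illustration, not the algorithm of OBSERVER or any other system listed here: the `SYNONYMS` and `HYPONYMS` tables and the function `expand_term` are hypothetical, synonym links are treated as one-directional, and hypernyms (which would broaden the query and lose precision) are deliberately left out.

```python
from typing import Dict, Set

# Hypothetical mappings: a term in the user's ontology and the
# equivalent terms in a target ontology.
SYNONYMS: Dict[str, Set[str]] = {"periodical": {"journal"}}

# Hypothetical mappings: a term and its more specific terms (hyponyms).
HYPONYMS: Dict[str, Set[str]] = {"publication": {"book", "periodical"}}


def expand_term(term: str) -> Set[str]:
    """Return the term plus its synonyms and, transitively, its hyponyms."""
    seen: Set[str] = set()
    stack = [term]
    while stack:
        current = stack.pop()
        if current in seen:
            continue  # guard against cycles in the mapping tables
        seen.add(current)
        stack.extend(SYNONYMS.get(current, ()))
        stack.extend(HYPONYMS.get(current, ()))
    return seen


print(sorted(expand_term("publication")))
```

Expanding only through synonyms and hyponyms keeps every returned term at least as specific as the original, so the rewritten query should not return results the user did not ask for under these assumed mappings.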

3. What strategies and architectures facilitate efficient, secure, and consistent distributed querying and metadata management in heterogeneous data environments?

Research under this theme explores architectural designs and system-level strategies for managing metadata, enforcing security, ensuring data consistency and autonomy, and optimizing distributed queries across heterogeneous data sources. It covers middleware components for interoperability, indexing architectures for metadata discovery, update management, and query engines that translate, distribute, and aggregate queries ensuring scalability, reliability, and compliance with privacy requirements in distributed heterogeneous databases.

Key finding: Proposes a distributed index architecture using descriptive hierarchies to organize metadata and route queries efficiently in large-scale networks. Applied in the NASA EOS domain, it departs from centralized metadata crawlers...
Key finding: Compares the Common Data Model (CDM) approach and Semantic Web principles for secure access to distributed clinical data, evaluating parameters like cost, data quality, interoperability, and efficiency. It identifies...
Key finding: Introduces an architecture comprising modules like the Integrator to abstract and hide network heterogeneity, enabling update and notification services of managed objects representing network resources. The system handles...
Key finding: Details the architecture of the Mermaid system that provides a front-end interface to multiple heterogeneous relational databases, using a common query language (ARIEL or SQL) with an intermediate language layer decoupling...
Key finding: Describes Pegasus, an object-oriented multidatabase system that integrates heterogeneous distributed object, relational, and other databases through uniform schemas using HOSQL language. It uses type and function abstractions...
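
One way to picture routing queries through a descriptive hierarchy is a tree whose nodes are topic labels and whose leaves point at data sources. The sketch below is only an assumed, single-process approximation of that idea, not the distributed architecture the paper describes: the class `IndexNode`, the topic paths, and the source names such as `archive-1` are invented for the example.

```python
class IndexNode:
    """One node of a hypothetical descriptive hierarchy."""

    def __init__(self, label):
        self.label = label
        self.children = {}  # child label -> IndexNode
        self.sources = []   # data sources registered at exactly this node

    def register(self, path, source):
        """Attach a source under the given topic path, creating nodes as needed."""
        node = self
        for part in path:
            node = node.children.setdefault(part, IndexNode(part))
        node.sources.append(source)

    def route(self, path):
        """Descend along path, then collect every source at or below that node."""
        node = self
        for part in path:
            node = node.children.get(part)
            if node is None:
                return []  # no source describes itself with this topic
        found, stack = [], [node]
        while stack:
            current = stack.pop()
            found.extend(current.sources)
            stack.extend(current.children.values())
        return found


root = IndexNode("root")
root.register(["atmosphere", "ozone"], "archive-1")
root.register(["atmosphere", "aerosols"], "archive-2")
root.register(["ocean", "temperature"], "archive-3")

print(sorted(root.route(["atmosphere"])))
```

A query about a broad topic reaches only the sources registered beneath it, which is the property that lets such an index avoid contacting every repository.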

All papers in Distributed Heterogeneous Data Access

The huge number of autonomous and heterogeneous data repositories accessible on the "global information infrastructure" makes it impossible for users to be aware of the locations, structure/organization, query languages and semantics of...
Considerable progress has been achieved in the area of virtual database systems, but substantial problems still persist. In this paper we discuss two current research directions: The resolution of extensional inconsistencies among...
The World Wide Web is fast becoming a ubiquitous computing environment. Prevalent keyword-based search techniques are scalable, but are incapable of accessing information based on concepts. We investigate the use of concepts from...
Recent emerging technologies such as internetworking and the World Wide Web (WWW) have significantly expanded the types, availability, and volume of data accessible to an information management system. In this new environment it is...
There has been an explosion in the types, availability and volume of data accessible in an information system, thanks to the World Wide Web (the Web) and related inter-networking technologies. In this environment, there is a critical need...
This paper addresses the development of a cooperative database system, called Distributed Object Kernel (DOK), which provides computation over heterogeneous databases. The DOK logical architecture has a clear separation of concerns, and...
Abstract: Recent emerging technologies such as internetworking and the World Wide Web (WWW) have significantly expanded the types, availability, and volume of data accessible to an information management system. In this new environment it is...
by Eduardo Mena and 1 more
Abstract: The World Wide Web is fast becoming a ubiquitous computing environment. Prevalent keyword-based search techniques are scalable, but are incapable of accessing information based on concepts. We investigate the use of concepts from...
Abstract: In this paper, we discuss how ontologies can represent and handle large volumes of heterogeneous information so that it can be organized according to the user's intent. We discuss ontologies from two points of view....