Academia.eduAcademia.edu

Online Analytical Processing

description1,538 papers
group15 followers
lightbulbAbout this topic
Online Analytical Processing (OLAP) is a category of software technology that enables analysts, managers, and executives to gain insight into data through fast, consistent, interactive access in a variety of ways. It supports complex calculations, trend analysis, and sophisticated data modeling, facilitating multidimensional analysis of business data.
lightbulbAbout this topic
Online Analytical Processing (OLAP) is a category of software technology that enables analysts, managers, and executives to gain insight into data through fast, consistent, interactive access in a variety of ways. It supports complex calculations, trend analysis, and sophisticated data modeling, facilitating multidimensional analysis of business data.

Key research themes

1. How can real-time and online processing techniques enhance the efficiency and interactivity of Online Analytical Processing (OLAP) systems?

This research theme explores advancements in OLAP systems that move beyond traditional batch processing to support continuous, incremental, and adaptive analysis of streaming or large-scale data. The focus is on methodologies and system architectures enabling users to receive progressive aggregation results with confidence intervals, reduce latency in analytical queries, sustain high throughput, and provide interactive control over query execution. Such capabilities are fundamental to adapting OLAP for modern big data and real-time analytics scenarios, improving user experience and decision-making timeliness.

Key finding: Introduced a novel online aggregation interface within relational DBMS that allows users to observe and control query progress dynamically, providing running confidence intervals and output in random order. Demonstrated... Read more
Key finding: Developed Druid, an open-source, distributed, column-oriented analytical data store optimized for real-time ingestion and sub-second query latency over billion-row tables. Its architecture combines columnar storage, advanced... Read more
Key finding: Proposed AIM, an integrated system designed for telecom applications to process high-volume event streams while concurrently executing complex adhoc analytical queries with strict latency guarantees. The architecture combines... Read more
Key finding: Presented Squall, an open-source continuous query processing engine supporting SQL-like queries on streaming data, extended to handle arbitrary theta-joins with adaptive data routing and repartitioning. The system ensures... Read more
Key finding: Demonstrated the integration of OLAP functionality into a Grid computing environment within the GridMiner project, providing a functional and architectural framework that enables distributed, parallel OLAP over large,... Read more

2. What architectural principles and resource management strategies optimize the design and scalability of stream processing and federated analytical query systems?

This theme investigates high-level architectural designs and middleware services that enable scalable, efficient processing of continuous data streams and federated query execution across heterogeneous computational and data resources. These systems address challenges in balancing latency, throughput, load distribution, and integration of diverse data sources or processing engines, crucial for extending OLAP and continuous analytics to complex, real-world distributed environments such as telecom, scientific grids, and hybrid database-stream setups.

Key finding: Outlined comprehensive design principles distilled from practical implementation experiences in IBM System S and Spade for building scalable, low-latency distributed stream processing applications. Key considerations include... Read more
Key finding: Developed a suite of interactive Grid middleware services (steering, job monitoring, and estimator) providing end-users with greater visibility and control over job scheduling and data access in Grid environments. These... Read more
Key finding: Designed and implemented a cost-based federated query optimizer that partitions continuous queries operator-level across stream processing engines and columnar in-memory databases to maximize throughput. By incorporating plan... Read more
Key finding: Provided an in-depth analysis and architectural proposal of data warehouse systems focusing on the Extract-Transform-Load (ETL) process as a critical component for integrating heterogeneous data sources. The study emphasizes... Read more
Key finding: Presented a cloud-enabled, automated machine learning pipeline that seamlessly integrates dataset preprocessing, model training, evaluation, and result visualization into an accessible web-based interface and API. This... Read more

3. How can multidimensional data modeling and sensitivity analysis improve the interpretability and decision-making capabilities of OLAP systems in business contexts?

This theme focuses on extensions to traditional OLAP frameworks through advanced multidimensional equation systems, sensitivity analysis, and integration of business models to enable rigorous exploration of how input variations affect aggregated outputs. It addresses challenges like maintaining consistency and solvability in OLAP-derived equations, equipping decision-makers with tools to perform 'what-if' analyses that enhance understanding of business impacts at different aggregation levels.

Key finding: Implemented a domain-specific data warehouse leveraging OLAP to centralize operational and historical data for supplier performance analysis. This enables multidimensional querying and reporting that supports construction... Read more
Key finding: Designed a data warehouse for aggregating and analyzing student academic data across multiple universities to support informed decision-making. The system enables multidimensional analysis of academic performance and... Read more
Key finding: Combined a data warehouse architecture with the VIKOR multi-criteria decision-making method to provide executives with accurate, consistent, and timely information. This approach supports strategic and tactical decisions by... Read more
Key finding: Critically evaluated OLAP enhancements focused on cube computation, aggregation, and data hierarchy management. The survey identifies the importance of multidimensional data models and operations such as rollup, drill-down,... Read more

All papers in Online Analytical Processing

Parallel and distributed data warehouse architectures have been evolved to support online queries on massive data in a short time. Unfortunately, the emergence of e-application has been creating extremely high volume of data that reaches... more
Досліджено, що в діяльності компанії важлива роль приділяється інформаційному забезпеченню, оскільки підприємству важливо інформувати та забезпечувати інформаційним менеджментом підприємство. Досліджено процес прийняття управлінських... more
Real data mining analysis applications call for a framework which adequately supports knowledge discovery as a multi-step process, where the input of one mining operation can be the output of another. Previous studies, primarily focusing... more
Conversational interfaces to Business Intelligence (BI) applications enable data analysis using a natural language dialog in small incremental steps. To truly unleash the power of conversational BI to democratize access to data, a system... more
Visual big data analytics aims at supporting big data analytics via visual metaphors, with a plethora of applications in modern settings and scenarios. In all these domains, visual big data analytics paradigms offer several advantages,... more
Data Warehouses and OLAP (On Line Analytical Processing) technologies are dedicated to analyzing structured data issued from organizations' OLTP (On Line Transaction Processing) systems. Furthermore, in order to enhance their decision... more
Les entrepôts de données sont actuellement reconnus comme étant un composant essentiel des systèmes d'aide à la décision dans la mesure où ils offrent la meilleure réponse aux problèmes de prise de décision des différents domaines... more
Faced with the continuous increasing amount of data and the complexity of multidimensional queries, data warehouses must be split to be stored in distributed architectures and to enhance the performance of the queries. On the other hand,... more
During the constantly increasing competition, higher educational institutions try to develop strategies and apply new instruments to ensure learning and teaching quality. The education sector's decision-making process faces strict... more
During the constantly increasing competition, higher educational institutions try to develop strategies and apply new instruments to ensure learning and teaching quality. The education sector's decision-making process faces strict... more
An OLAP cube is typically explored with multiple aggregations selecting different subsets of cube dimensions to analyze trends or to discover unexpected results. Unfortunately, such analytic process is generally manual and fails to... more
We propose a framework for efficient OLAP on information networks with a focus on the most interesting kind, the topological OLAP (called "T-OLAP"), which incurs topological changes in the underlying networks. T-OLAP operations generate... more
Nowadays automated data collection tools and mature database technology lead to tremendous amounts of data stored in databases, data warehouses and other information repositories in business organizations. In this paper, a novel approach... more
A tool which algorithmically traces the effectiveness of the text files would be helpful in determining whether the text file have all the characteristic of important concepts. Every text source is build up on key phrases, and these... more
FunctionalLinkNeuralNetwork(FLNN)hasemergedasanimportanttoolforsolvingnon-linear classificationproblemandhasbeensuccessfullyappliedinmanyengineeringandscientificproblems.... more
In the Big Data community, MapReduce has been seen as one of the key enabling approaches for meeting continuously increasing demands on computing resources imposed by massive data sets. The reason for this is the high scalability of the... more
Providing a customized support for the OLAP brings tremendous challenges to the OLAP technology. Standing at the crossroads of the preferences and the data warehouse, two emerging trends are pointed out; namely: (i) the personalization... more
Bitmap indices have become popular access methods for data warehouse applications and decision support systems with large amounts of read-mostly data. This paper could arrive a number of results such as ; Bitmap Index highly improves the... more
The key objective of the chapter would be to study the classification accuracy, using feature selection with machine learning algorithms. The dimensionality of the data is reduced by implementing Feature selection and accuracy of the... more
The goal of this article is to depict Object Oriented Conceptual Model Data Cube using it as an example. In short, the paper has described: 1) Structural level in object oriented class definition of data cube; 2) The basic methods for... more
Data warehousing has become an essential tool in modern organizations driven by increasing business complexity and technological advancements. Organizations collect vast amounts of data from multiple sources that require efficient storage... more
Big Data is an important topic for discussion and research. It has gained this importance due to the meaningful value that could be extracted from these data. The application of Big Data in the modern business allows enterprises to take... more
Modeling plays an important role for the solution of the complex research problems. When the database became large and complex then it is necessary to create a unified model for getting the desired information in the minimum time and to... more
The process of selecting suppliers is a formidable challenge in managing industrial construction projects. Construction companies generate a huge amount of data that is spread across multiple databases, but this data is not used to... more
Transformation design is a key step in model-driven engineering, and it is a very challenging task, particularly in context of the model-driven data warehouse. Currently, this process is ensured by human experts. The authors propose a new... more
This paper studies a new machine learning application with possibly a challenging benchmark for relational learning systems. We are interesting in the automation of Model-Driven Data Warehouse [6, 13, 2] using machine learning techniques.... more
Nowadays, there are an emergence of spatial or geographic data stored in several and heterogeneous databases, mostly in Geographic Information Systems (GIS). The diversity of GIS and the increasing accumulation of non-spatial (simple... more
DBPubs is a system for effectively analyzing and exploring the content of database publications by combining keyword search with OLAP-style aggregations, navigation, and reporting. DBPubs starts with keyword search over the content of... more
Content Management Systems (CMS) store enterprise data such as insurance claims, insurance policies, legal documents, patent applications, or archival data like in the case of digital libraries. Search over content allows for information... more
Visualization of multivariate data is a big challenge to problems of visual analytics. A system of data visualization is composed of visual mapping stage and visual display stage. The stage of visual mapping converts data to graph and the... more
In this paper the so-called Event Cube is introduced, a multidimensional data structure that can hold information about all business dimensions. Like the data cubes of online analytic processing (OLAP) systems, the Event Cube can be used... more
Universities collect and generate a considerable amount of data on students throughout their academic career. Currently in South Kivu, most universities have an information system in the form of a database made up of several disparate... more
Due to the increasingly di culty of discovering patterns in real-world databases using only conventional OLAP tools, an automated p r ocess such as data mining is currently essential. As data mining over large data sets can take a... more
Recent years have shown the need of an automated p r ocess to discover interesting and hidden patterns in real-world databases, handling large volumes of data. This sort of process implies a lot of computational power, memory and disk I... more
In data warehousing applications, numerous OLAP queries involve the processing of holistic aggregators such as computing the "top n," median, quantiles, etc. In this paper, we present a novel approach called dynamic bucketing to... more
Agradeço aos amigos do doutorado, Ruben, Ellison e Cristiano, pela amizade. Aos amigos Daniel Diaz, Mateus, Marcelo e Werther pela amizade e disposição em ajudar-me. A Alberto Sulaiman e Rosalvo Streit pelo apoio e incentivo dado ao meu... more
Universities collect and generate a considerable amount of data on students throughout their academic career. Currently in South Kivu, most universities have an information system in the form of a database made up of several disparate... more
In many application contexts, like statistical databases, scientific databases, query optimizers, OLAP, and so on, data are often summarized into synopses of aggregate values. Summarization has the great advantage of saving space, but... more
The continuous growth of OLAP users and data impose additional stress on data management and hardware infrastructure. The distribution of multidimensional data through a number of servers allows the increasing of storage and processing... more
In OLAP, the materialization of multidimensional structures is a sine qua non condition of performance. Problems that come along with this need have triggered a huge variety of proposals: the picking of the optimal set of aggregation... more
Answering performance to business queries, mainly of aggregated nature, known as On-Line Analytical Processing queries, depends heavily on the proper selection of multidimensional structures, known as materialized sub- cubes or views. As... more
The problem of understanding the behavior of business processes and of services is rapidly becoming a priority in medium and large companies. To this end, recently, analysis tools as well as variations of data mining techniques have been... more
Materialized views in data warehouses are typically complicated, making the maintenance of such views difficult. However, they are also very important for improving the speed of access to the information in the data warehouse. So, the... more
In the context of Volunteered Geographic Information (VGI), volunteers are not involved in the decisional processes. Moreover, VGI systems do not offer advanced historical analysis tools. Therefore, in this work, we propose to use Data... more
Universities collect and generate a considerable amount of data on students throughout their academic career. Currently in South Kivu, most universities have an information system in the form of a database made up of several disparate... more
One of the critical deficiencies of SQL is lack of support for ndimensional array-based computations which are frequent in OLAP environments. Relational OLAP (ROLAP) applications have to emulate them using joins, recently introduced SQL... more
One of the critical deficiencies of SQL is lack of support for ndimensional array-based computations which are frequent in OLAP environments. Relational OLAP (ROLAP) applications have to emulate them using joins, recently introduced SQL... more
Download research papers for free!