Academia.eduAcademia.edu

Data Modeling

description2,060 papers
group4,705 followers
lightbulbAbout this topic
Data modeling is the process of creating a conceptual representation of data structures, relationships, and constraints within a specific domain. It serves as a blueprint for organizing and managing data, facilitating communication between stakeholders and guiding the design of databases and information systems.
lightbulbAbout this topic
Data modeling is the process of creating a conceptual representation of data structures, relationships, and constraints within a specific domain. It serves as a blueprint for organizing and managing data, facilitating communication between stakeholders and guiding the design of databases and information systems.

Key research themes

1. How can computational and statistical modeling integrate formal theories and empirical data to advance social science data modeling?

This theme focuses on the development of data modeling approaches in social sciences that combine formal mathematical models, computational simulations, and empirical data to overcome the limitations of traditional game theory and statistical models. The central research question is how to build more complex, verisimilar behavioral models while maintaining rigorous empirical validation and theoretical coherence. This integration addresses challenges like the curse of dimensionality, overfitting, and the gap between deductive theory and data.

Key finding: The author critiques traditional mathematical models, especially game theory, for brittle assumptions and lack of empirical alignment, proposing a unified approach that combines formal deductive models, empirical analysis,... Read more
Key finding: This work introduces the Schema-Model Framework, characterizing algorithms learning probabilistic models from relational social data as comprising two parts: a schema defining meaningful data groupings and a probabilistic... Read more
Key finding: The paper conceptualizes a data model as an architectural view within software systems, articulating how the data model not only represents data entities and relationships but also guides database schema creation, code... Read more

2. What innovative programming models facilitate integration and processing of heterogeneous human-centric data for health and well-being applications?

This research theme investigates novel data modeling and programming frameworks designed to integrate and manipulate heterogeneous, continuous data streams originating from human-related sources, including sensor data, personal devices, and health records. The focus is on enhancing programmability, interoperability, and meaningful interpretation of intimate and variable human data to support real-time monitoring, personalized interventions, and improved healthcare or education outcomes.

Key finding: The authors present the Human Data Model (HDM), a JavaScript-based programming abstraction enabling the combination, computation, and scheduling of human-centric data from diverse sensor and digital sources. HDM's modular API... Read more
Key finding: This study leverages conditional generative adversarial networks to model and simulate dynamic contrast enhancement in breast MRI without requiring actual physical contrast agents. By synthesizing realistic and temporally... Read more
Key finding: Using advanced Python programming tools for statistical data analysis, this work models complex ecological datasets correlating climatic variables (e.g., temperature, precipitation) and vegetation responses in Alpine forests.... Read more
Key finding: This empirical and statistical modeling study links forest soil conditions, lichen epiphyte presence, and precipitation dynamics in subalpine coniferous forests. By integrating biophysical data with hydrological measurements,... Read more

3. How do advanced statistical and machine learning methodologies contribute to modeling complex ecohydrological systems in alpine forest ecosystems?

This theme addresses the application and development of multitask learning, statistical parameterization, and ontological frameworks to model and analyze ecohydrological dynamics in Alpine forest settings. The research integrates environmental data such as fog presence, forest age, soil conditions, and water balance components to improve understanding of interactions influencing hydrology and forest health under climate variability.

Key finding: The paper proposes a multitask statistical learning framework that concurrently models the effects of fog and forest age on water balance mechanisms in the Alps. Results demonstrate that interception significantly affects... Read more
Key finding: This study proposes an ontologically-driven framework combining statistical metrics and machine learning to dynamically explore models representing qualitative text-based data, such as interview transcripts, relevant for... Read more
Key finding: This survey presents a comprehensive overview of data modeling and analytics challenges arising from Big Data in environmental and operational settings. It delineates evolving paradigms beyond traditional relational... Read more
Key finding: The authors apply Python-based statistical data modeling to characterize climate-environment interactions affecting subalpine coniferous forests. The study quantifies how forest age modulates ecophysiological responses... Read more

All papers in Data Modeling

This paper discusses conceptual and logical data models for social media datasets and applications. On the one hand, we focus on the data representation requirements from the available APIs of some social network systems. On the other... more
Modeling the dependencies among multiple temporal attributes derived from integrated healthcare databases represents an unprecedented opportunity to support medical and administrative decisions. However, existing predictive models are not... more
The complex physical processes controlling ceiling and visibility (for example, the formation, evolution and motion of low cloud, precipitation and fog) and the diverse seasonal and geographic influences that modulate these controls... more
Healthcare insurance processes millions of claims daily, which makes it a prime target for fraud and errors. Due to the mistakes, there has been a massive increase in health insurance costs in recent years, and it's because of the payment... more
UML Activity Diagrams are dynamic blueprints of how tasks, actors, and other objects (e.g., data) weave together to achieve goals. Whether you are developing an app, streamlining business operations, or simply making sense of chaos,... more
UML class diagrams serve as one of the most powerful tools for representing systems. Class diagrams provide a precise, yet intuitive view of how a system is organized in terms of classes, attributes, relationships and operations. Through... more
This paper explores the use of deep learning for leak localization in Water Distribution Networks (WDNs) using pressure measurements. By using a training data set including enough samples of all possible leak localizations, a... more
The definition of the exact meaning of conceptual modeling constructs is considered a relevant issue since it contributes to their effective and appropriate use by conceptual modelers. This paper studies three related constructs that... more
Non-technical losses (NTLs), particularly due to electricity theft, pose challenges to distribution systems in the form of revenue loss, grid instability, and safety hazards. While machine learning (ML) models have been proposed for... more
An offshoot of research on developing methods typical business professionals can use to analyze systems for themselves, Sysperanto is being developed as an ontology codifying concepts and knowledge useful in describing and analyzing... more
other projects in the database. Software development data is highly variable, which often result,s in underlying trends being hidden. In order to address this problem, a method of data analysis, adapted from the financial community, is... more
Propelled by the increasing availability of large-scale high-quality data, advanced data modeling and analysis techniques are enabling many novel and significant scientific understanding of a wide range of complex social, natural, and... more
This paper focuses on the need for knowledge organization (KO) tools, such as library classifications, thesauri and subject heading systems, to be fully disclosed and available in the open network environment. The authors look at the... more
Maria Inês Cordeiro, Art Library, Calouste Gulbenkian Foundation, Lisbon, Portugal. Aida Slavic, SLAIS, University College London, UK. Data Models for Knowledge Organization Tools: Evolution and Perspectives. Abstract: This ...
In this article we compare two contrasting methods, active set method ASM and genetic algorithms, for learning the weights in aggregation operators, such as weighted mean Ž . Ž . WM , ordered weighted average OWA , and weighted ordered... more
XPDI: A System for Expert Systems Development This technical document describes XPDI (eXpert Product Data Interchange Station), a software environment developed by Patrice POYET and his team at CSTB in 1989-1992, aimed at integrating... more
Production Planning and Production Control systems are some of the keys that determine the development of modern production systems. The article presents the importance of a flexibility factor in the process of manufacturing, planning,... more
This project focuses on predicting employee attrition using machine learning techniques. By analyzing employee data such as age, salary, job satisfaction, and work experience, the model helps identify employees who are likely to leave the... more
Considerando-se um contexto de busca de recursos humanos qualificados e dispostos a inovar e aprender foi criado o Módulo de Estágios do sistema Integrado de Gestão e Pesquisa da Embrapa Gado de Corte ? Pandora ? no qual este Manual está... more
A grey wolf optimization algorithm is a newly developed metaheuristic algorithm. GWO has given a better solution to the optimization problem as compare to other swarm intelligence. It is a very simple and easy to implement this algorithm.... more
The FAIR Data Principles are widely applied to research data. These principles have been broadly adopted by scientific and scholarly institutions to guide research data infrastructures and services, ensuring data is findable, accessible,... more
The combination of social network extraction from texts, network analytics to identify key actors, and then simulation to assess alternative interventions in terms of their impact on the network is a powerful approach for supporting... more
The delicate ecosystems of the Alps' subalpine forests are crucial to water supplies as well as the local and mesoscale climate regulators. Although earlier research has assessed various aspects of the water balance, there is currently a... more
Typically, in simulating the dynamic processes in buildings, data modeling efforts require the modeling of the building geometry, its components and the relationship between these components, as well as the modeling of the process that is... more
In recent years, leakage detection models using artificial intelligence have been widely used by researchers. In this study, acoustic sound data were recorded at 10 observation points, focusing on ductile iron pipe and vinyl polyethylene.... more
Detailed spatial and temporal data on plant growth are critical to guide crop management. Conventional methods to determine field plant traits are intensive, time-consuming, expensive, and limited to small areas. The objective of this... more
Spatio-Temporal data is related to many of the issues around us such as satellite images, weather maps, transportation systems and so on. Furthermore, this information is commonly not static and can change over the time. Therefore the... more
The wave of interest in data-centric applications has spawned a high variety of data models, making it extremely difficult to evaluate, integrate or access them in a uniform way. Moreover, many recent models are too specific to allow... more
This paper introduces a strategy for both the retrieval and analysis of linked open data (LOD) based on the use of visual tools. Retrieving and understanding data from triplestores (such as SPARQL) requires technical knowledge and proves... more
Traditional data models often emphasize structure over context-dependent reality, leading to a loss of control by data creators. Consequently, protecting and providing on-demand access control to sensitive data remains a critical... more
The research presented here is carried out within the INTERREG EU project framework, which aims to the valorisation and dissemination of the role of the Church of St. Maria di Scaria (Como, Italy). It mainly focuses on the Carloni's... more
Purpose Deep generative models and synthetic data generation have become essential for advancing computer-assisted diagnosis and treatment. We explore one such emerging and particularly promising application of deep generative models,... more
Forests affect climate parameters through impacts from biochemical and biophysical processes. However, the effects of the coniferous stands on hydrological setting in temperate regions have received little attention. This study... more
Climate plays a pivotal role in construction of relationships between coniferous forest health and water balance for efficient biosynthesis under changing meteorological variables. Here we identify that the age of the forest (young <30... more
Due to the difficulty of employing real data centres' infrastructure for assessing the effectiveness of energy-aware algorithms, many researchers resort to use simulation tools. These tools require precise and detailed models for... more
This paper proposes a novel multi-task statistical learning framework which aims to concurrently address all the environmental challenges in the Alps. The goal is to analyse the effects of lichen and fog on water balance. The objective is... more
M2M (Machine-to-Machine) technologies provide a number of solutions in logistics and transport, such as fleet management solutions, asset tracking systems, parking space management and payment, road tolls, traffic volume monitoring,... more
Big theory requires more technique to transfer huge amount of data daily. The requirement of this technique is to manage this transfer of data from source to destination. But in big data environment data from different sources are of... more
Intelligent systems in automobiles need to be aware of the driving and driver context. Available sensor data stream has to be modeled and monitored in order to do so. Currently there exist no building blocks for hierarchical modeling of... more
Maintenance processes of repairable systems have been extensively studied in the past. The resulting simple solutions have proven to be remarkably effective. It requires complex and time-consuming simulations to improve on those simple... more
The high Alpine region of northern Italy is characterized by unique ecosystems, a complex hydrogeological setting, steep topographic gradients, variety of vegetation types and landscape patches, and varied in climatic and meteorological... more
Pengelolaan data historis dalam sistem data warehouse menjadi kebutuhan utama dalam organisasi modern, khususnya untuk analisis jangka panjang. Salah satu pendekatan yang digunakan adalah Slowly Changing Dimension (SCD), yang terbagi... more
A structured conceptual design of reporting systems is a crucial task that has to precede implementation and monitoring. A basic challenge can be seen in information requirements engineering. On the one hand, users need certain... more
Accurate wind speed forecasting is of great importance for many economic, business and management sectors. This paper introduces a new model based on convolutional neural networks (CNNs) for wind speed prediction tasks. In particular, we... more
Accurate wind speed forecasting is of great importance for many economic, business and management sectors. This paper introduces a new model based on convolutional neural networks (CNNs) for wind speed prediction tasks. In particular, we... more
Download research papers for free!