Academia.eduAcademia.edu

Unstructured Data Analytics

description13 papers
group9 followers
lightbulbAbout this topic
Unstructured Data Analytics refers to the process of examining and interpreting non-organized data, such as text, images, and videos, to extract meaningful insights and patterns. This field employs various techniques, including natural language processing and machine learning, to analyze data that does not fit traditional structured formats.
lightbulbAbout this topic
Unstructured Data Analytics refers to the process of examining and interpreting non-organized data, such as text, images, and videos, to extract meaningful insights and patterns. This field employs various techniques, including natural language processing and machine learning, to analyze data that does not fit traditional structured formats.

Key research themes

1. How can data preprocessing techniques be optimized for effective unstructured data analytics?

This theme investigates preprocessing challenges inherent in unstructured data and explores techniques to enhance data quality and understanding prior to analysis. Preprocessing is crucial due to the issues of missing data, outliers, varied granularity, and incomplete records that unstructured datasets frequently present. Optimizing preprocessing strategies impacts the accuracy and reliability of downstream analytics and extraction processes.

Key finding: This paper provides systematic treatment of preprocessing challenges with real-world datasets, highlighting techniques such as iterative data elimination and domain-expert feedback integration. It emphasizes that selecting... Read more
Key finding: The review distinguishes varied approaches (programmatic, workflow, dataset-centric, and automation-based) to data preparation, underscoring that automation-supported and interactive methods enhance handling of diverse... Read more
Key finding: This study advances a validated usability enhancement model specific to unstructured text data, incorporating subjective intention awareness and systematic usability dimension considerations. It identifies determinants and... Read more

2. What methodologies facilitate knowledge extraction and semantic understanding from unstructured textual data?

This research theme focuses on advanced techniques for extracting meaningful information, associations, and semantic structures from large volumes of unstructured text data. It addresses challenges including natural language processing, text mining, knowledge graph construction, entity recognition, and the use of ontologies to represent complex, heterogeneous data, which are pivotal for enabling effective analytics, prediction, and domain-specific insights from unstructured corpora.

Key finding: The paper develops association rule mining and prototypical document extraction methods for unstructured text collections, extending traditional data mining techniques. The formalization of keyword associations and document... Read more
Key finding: The work implements a knowledge graph framework that integrates named entity recognition and ontology-based semantic structuring for cyber incident data extracted from free-text online sources. By utilizing a machine learning... Read more
Key finding: This paper synthesizes methods linking unstructured text mining and behavioral science, particularly personality trait extraction from social media and other text sources. It shows that stable personality traits can be... Read more
Key finding: The paper proposes XDM, a semi-structured XML-based data model harmonizing heterogeneous mined patterns and raw data. XDM facilitates unified storage, query, and manipulation of various pattern types and data mining... Read more

3. How can unstructured data from social media and web sources be leveraged for domain-specific big data analytics?

This theme explores methods to collect, process, and analyze large-scale unstructured data originating from social media and web platforms, specifically targeting applications such as customer service analytics, disaster management, and business intelligence. Key concerns include data capture strategies, text mining challenges, integration with structured data, and extraction of actionable insights at scale for domain-specific decision support.

Key finding: The study demonstrates a framework combining tools to simultaneously retrieve and analyze voluminous Twitter-based customer service interactions from parcel shipping companies across multiple countries. It advances the field... Read more
Key finding: This paper presents a tailored model integrating tokenization, filtering, stemming, similarity measures, and named entity recognition to mine social media texts for natural disaster management. The system enables mapping of... Read more
Key finding: The study proposes a repeatable process for detecting, extracting, and integrating distributed athletics competition results from heterogeneous unstructured online PDF sources into a data warehouse. By complementing results... Read more

All papers in Unstructured Data Analytics

Unstructured data, which is the heart of big data, are generating in every moment due to the revolution of Internet. Relational data model is not the right tool to handle such unstructured data because it has limitation of scalability,... more
the law is a vast and complicated body of knowledge, and being able to access the right information quickly andaccurately can make the difference. Having access to information is essential to providing the best possible legal advice and... more
Unstructured data, which is the heart of big data, are generating in every moment due to the revolution of Internet. Relational data model is not the right tool to handle such unstructured data because it has limitation of scalability,... more
Trademarked names, logos, and images may appear in this book. Rather than use a trademark symbol with every occurrence of a trademarked name, logo, or image, we use the names, logos, and images only in an editorial fashion and to the... more
This paper examines the key lines of inquiry that have been used in research focused on the identity, dynamics, and diffusion of MIS, as well as the strengths and weaknesses associated with each approach. We present five primary means:... more
We also owe a great debt of gratitude to editors of several of the journals covered by this analysis for going to great lengths to grant us access to data. Table 7. Systems and Software Engineering (SSE) research Community Topic Areas... more
Unstructured data, which is the heart of big data, are generating in every moment due to the revolution of Internet. Relational data model is not the right tool to handle such unstructured data because it has limitation of scalability,... more
Unstructured data, which is the heart of big data, are generating in every moment due to the revolution of Internet. Relational data model is not the right tool to handle such unstructured data because it has limitation of scalability,... more
Mainstream knowledge management researchers generally agree that knowledge extracted from unstructured data and semi-structured data has become imperative for organizational strategic decision making. In this research, we develop a... more
Big Data has gained an enormous momentum the past few years because of the tremendous volume of generated and processed Data from diverse application domains. Nowadays, it is estimated that 80% of all the generated data is unstructured.... more
The Information Systems field is structured by the research topics emphasized by communities of journals. The Latent Categorization Method categorized and automatically named IS research topics in 14,510 abstracts from 65 Information... more
Download research papers for free!