Unstructured Data Analytics

13 papers

9 followers

About this topic

Unstructured Data Analytics is the process of examining and interpreting data that lacks a predefined structure, such as text, images, and videos, to extract meaningful insights and patterns. The field employs techniques including natural language processing and machine learning to analyze data that does not fit traditional structured formats.

Key research themes

1. How can data preprocessing techniques be optimized for effective unstructured data analytics?

This theme investigates the preprocessing challenges inherent in unstructured data and explores techniques that improve data quality and understanding prior to analysis. Preprocessing is crucial because unstructured datasets frequently exhibit missing data, outliers, varied granularity, and incomplete records. The choice of preprocessing strategy affects the accuracy and reliability of downstream analytics and extraction processes.

Data preprocessing and intelligent data analysis

by Bharath Kumar

2021, Intelligent Data …

Key finding: This paper provides systematic treatment of preprocessing challenges with real-world datasets, highlighting techniques such as iterative data elimination and domain-expert feedback integration. It emphasizes that selecting...

Data Preparation: A Technological Perspective and Review

by Pavel Pankin

2023, SN Computer Science

Key finding: The review distinguishes varied approaches (programmatic, workflow, dataset-centric, and automation-based) to data preparation, underscoring that automation-supported and interactive methods enhance handling of diverse...

Usability enhancement model for unstructured text in big data

by Khor Wang

2024, Journal of Big Data

Key finding: This study advances a validated usability enhancement model specific to unstructured text data, incorporating subjective intention awareness and systematic usability dimension considerations. It identifies determinants and...

2. What methodologies facilitate knowledge extraction and semantic understanding from unstructured textual data?

This research theme focuses on advanced techniques for extracting meaningful information, associations, and semantic structures from large volumes of unstructured text data. It addresses challenges including natural language processing, text mining, knowledge graph construction, entity recognition, and the use of ontologies to represent complex, heterogeneous data, which are pivotal for enabling effective analytics, prediction, and domain-specific insights from unstructured corpora.

Text mining-knowledge extraction from unstructured textual data

by Martin Rajman

2021, Proceedings of the 6th Conference of International …

Key finding: The paper develops association rule mining and prototypical document extraction methods for unstructured text collections, extending traditional data mining techniques. The formalization of keyword associations and document...
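To make the idea of keyword association rules concrete, the following is a minimal sketch, not the paper's own algorithm. It assumes each document has already been reduced to a set of keywords and scores single-keyword rules "a implies b" with the standard support and confidence measures. The function name `keyword_rules`, the thresholds, and the sample documents are hypothetical.

```python
from itertools import permutations


def keyword_rules(docs, min_support=0.5, min_confidence=0.8):
    """Return (antecedent, consequent, support, confidence) tuples.

    docs is a list of keyword lists; only rules with one keyword on each
    side are considered (an illustrative simplification).
    """
    n = len(docs)
    keyword_sets = [set(d) for d in docs]
    vocabulary = sorted(set().union(*keyword_sets))
    rules = []
    for a, b in permutations(vocabulary, 2):
        # Number of documents containing a, and containing both a and b.
        count_a = sum(1 for s in keyword_sets if a in s)
        count_ab = sum(1 for s in keyword_sets if a in s and b in s)
        support = count_ab / n           # fraction of all documents with a and b
        confidence = count_ab / count_a  # fraction of a-documents that also have b
        if support >= min_support and confidence >= min_confidence:
            rules.append((a, b, support, confidence))
    return rules


# Hypothetical keyword sets extracted from four short news documents.
docs = [
    ["merger", "bank", "shares"],
    ["merger", "bank"],
    ["bank", "loan"],
    ["merger", "bank", "loan"],
]
rules = keyword_rules(docs)
for antecedent, consequent, support, confidence in rules:
    print(f"{antecedent} => {consequent} (support={support}, confidence={confidence})")
```

The brute-force pair enumeration is adequate for a small vocabulary; larger collections would typically use a frequent-itemset algorithm such as Apriori to prune candidates.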

Knowledge mining of unstructured information: application to cyber domain

by Martti Lehto

2023, Scientific Reports

Key finding: The work implements a knowledge graph framework that integrates named entity recognition and ontology-based semantic structuring for cyber incident data extracted from free-text online sources. By utilizing a machine learning...

Identification and Prediction of Human Behavior through Mining of Unstructured Textual Data

by Edgar Gutiérrez

2023, Symmetry

Key finding: This paper synthesizes methods linking unstructured text mining and behavioral science, particularly personality trait extraction from social media and other text sources. It shows that stable personality traits can be...

Toward XML-based knowledge discovery systems

by Rosa Meo

2025, 2002 IEEE International Conference on Data Mining, 2002. Proceedings.

Key finding: The paper proposes XDM, a semi-structured XML-based data model harmonizing heterogeneous mined patterns and raw data. XDM facilitates unified storage, query, and manipulation of various pattern types and data mining...

3. How can unstructured data from social media and web sources be leveraged for domain-specific big data analytics?

This theme explores methods to collect, process, and analyze large-scale unstructured data originating from social media and web platforms, specifically targeting applications such as customer service analytics, disaster management, and business intelligence. Key concerns include data capture strategies, text mining challenges, integration with structured data, and extraction of actionable insights at scale for domain-specific decision support.

Creation of unstructured big data from customer service

by Arda Gezdur

2021, The International Journal of Logistics Management

Key finding: The study demonstrates a framework combining tools to simultaneously retrieve and analyze voluminous Twitter-based customer service interactions from parcel shipping companies across multiple countries. It advances the field...

Mining Unstructured Data in Social Media for Natural Disaster Management in Indonesia

by Harco Leslie Hendric Spits Warnars

2025

Key finding: This paper presents a tailored model integrating tokenization, filtering, stemming, similarity measures, and named entity recognition to mine social media texts for natural disaster management. The system enables mapping of...
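The pipeline stages named in this finding can be sketched as follows. This is an illustrative sketch under stated assumptions, not the paper's system: the stopword list, the `crude_stem` suffix stripper (a stand-in for a real stemmer such as Porter's), the choice of cosine similarity, and the sample posts are all hypothetical, and named entity recognition is omitted.

```python
import math
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption, not from the paper).
STOPWORDS = {"the", "a", "in", "of", "and", "is", "are", "to", "on"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def crude_stem(token):
    """Strip one common English suffix; a rough stand-in for a real stemmer."""
    for suffix in ("ing", "ed", "s"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Tokenize, filter stopwords, then stem the remaining tokens."""
    return [crude_stem(t) for t in tokenize(text) if t not in STOPWORDS]


def cosine_similarity(tokens_a, tokens_b):
    """Cosine similarity between two bag-of-words term-count vectors."""
    a, b = Counter(tokens_a), Counter(tokens_b)
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


# Hypothetical query and social media posts.
query = preprocess("Flooding reported in Jakarta")
posts = ["Floods are blocking roads in Jakarta", "Concert tickets on sale"]
scores = [cosine_similarity(query, preprocess(p)) for p in posts]
for post, score in zip(posts, scores):
    print(f"{score:.3f}  {post}")
```

Stemming is what lets "Flooding" and "Floods" match here; without it the first post would share only "jakarta" with the query.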

Extraction and Multidimensional Analysis of Data from Unstructured Data Sources: A Case Study

by Estrela Cruz

2019, 21st International Conference on Enterprise Information Systems

Key finding: The study proposes a repeatable process for detecting, extracting, and integrating distributed athletics competition results from heterogeneous unstructured online PDF sources into a data warehouse. By complementing results...

All papers in Unstructured Data Analytics

A Comparative Study on Big Data Handling Using Relational and Non-Relational Data Model

by Jannatul Maowa

2025, International Journal of Data Mining & Knowledge Management Process

Unstructured data, which is at the heart of big data, is generated at every moment as a result of the Internet revolution. The relational data model is not the right tool for handling such unstructured data because it has limited scalability,...

Legal Text Mining

by IJRASET Publication

2023, International Journal for Research in Applied Science & Engineering Technology (IJRASET)

The law is a vast and complicated body of knowledge, and being able to access the right information quickly and accurately can make the difference. Having access to information is essential to providing the best possible legal advice and...

A Comparative Study on Big Data Handling Using Relational and Non-Relational Data Model

by Sajedul Hoque

2023, International Journal of Data Mining & Knowledge Management Process

Interview Questions in Business Analytics

by amysoe DREAM-education

2022

Trademarked names, logos, and images may appear in this book. Rather than use a trademark symbol with every occurrence of a trademarked name, logo, or image, we use the names, logos, and images only in an editorial fashion and to the...

The Identity, Dynamics, and Diffusion of MIS

by Tor Magnus T M L Larsen

2022, IFIP International Federation for Information Processing

This paper examines the key lines of inquiry that have been used in research focused on the identity, dynamics, and diffusion of MIS, as well as the strengths and weaknesses associated with each approach. We present five primary means:...

Analyzing unstructured text data: Using latent categorization to identify intellectual communities in information systems

by David Monarchi

2022, Decision Support Systems

We also owe a great debt of gratitude to editors of several of the journals covered by this analysis for going to great lengths to grant us access to data. Table 7. Systems and Software Engineering (SSE) research Community Topic Areas...

A Comparative Study on Big Data Handling Using Relational and Non-Relational Data Model

by Rashed Mustafa

2021, International Journal of Data Mining & Knowledge Management Process

A Comparative Study on Big Data Handling Using Relational and Non-Relational Data Model

by JannaTul Maowa

2021

A FRAMEWORK FOR CAPTURING AND ANALYZING UNSTRUCTURED AND SEMI-STRUCTURED DATA FOR A KNOWLEDGE MANAGEMENT SYSTEM

by Computer Science & Information Technology (CS & IT) Computer Science Conference Proceedings (CSCP)

2020

Mainstream knowledge management researchers generally agree that knowledge extracted from unstructured data and semi-structured data has become imperative for organizational strategic decision making. In this research, we develop a...

Big Data Quality Assessment Model for Unstructured Data

by Ikbal Taleb

2019, IIT 2018: 13th International Conference on Innovations in Information Technology, Al Ain, United Arab Emirates

Big Data has gained enormous momentum over the past few years because of the tremendous volume of data generated and processed across diverse application domains. It is now estimated that 80% of all generated data is unstructured....

Figure 1 describes the most important stages that data goes through to serve the purpose for which it was gathered. From its inception, data is collected, transported through inter-networks, and saved into distributed storage around the world that offers the best quality-to-price ratio with a reliable network. It is then pre-processed to keep only the best-quality data and forwarded to processing and analytics for insight extraction.

Fig. 2. Data Quality and Data Structure. With a set of metrics, it is now possible to evaluate quality quantitatively when following a data-driven strategy on existing data. For structured data, quality assessment is straightforward because the data is available and the attributes with their corresponding values are accessible. Unstructured data, however, needs a different approach when we do not know how it is organized or what we are going to assess. Introducing a module that extracts, discovers, or defines attributes and features with a specific DQD mapping is mandatory to proceed with the quality exploration.

Fig. 4. Unstructured Big Data Quality Assessment Model. To address the challenges of assessing the quality of unstructured Big Data, we propose a quality assessment model that selects quality dimensions specific to each data type and evaluates its extracted features. Since unstructured data has no columnar values, we use a quantitative approach to data quality based on data contents. The model illustrated in Figure 4 has several components that the data passes through to produce a quality assessment report at the end.

Table 1. The 5 V’s characteristics of Big Data

Analyzing Unstructured Text Data: Using Latent Categorization to Identify Intellectual Communities in Information Systems

by David Monarchi and

2018, Decision Support Systems

The Information Systems field is structured by the research topics emphasized by communities of journals. The Latent Categorization Method categorized and automatically named IS research topics in 14,510 abstracts from 65 Information...