Academia.eduAcademia.edu

Evolutionary Clustering

description15 papers
group10 followers
lightbulbAbout this topic
Evolutionary clustering is a data analysis technique that combines principles of evolutionary biology with clustering algorithms to identify and group similar data points. It emphasizes the dynamic nature of data, allowing for the adaptation of clusters over time as new data emerges, thereby reflecting changes in the underlying structure of the dataset.
lightbulbAbout this topic
Evolutionary clustering is a data analysis technique that combines principles of evolutionary biology with clustering algorithms to identify and group similar data points. It emphasizes the dynamic nature of data, allowing for the adaptation of clusters over time as new data emerges, thereby reflecting changes in the underlying structure of the dataset.

Key research themes

1. How do evolutionary algorithms optimize clustering with automatic or variable cluster number estimation?

This research area focuses on developing evolutionary algorithm (EA) methods that can automatically determine or adapt the number of clusters k during the clustering process, addressing limitations of classical K-means and other fixed-k algorithms. It tackles challenges like unknown cluster count, stream data dynamics, and the NP-hardness of clustering, employing multi-objective and metaheuristic strategies to balance cluster compactness and separation. This theme is vital because real-world data often lack prior knowledge of cluster numbers, and adaptive clustering enhances clustering robustness, quality, and practical applicability.

Key finding: Proposes FEAC-Stream, a fast evolutionary clustering algorithm that uses the Page-Hinkley test to detect quality degradation in data stream partitions and triggers re-estimation of the number of clusters k online. This method... Read more
Key finding: Introduces a multi-objective automatic clustering (MOAC) evolutionary framework that generates clusterings with different k values simultaneously, guided by internal cluster validity indices (CVIs) such as Silhouette,... Read more
Key finding: Presents AEC-RAM, an auto-evolving evolutionary clustering algorithm that starts with all data points in a single cluster and iteratively splits clusters by migrating 'flabby' items (outliers with low fitness) into new... Read more
Key finding: Provides a comprehensive survey highlighting approaches that handle fixed or variable cluster numbers within evolutionary clustering, including context-sensitive and guided operators for automatic cluster number adaptation.... Read more

2. What are effective decision-making strategies for selecting final solutions in evolutionary multi-objective clustering?

This theme addresses the crucial challenge of choosing a single best clustering solution from the set of nondominated trade-off solutions yielded by multi-objective evolutionary clustering (EMC) algorithms. Since EMC produces multiple Pareto-optimal partitions balancing conflicting criteria (e.g., cohesion vs separation), proper decision-making methods must evaluate and rank these solutions based on problem-specific context and quality. Research in this area explores machine learning-based decision-making, consensus methods, and geometric approaches to improve robustness and generalization, enabling practical application of EMC.

Key finding: Proposes framing the EMC decision-making stage as a supervised learning problem, building predictive models that estimate the quality of candidate solutions based on their characteristics and context within the Pareto front.... Read more
Key finding: Introduces the Clustering Pareto Evolutionary Algorithm (CPEA) that maintains multiple local Pareto-optimal fronts simultaneously by clustering solutions in variable space, thus preserving diverse local optima in... Read more

3. How do hybrid and nature-inspired evolutionary algorithms enhance clustering quality and overcome classical clustering limitations?

This research area investigates combining evolutionary algorithms (EAs), such as genetic algorithms (GAs), swarm intelligence, and black hole algorithms, with classical clustering methods like K-means to boost clustering optimization. Hybrid approaches address common issues of classical techniques (e.g., local optima, initial centroid sensitivity, fixed k) by leveraging heuristic global search and bio-inspired operators to improve convergence, solution quality, and applicability to complex or large-scale data, including image segmentation and high-dimensional data. This theme bridges optimization theory with practical applications in unsupervised learning.

Key finding: Develops a hybrid algorithm integrating GAs with K-means to overcome K-means' sensitivity to initial centroids and empty clusters. Experimental results demonstrate that the hybrid effectively avoids local optima by exploring... Read more
Key finding: Proposes a multi-population version of the Black Hole Algorithm (MBHA) that mitigates original BHA's exploration weakness by maintaining and evolving multiple best candidate solutions. MBHA achieves superior convergence and... Read more
Key finding: Evaluates multiple evolutionary algorithms (ABC, GSA, CS, JADE, DSA, BSA) alongside classical clustering methods (K-means, FCM, SOM) on image clustering tasks. Results highlight that evolutionary algorithms generate more... Read more
Key finding: Proposes Multi-K-means (MK-means), a parallel clustering method combining multi-agent systems with K-means to improve accuracy and manage large data volumes efficiently. Agents observe cluster boundaries and coordinate... Read more

All papers in Evolutionary Clustering

The proliferation of smart devices in the Internet of Things (IoT) networks creates significant security challenges for the communications between such devices. Blockchain is a decentralized and distributed technology that can potentially... more
In dealing with big data, we need effective algorithms; effectiveness that depends, among others, on the ability to remove outliers from the data set, especially when dealing with classification problems. To this aim, support vector... more
The proliferation of smart devices in the Internet of Things (IoT) networks creates significant security challenges for the communications between such devices. Blockchain is a decentralized and distributed technology that can potentially... more
In dealing with big data, we need effective algorithms; effectiveness that depends, among others, on the ability to remove outliers from the data set, especially when dealing with classification problems. To this aim, support vector... more
In dealing with big data, we need effective algorithms; effectiveness that depends, among others, on the ability to remove outliers from the data set, especially when dealing with classification problems. To this aim, support vector... more
In this paper we attempt to solve an automatic clustering problem by optimizing multiple objectives such as automatic k-determination and a set of cluster validity indices concurrently. The proposed automatic clustering technique uses the... more
Identifying strongly associated clusters in large complex networks has received an increased amount of interest since the past decade. The problem of community detection in complex networks is an NP complete problem that necessitates the... more
Community detection in social networks has become a dominating topic of the current data research as it has a direct implication in many different areas of importance, whether social network, citation network, traffic network, metabolic... more
Community detection in social networks has become a dominating topic of the current data research as it has a direct implication in many different areas of importance, whether social network, citation network, traffic network, metabolic... more
Community detection in social networks has become a dominating topic of the current data research as it has a direct implication in many different areas of importance, whether social network, citation network, traffic network, metabolic... more
Community detection in social networks has become a dominating topic of the current data research as it has a direct implication in many different areas of importance, whether social network, citation network, traffic network, metabolic... more
In recent years, Evolutionary clustering is an evolving research area in data mining. The evolution diagnosis of any homogeneous as well as heterogeneous network will provide an overall view about the network. Applications of evolutionary... more
In this paper we attempt to solve an automatic clustering problem by optimizing multiple objectives such as automatic k-determination and a set of cluster validity indices concurrently. The proposed automatic clustering technique uses the... more
Discovering communities in time-varying social networks is one of the highly challenging area of research and researchers are welcome to propose new models for this domain. The issue is more problematic when overlapping structure of... more
A multi-mode network typically consists of multiple heterogeneous social actors among which various types of interactions could occur. Identifying communities in a multi-mode network can help understand the structural properties of the... more
In recent years, Evolutionary clustering is an evolving research area in data mining. The evolution diagnosis of any homogeneous as well as heterogeneous network will provide an overall view about the network. Applications of evolutionary... more
Abstract A multi-mode network typically consists of multiple heterogeneous social actors among which various types of interactions could occur. Identifying communities in a multi-mode network can help understand the structural properties... more
Download research papers for free!