Academia.eduAcademia.edu

Em Clustering

description8 papers
group1 follower
lightbulbAbout this topic
EM Clustering, or Expectation-Maximization Clustering, is a statistical technique used for finding clusters in data by iteratively estimating the parameters of a probabilistic model. It alternates between assigning data points to clusters based on current parameter estimates (Expectation step) and updating the parameters to maximize the likelihood of the data (Maximization step).
lightbulbAbout this topic
EM Clustering, or Expectation-Maximization Clustering, is a statistical technique used for finding clusters in data by iteratively estimating the parameters of a probabilistic model. It alternates between assigning data points to clusters based on current parameter estimates (Expectation step) and updating the parameters to maximize the likelihood of the data (Maximization step).
Nowadays, financial data analysis is becoming increasingly important in the business market. As companies collect more and more data from daily operations, they expect to extract useful knowledge from existing collected data to help make... more
A performance evaluation of four different clustering techniques was carried out based on segmenting consumer by product type and by product usage in the research. Cobweb, DBSCAN, EM and k-means algorithms were evaluated based on the... more
Classification of KDD intrusion dataset is proposed along with noise reduction, clustering and feature selection. DBSCAN algorithm has been applied to reduce noise present in KDD dataset. After noise removal genetic search approach is... more
Clustering is a data mining problem of dividing documents into groups, such that documents in one group are more similar than those in other groups. The aim of this study is to propose a framework for comparing the accuracy of clustering... more
Clustering is a data mining problem of dividing documents into groups, such that documents in one group are more similar than those in other groups. The aim of this study is to propose a framework for comparing the accuracy of clustering... more
An automatic method to combine several local surrogate models is presented. This method is intended to build accurate and smooth approximation of discontinuous functions that are to be used in structural optimization problems. It strongly... more
Semi-supervised data classification is one of significant field of study in machine learning and data mining since it deals with datasets which consists both a few labeled and many unlabeled data. The researchers have interest in this... more
Detection of abnormal events in a traffic scene is a highly challenging task due to vast field of view, continuous stream of video data, various object interactions and complex events in Video Surveillance. Hence, this research proposes... more
Anomaly detection concerns identifying anomalous observations or patterns that are a deviation from the dataset's expected behaviour. The detection of anomalies has significant and practical applications in several industrial domains... more
Human movement anomalies in indoor spaces commonly involve urgent situations, such as security threats, accidents, and fires. This paper proposes a two-phase framework for detecting indoor human trajectory anomalies based on density-based... more
An automatic method to combine several local surrogate models is presented. This method is intended to build accurate and smooth approximation of discontinuous functions that are to be used in structural optimization problems. It strongly... more
Clustering is a data mining problem of dividing documents into groups, such that documents in one group are more similar than those in other groups. The aim of this study is to propose a framework for comparing the accuracy of clustering... more
One way for enterprises to be successful in today's challenging market is to be agile and be flexible to handle market changes. Using a conceptual and operational framework for improving the enterprise and keeping their desired situation,... more
One way for enterprises to be successful in today's challenging market is to be agile and be flexible to handle market changes. Using a conceptual and operational framework for improving the enterprise and keeping their desired situation,... more
Data mining is a method to mine valuable hidden knowledge, patterns and associations from massive and sparse datasets. This process proceeds through various techniques e.g. classification, clustering and association etc. Clustering is an... more
The term internet of things is a buzz word these days and as per Google survey conducted recently, it has even dominated the buzz word big data predominantly. However, IoT area is still not matured and is throwing light on lot of research... more
The term internet of things is a buzz word these days and as per Google survey conducted recently, it has even dominated the buzz word big data predominantly. However, IoT area is still not matured and is throwing light on lot of research... more
Reliable network traffic classification is essential to management and security tasks. Therefore, it is beneficial to analyze and improve existing techniques. Some of the most traditional methodologies for traffic classification are based... more
Semi-supervised data classification is one of significant field of study in machine learning and data mining since it deals with datasets which consists both a few labeled and many unlabeled data. The researchers have interest in this... more
Clustering is a data mining problem of dividing documents into groups, such that documents in one group are more similar than those in other groups. The aim of this study is to propose a framework for comparing the accuracy of clustering... more
Anomaly detection concerns identifying anomalous observations or patterns that are a deviation from the dataset's expected behaviour. The detection of anomalies has significant and practical applications in several industrial domains... more
Precision agriculture is a new approach to farming in which environmental characteristics at a sub-field level are used to guide crop production decisions. Instead of applying management actions and production inputs uniformly across... more
The term internet of things is a buzz word these days and as per Google survey conducted recently, it has even dominated the buzz word big data predominantly. However, IoT area is still not matured and is throwing light on lot of research... more
Traffic modeling and classification find importance in many areas such as bandwidth management, traffic analysis, prediction and engineering, network planning, Quality of Service provisioning and anomalous traffic detection. Much of... more
Anomaly detection is a concept widely applied to numerous domains. Several techniques of anomaly detection have been developed over the years, in practice as well as research. The application of this concept has extended to diverse areas,... more
The term Data Mining is used to refer the process of analyzing large datasets and then extracting the knowledge from the data. In today's world data mining has become very essential in almost every area such as market segmentation,... more
The task of network administrators to identify and determine the type of traffic traversing through the network is very critical to the rapid growth of new traffic each day. As the requirements of networks change over time, the situation... more
The last decade has seen an explosive growth in the generation and collection of data. In the field of data mining there are various techniques are used to extract useful information from the data set. There is various estimation... more
Abstract: Accurate identification and classification of network traffic according to the application that generated them is at the basis of any modern network management platform. Nowadays, many P2P applications using dynamic port... more
Traffic modeling and classification find importance in many areas such as bandwidth management, traffic analysis, prediction and engineering, network planning, Quality of Service provisioning and anomalous traffic detection. Much of... more
This paper proposes a real-time and self-taught anomaly detection scheme for optical networks using hybrid unsupervised/supervised learning. Evaluations with an experimental dataset demonstrate that the proposed scheme can successfully... more
Emergence of modern techniques for scientific data collection has resulted in large scale accumulation of data pertaining to diverse fields. Generally, data mining (sometimes called data or knowledge discovery) is the process of analyzing... more
Network Traffic Classification is an important process in various network management activities like network planning, designing, workload characterization etc. Network traffic classification using traditional techniques such as well... more
Text classification method that uses efficient similarity measures to achieve better performance is being proposed in this paper. Semi-supervised clustering is used as a complementary step to text classification and is used to identify... more
Mobile telecommunication sector has been accelerated with GSM 1800 licenses in the Turkey. Since then, churn management has won vital importance for the GSM operators. Customers should have segmented according to their profitability for... more
Today maritime transportation represents 90% of international trade volume and there are more than 50,000 vessels sailing the ocean every day. Therefore, reducing maritime transportation security risks by systematically modelling and... more
The main aim of this paper focuses on the K-means and the DBSCAN clustering techniques to identify customer behaviors and group the customers in order to improve the customer service and to remove the outliers that are existing in the... more
The fundamental objective behind any network intrusion detection system is to automate the detection process whenever intrusions occur in the network. The problem of the network anomaly detection is to determine, if the network incoming... more
Information overload has raggedly increased as a result of the advances in the aspect of storage capabilities and data collection in previous years. The growth seen in the number of observation has partly cause a collapse in analytical... more
Mobile telecommunication sector has been accelerated with GSM 1800 licenses in the Turkey. Since then, churn management has won vital importance for the GSM operators. Customers should have segmented according to their profitability for... more
Mobile telecommunication sector has been accelerated with GSM 1800 licenses in the Turkey. Since then, churn management has won vital importance for the GSM operators. Customers should have segmented according to their profitability for... more
Because of the increased popularity and fast expansion of the Internet as well as Internet of things, networks are growing rapidly in every corner of the society. As a result, huge amount of data is travelling across the computer networks... more
To detect attacks from IOT[1] and big data application using data mining techniques. Now-a-days internet can be access from anywhere using small devices such as smart phones, sensors [4] and other wearable devices etc. Always these... more
With the advancement of technology and communication system, use of internet is giving at a tremendous role. This causes an exponential growth of data and traffic over the internet. So to correctly classify this traffic is a hot research... more
Data mining is the method of extracting the data from large database. Various data mining techniques are clustering, classification, association analysis, regression, summarization, time series analysis and sequence analysis, etc.... more
Clustering is a data mining problem of dividing documents into groups, such that documents in one group are more similar than those in other groups. The aim of this study is to propose a framework for comparing the accuracy of clustering... more
In recent years, customer segmentation has become one of the most significant and useful tools for e-commerce. It plays a vital role in online product recommendation system and also helps to understand local and global wholesale or retail... more
In recent years, customer segmentation has become one of the most significant and useful tools for e-commerce. It plays a vital role in online product recommendation system and also helps to understand local and global wholesale or retail... more
Download research papers for free!