AutoClass: A Bayesian Classification System
1988, Elsevier eBooks
Abstract
This paper describes AutoClass II, a program for automatically discovering (inducing) classes from a database, based on a Bayesian statistical technique which automatically determines the most probable number of classes, their probabilistic descriptions, and the probability that each object is a member of each class. AutoClass has been tested on several large, real databases and has discovered previously unsuspected classes. There is no doubt that these classes represent new phenomena.
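The abstract describes the approach only at a high level. As a hedged illustration of the underlying idea, the sketch below fits a finite mixture model with EM and reads off the probability that each object belongs to each class; it is a minimal stand-in, not AutoClass itself, and it omits the Bayesian determination of the number of classes that the abstract highlights. All names and data in it are illustrative.

```python
# A minimal sketch (not the AutoClass implementation) of the core idea:
# model the data as a finite mixture, fit it with EM, and read off the
# probability that each object belongs to each class. Class-count selection
# and the full Bayesian priors used by AutoClass are omitted here.
import numpy as np

def fit_gaussian_mixture(x, n_classes, n_iter=100, seed=0):
    """Fit a 1-D Gaussian mixture with EM; return weights, means, variances,
    and the per-object class-membership probabilities (responsibilities)."""
    rng = np.random.default_rng(seed)
    weights = np.full(n_classes, 1.0 / n_classes)
    means = rng.choice(x, size=n_classes, replace=False)
    variances = np.full(n_classes, np.var(x))

    for _ in range(n_iter):
        # E-step: P(class k | x_i) up to normalization
        resp = np.zeros((len(x), n_classes))
        for k in range(n_classes):
            norm = 1.0 / np.sqrt(2.0 * np.pi * variances[k])
            resp[:, k] = weights[k] * norm * np.exp(-(x - means[k]) ** 2 / (2.0 * variances[k]))
        resp /= resp.sum(axis=1, keepdims=True)

        # M-step: re-estimate mixture parameters from the responsibilities
        nk = resp.sum(axis=0)
        weights = nk / len(x)
        means = (resp * x[:, None]).sum(axis=0) / nk
        variances = (resp * (x[:, None] - means) ** 2).sum(axis=0) / nk + 1e-9

    return weights, means, variances, resp

# Toy usage: two well-separated clusters
data = np.concatenate([np.random.normal(0, 1, 200), np.random.normal(5, 1, 200)])
w, mu, var, membership = fit_gaussian_mixture(data, n_classes=2)
print(mu)              # approximate class centers
print(membership[:3])  # probability of each of the first 3 objects per class
```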
Related papers
In this paper, we demonstrate how semantic categories of images can be learnt from their color distributions using an effective probabilistic approach. Many previous probabilistic approaches are based on the naïve Bayes classifier, which assumes independence among attributes and models each attribute with a single Gaussian distribution. We use a derivative of the naïve Bayesian classifier, called the Flexible Bayesian classifier, which abandons the normality assumption to better represent the image data. This approach is shown to yield high accuracy on image database classification compared to its counterpart, the naïve Bayesian classifier, and the widely used K-Nearest Neighbor classifier.
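As a hedged sketch of the idea described above (assuming the usual kernel-density reading of a "flexible" Bayes classifier), the snippet below keeps the naive-Bayes independence assumption but replaces the single Gaussian per attribute with a kernel density estimate; the features and data are illustrative, not the paper's image descriptors.

```python
# A hedged sketch: keep the naive-Bayes independence assumption, but replace
# the single-Gaussian class-conditional density of each attribute with a
# kernel density estimate (one Gaussian kernel per training point).
import numpy as np

def gaussian_kernel_density(value, samples, bandwidth):
    """Average of Gaussian kernels centered at the training samples."""
    z = (value - samples) / bandwidth
    return np.mean(np.exp(-0.5 * z ** 2) / (bandwidth * np.sqrt(2.0 * np.pi)))

def flexible_bayes_predict(x, train_x, train_y, bandwidth=0.5):
    """Pick the class maximizing prior * product of per-attribute KDEs."""
    classes = np.unique(train_y)
    best_class, best_score = None, -np.inf
    for c in classes:
        members = train_x[train_y == c]
        score = np.log(len(members) / len(train_x))        # log prior
        for j in range(train_x.shape[1]):                   # independence over attributes
            score += np.log(gaussian_kernel_density(x[j], members[:, j], bandwidth) + 1e-12)
        if score > best_score:
            best_class, best_score = c, score
    return best_class

# Toy usage with two 2-D color-feature clusters (illustrative data only)
rng = np.random.default_rng(1)
train_x = np.vstack([rng.normal(0, 1, (50, 2)), rng.normal(4, 1, (50, 2))])
train_y = np.array([0] * 50 + [1] * 50)
print(flexible_bayes_predict(np.array([3.8, 4.2]), train_x, train_y))  # expect class 1
```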
Proceedings of the 12th International …, 1991
The task of inferring a set of classes and class descriptions most likely to explain a given data set can be placed on a firm theoretical foundation using Bayesian statistics. Within this framework, and using various mathematical and algorithmic approximations, the AutoClass system searches for the most probable classifications, automatically choosing the number of classes and the complexity of the class descriptions. Simpler versions of AutoClass have been applied to many large real data sets, have discovered new, independently verified phenomena, and have been released as a robust software package. Recent extensions allow attributes to be selectively correlated within particular classes, and allow classes to inherit, or share, model parameters through a class hierarchy.
IEEE Transactions on Knowledge and Data Engineering, 2000
A promising approach to Bayesian classification is based on exploiting frequent patterns, i.e., patterns that frequently occur in the training dataset, to estimate the Bayesian probability. Pattern-based Bayesian classification focuses on building and evaluating reliable probability approximations by exploiting a subset of frequent patterns tailored to a given test case. This paper proposes a novel and effective approach to estimating the Bayesian probability. Differently from previous approaches, the Entropy-based Bayesian classifier (EnBay) focuses on selecting, by means of an entropy-based evaluator, the minimal set of long, non-overlapping patterns that best complies with a conditional-independence model. Furthermore, the probability approximation is tailored separately to each class. An extensive experimental evaluation, performed on both real and synthetic datasets, shows that EnBay is significantly more accurate than most state-of-the-art classifiers, Bayesian and otherwise.
Pattern Recognition Letters, 1999
In this paper, an approach to study the nature of the classification models induced by Machine Learning algorithms is proposed. Instead of the predictive accuracy, the values of the predicted class labels are used to characterize the classification models. Over these predicted class labels, Bayesian networks are induced. Using these Bayesian networks, several assertions are extracted about the nature of the classification models induced by Machine Learning algorithms.
1995
In this paper we present a novel induction algorithm for Bayesian networks. This selective Bayesian network classifier selects a subset of attributes that maximizes predictive accuracy prior to the network learning phase, thereby learning Bayesian networks with a bias for small, high-predictive-accuracy networks. We compare the performance of this classifier with selective and non-selective naive Bayesian classifiers. We show that the selective Bayesian network classifier performs significantly better than both versions of the naive Bayesian classifier on almost all databases analyzed, and hence is an enhancement of the naive Bayesian classifier. Relative to the non-selective Bayesian network classifier, our selective Bayesian network classifier generates networks that are computationally simpler to evaluate and that display predictive accuracy comparable to that of Bayesian networks which model all features.
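The abstract does not spell out the selection procedure; the sketch below illustrates only a generic wrapper-style attribute-selection step (greedy forward selection on holdout accuracy), with a Gaussian naive Bayes scorer standing in for the paper's Bayesian-network learner. Function names and data are illustrative.

```python
# A hedged sketch of the attribute-selection idea only: greedily add the
# attribute that most improves holdout accuracy, then learn the final model
# on the selected subset. The scorer below is a stand-in (Gaussian naive
# Bayes), not the Bayesian-network learner used in the paper.
import numpy as np

def nb_accuracy(train_x, train_y, test_x, test_y):
    """Holdout accuracy of a Gaussian naive Bayes model on the given columns."""
    classes = np.unique(train_y)
    correct = 0
    for x, y in zip(test_x, test_y):
        best_c, best_s = None, -np.inf
        for c in classes:
            m = train_x[train_y == c]
            mu, var = m.mean(axis=0), m.var(axis=0) + 1e-6
            s = np.log(len(m) / len(train_x)) - 0.5 * np.sum(
                np.log(2 * np.pi * var) + (x - mu) ** 2 / var)
            if s > best_s:
                best_c, best_s = c, s
        correct += int(best_c == y)
    return correct / len(test_y)

def select_attributes(train_x, train_y, test_x, test_y):
    """Greedy forward selection of the attribute subset maximizing accuracy."""
    selected, best_acc = [], 0.0
    remaining = list(range(train_x.shape[1]))
    improved = True
    while improved and remaining:
        improved = False
        for j in list(remaining):
            cols = selected + [j]
            acc = nb_accuracy(train_x[:, cols], train_y, test_x[:, cols], test_y)
            if acc > best_acc:
                best_acc, best_j, improved = acc, j, True
        if improved:
            selected.append(best_j)
            remaining.remove(best_j)
    return selected, best_acc

# Toy usage: attribute 0 is informative, attribute 1 is noise (illustrative data)
rng = np.random.default_rng(2)
x = np.hstack([np.concatenate([rng.normal(0, 1, 60), rng.normal(3, 1, 60)])[:, None],
               rng.normal(0, 1, (120, 1))])
y = np.array([0] * 60 + [1] * 60)
print(select_attributes(x[::2], y[::2], x[1::2], y[1::2]))  # expect attribute 0 selected
```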
Classification is an important data mining technique that is used by many applications. Several types of classifiers have been described in the research literature; examples include decision tree classifiers, rule-based classifiers, and neural network classifiers. Another popular classification technique is naïve Bayesian classification, a probabilistic approach that uses Bayes' theorem to predict the classes of unclassified records. A drawback of naïve Bayesian classification is that every time a new data record is to be classified, the entire dataset needs to be scanned in order to apply the set of equations that perform the classification. Scanning the dataset is normally a very costly step, especially if the dataset is very large. To alleviate this problem, a new approach to naïve Bayesian classification is introduced in this study. In this approach, a set of classification rules is constructed on top of the naïve Bayesian classifier; hence we call this approach the Rule-based Naïve Bayesian Classifier (RNBC). In RNBC, the dataset is scanned only once, off-line, at the time of building the classification rule set; subsequent scanning of the dataset is avoided. Furthermore, this study introduces a simple three-step methodology for constructing the classification rule set.
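The three-step rule-construction methodology itself is not detailed in the abstract; the sketch below only illustrates the single-pass idea RNBC builds on: summarize the dataset once into per-class count tables and classify later records from those tables alone. All identifiers and data are illustrative.

```python
# A hedged sketch of the single-pass idea (not the paper's rule-construction
# methodology): scan the dataset once to build per-class count tables, then
# classify any later record from those tables, never touching the dataset again.
from collections import Counter, defaultdict

def summarize_once(records, labels):
    """One off-line pass: class counts and per-(attribute, value, class) counts."""
    class_counts = Counter(labels)
    value_counts = defaultdict(Counter)   # (attr_index, value) -> Counter over classes
    for record, label in zip(records, labels):
        for j, value in enumerate(record):
            value_counts[(j, value)][label] += 1
    return class_counts, value_counts

def classify(record, class_counts, value_counts):
    """Naive-Bayes decision using only the precomputed tables (Laplace smoothing)."""
    total = sum(class_counts.values())
    best_class, best_score = None, float("-inf")
    for c, n_c in class_counts.items():
        score = n_c / total
        for j, value in enumerate(record):
            score *= (value_counts[(j, value)][c] + 1) / (n_c + 2)
        if score > best_score:
            best_class, best_score = c, score
    return best_class

# Toy usage: categorical records (illustrative data only)
records = [("sunny", "hot"), ("sunny", "mild"), ("rain", "mild"), ("rain", "cool")]
labels = ["no", "no", "yes", "yes"]
tables = summarize_once(records, labels)
print(classify(("rain", "hot"), *tables))
```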
Systems, Man, and Cybernetics, Part B: …, 2002
In this paper, we address the problem of how to classify a set of query vectors that belong to the same unknown class. Sets of data known to be sampled from the same class are naturally available in many application domains, such as speaker recognition. We refer to these sets as homologous sets. We show how to take advantage of homologous sets in classification to obtain improved accuracy over classifying each query vector individually. Our method, called "homologous naive Bayes" (HNB), is based on the naive Bayes classifier, a simple algorithm shown to be effective in many application domains. HNB uses a modified classification procedure that classifies multiple instances as a single unit. Compared with a voting method and several other variants of naive Bayes classification, HNB significantly outperforms these methods on a variety of test data sets, even when the number of query vectors in the homologous sets is small. We also report a successful application of HNB to speaker recognition. Experimental results show that HNB can achieve classification accuracy comparable to the Gaussian mixture model, the most widely used speaker recognition approach, while using less time for both training and classification.
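As a hedged sketch of the "classify the set as a single unit" idea (not the paper's exact estimator), the snippet below sums per-vector naive-Bayes log-likelihoods over the homologous set and assigns one class to the whole set, rather than voting over per-vector decisions. The class parameters and data are illustrative.

```python
# A hedged sketch: sum the per-vector naive-Bayes log-likelihoods over the
# homologous set and pick the class once for the whole set, instead of
# voting over per-vector decisions. Parameter estimation is not shown.
import numpy as np

def class_log_likelihood(x, mu, var):
    """Log-likelihood of one vector under an independent-Gaussian class model."""
    return -0.5 * np.sum(np.log(2 * np.pi * var) + (x - mu) ** 2 / var)

def classify_homologous_set(query_set, class_params, log_priors):
    """Jointly assign the whole set of query vectors to a single class."""
    best_class, best_score = None, -np.inf
    for c, (mu, var) in class_params.items():
        score = log_priors[c] + sum(class_log_likelihood(x, mu, var) for x in query_set)
        if score > best_score:
            best_class, best_score = c, score
    return best_class

# Toy usage: two classes with illustrative Gaussian parameters
params = {"A": (np.array([0.0, 0.0]), np.array([1.0, 1.0])),
          "B": (np.array([3.0, 3.0]), np.array([1.0, 1.0]))}
priors = {"A": np.log(0.5), "B": np.log(0.5)}
queries = [np.array([2.5, 3.2]), np.array([3.4, 2.8]), np.array([2.9, 3.1])]
print(classify_homologous_set(queries, params, priors))  # expect "B"
```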
2011
Knowledge available through Semantic Web standards can easily be missing, generally because of the adoption of the Open World Assumption (i.e., the truth value of an assertion is not necessarily known). However, the rich relational structure that characterizes ontologies can be exploited to handle such missing knowledge in an explicit way. We present a Statistical Relational Learning system designed for learning terminological naïve Bayesian classifiers, which estimate the probability that a generic individual belongs to the target concept given its membership in a set of Description Logic concepts. During the learning process, we consistently handle the lack of knowledge that may be introduced by the adoption of the Open World Assumption, depending on the varying nature of the missing knowledge itself.
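The learning procedure itself is not given in the abstract; the sketch below illustrates only the open-world handling it describes, treating each concept membership as a three-valued feature (member, non-member, unknown) and letting unknown values contribute no evidence. The concept names and probabilities are illustrative, not the paper's.

```python
# A hedged sketch of the open-world idea only: unknown concept memberships
# (None) are skipped in the naive-Bayes product, so missing knowledge neither
# supports nor penalizes any class.
def classify_owa(individual, class_priors, cond_probs):
    """individual: dict concept -> True/False/None (None = unknown under OWA)."""
    best_class, best_score = None, float("-inf")
    for c, prior in class_priors.items():
        score = prior
        for concept, value in individual.items():
            if value is None:
                continue                     # unknown: contributes no evidence
            score *= cond_probs[c][(concept, value)]
        if score > best_score:
            best_class, best_score = c, score
    return best_class

# Illustrative probabilities for a target concept vs. its complement
priors = {"Target": 0.4, "NotTarget": 0.6}
cond = {
    "Target":    {("Person", True): 0.9, ("Person", False): 0.1,
                  ("WorksFor", True): 0.7, ("WorksFor", False): 0.3},
    "NotTarget": {("Person", True): 0.5, ("Person", False): 0.5,
                  ("WorksFor", True): 0.2, ("WorksFor", False): 0.8},
}
# Membership in "WorksFor" is unknown for this individual (open world)
print(classify_owa({"Person": True, "WorksFor": None}, priors, cond))
```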
2009
Data miners have access to a significant number of classifiers and use them on a variety of different types of dataset. This large selection makes it difficult to know which classifier will perform most effectively in any given case. Usually an understanding of learning algorithms is combined with detailed domain knowledge of the dataset at hand to lead to the choice of a classifier. We propose an empirical framework that quantitatively assesses the accuracy of a selection of classifiers on different datasets, resulting in a set of classification rules generated by the J48 decision tree algorithm. Data miners can follow these rules to select the most effective classifier for their work. By optimising the parameters used for learning and the sampling techniques applied, a set of rules were learned that select, with 78% accuracy, the most effective classifier.
The Naïve Bayesian Classifier and an Augmented Naïve Bayesian Classifier are applied to human classification tasks. The Naïve Bayesian Classifier is augmented with feature construction using a Galois lattice. The best features, measured on their within- and between-category overlap, are added to the category's concept description. The results show that space-efficient concept descriptions can predict much of the variance in the classification phenomena.
