AutoClass: A Bayesian Classification System

1988, Elsevier eBooks

Abstract

This paper describes AutoClass II, a program for automatically discovering (inducing) classes from a database, based on a Bayesian statistical technique which automatically determines the most probable number of classes, their probabilistic descriptions, and the probability that each object is a member of each class. AutoClass has been tested on several large, real databases and has discovered previously unsuspected classes. There is no doubt that these classes represent new phenomena.
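
The phrase "the probability that each object is a member of each class" can be illustrated with a minimal sketch of Bayes' rule applied to a mixture of classes. This is not the AutoClass implementation. It assumes a single real-valued attribute, Gaussian class descriptions, and a fixed number of classes, and the function names and parameter values are hypothetical.

```python
import math


def gaussian_pdf(x, mean, std):
    """Density of a normal distribution with the given mean and std at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def membership_probabilities(x, classes):
    """Return P(class | x) for every class using Bayes' rule.

    `classes` is a list of (weight, mean, std) tuples, where `weight` is the
    prior probability of the class. This layout is an assumption made for
    illustration only.
    """
    # Joint probability of the class and the observation: prior * likelihood.
    joint = [w * gaussian_pdf(x, m, s) for (w, m, s) in classes]
    total = sum(joint)
    # Normalizing makes the memberships sum to 1 across classes.
    return [j / total for j in joint]


# Two hypothetical, equally likely classes centered at 0 and 4.
classes = [(0.5, 0.0, 1.0), (0.5, 4.0, 1.0)]
midpoint = membership_probabilities(2.0, classes)
near_first = membership_probabilities(0.0, classes)
```

An object halfway between the two class means is assigned equally to both, while an object at the first class's mean belongs to that class almost entirely. Memberships are soft: every object has some probability of belonging to each class rather than a single hard assignment.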
