Academia.eduAcademia.edu

Bayesian Classification

description475 papers
group2 followers
lightbulbAbout this topic
Bayesian Classification is a statistical method that applies Bayes' theorem to classify data points into categories based on prior knowledge and evidence. It calculates the probability of each class given the features of the data, allowing for the incorporation of uncertainty and prior distributions in the classification process.
lightbulbAbout this topic
Bayesian Classification is a statistical method that applies Bayes' theorem to classify data points into categories based on prior knowledge and evidence. It calculates the probability of each class given the features of the data, allowing for the incorporation of uncertainty and prior distributions in the classification process.

Key research themes

1. How can Bayesian classifiers be adapted to structured and complex data representations beyond naive Bayes assumptions?

This research area focuses on extending the classical naive Bayesian classifier, which assumes attribute independence given the class, to handle structured data where dependencies exist among features or when data are represented in richer logical or relational forms. Such adaptations seek to exploit relational information present in structured datasets such as molecular structures, text, or images, while maintaining computational feasibility and classification accuracy.

Key finding: Introduced two systems, 1BC and 1BC2, which extend naive Bayesian classifiers to structured data by either projecting individuals onto first-order features built using structural predicates or modeling probability... Read more
Key finding: Addressed computational complexity in learning Bayesian network structures beyond naive Bayes by proposing a clustering-based approach upstream of structure learning algorithms like K2, PC, and Greedy Search. This reduction... Read more
Key finding: Presented AutoClass, an implementation of Bayesian finite mixture modeling for unsupervised classification, which computes probabilistic class descriptions without assuming attribute independence. AutoClass simultaneously... Read more

2. What are effective Bayesian network learning methods for classification, including structure and parameter optimization under uncertainty?

This area explores the development and optimization of Bayesian networks as classifiers, emphasizing methods for learning both network structure and parameters from data, handling incomplete or missing values, and integrating prior knowledge with observed data. Efficient algorithms to learn network structures that represent variable dependencies allow improved probabilistic modeling and classification performance over naive Bayes.

Key finding: Provided comprehensive methods for constructing Bayesian networks for classification and data mining, discussing prior knowledge integration, parameter and structure learning (including with incomplete data), algorithms for... Read more
Key finding: Besides addressing attribute dependencies, the work detailed the challenges of computational complexity in learning Bayesian network structures for classifiers. It proposed an upstream clustering strategy to reduce the search... Read more
Key finding: Detailed the theoretical foundations and practical algorithms for Bayesian network learning, including structure and parameter estimation using Bayesian statistics, methods for exact inference (junction trees), quality... Read more

3. How do advanced Bayesian classification methods perform in real-world biomedical applications, and how can Bayesian optimization improve classifier tuning?

This theme investigates the application of Bayesian classifiers to biomedical domains such as disease diagnosis including Alzheimer's and Parkinson's diseases and microbiome-based classification. It includes adapting naive Bayes and Dirichlet-multinomial models to high-dimensional, noisy clinical data, leveraging Bayesian optimization for hyperparameter tuning, and comparing Bayesian approaches to other statistical and machine learning techniques to improve diagnostic accuracy and interpretability.

Key finding: Applied naive Bayesian classifiers on neuropsychological test data (10/66 battery) collected from 466 subjects to diagnose Alzheimer's disease, demonstrating the feasibility of naive Bayes for clinical diagnostic tasks... Read more
Key finding: Developed a hybrid Bayesian Optimization-Support Vector Machine method to optimize hyperparameters across several machine learning models (including Naive Bayes and Logistic Regression) applied to Parkinson's disease... Read more
Key finding: Proposed the Dirichlet-multinomial Bayes classifier (DMBC) to model microbial composition data for disease diagnosis, effectively handling overdispersion and high-dimensionality in microbiome counts. Demonstrated superior or... Read more

All papers in Bayesian Classification

Bayesian network based classifiers are only able to handle discrete variables. They assume that variables are sampled from a multinomial distribution and most real-world domains involves continuous variables. A common practice to deal... more
We present a system that is capable of segmenting, detecting and tracking multiple people in a cluttered scene using multiple synchronized cameras located far from each other. The system improves upon existing systems in many ways... more
In this paper, the use of Independent Component Discriminant Analysis (ICDA) for remote sensing classification is proposed. ICDA is a non-parametric method for discriminant analysis based on the application of a Bayesian classification... more
In this paper, the use of Independent Component (IC) Discriminant Analysis (ICDA) for remote sensing classification is proposed. ICDA is a nonparametric method for discriminant analysis based on the application of a Bayesian... more
In this paper, the use of Independent Component Discriminant Analysis (ICDA) for remote sensing classification is proposed. ICDA is a non-parametric method for discriminant analysis based on the application of a Bayesian classification... more
This paper proposes the use of computed tomography (CT) as a reference method for estimating the lean meat percentage (LMP) of pig carcasses. The current reference is manual dissection which has a limited accuracy due to variability... more
We developed a classification approach to multiple quantitative trait loci (QTL) mapping built upon a Bayesian framework that incorporates the important prior information that most genotypic markers are not cotransmitted with a QTL or... more
This paper introduces a evolutionary computation method that applies Bayesian classifiers to optimization problems. This approach is based on Estimation of Distribution Algorithms (EDAs) in which Bayesian or Gaussian networks are applied... more
Evolutionary computation is a discipline that has been emerging for at least 40 or 50 years. All methods within this discipline are characterized by maintaining a set of possible solutions (individuals) to make them successively evolve to... more
We present a rapid method for the identification of viruses using microfluidic chip gel electrophoresis (CGE) of highcopy number proteins to generate unique protein profiles. Viral proteins are solubilized by heating at 95°C in borate... more
Summary We analyse data from a study involving 173 pregnant women. The data are observed values of the β human chorionic gonadotropin hormone measured during the first 80 days of gestational age, including from one up to six longitudinal... more
Question: Is it possible to mathematically classify relevés into vegetation types on the basis of their average indicator values, including the uncertainty of the classification?Location: The Netherlands.Method: A large relevé database... more
A wide panoply of machine learning methods is available for application to the Predictive Toxicology Evaluation (PTE) problem. The authors have built four monolithic classification systems based on Tilde, Progol, C4.5 and naive bayesian... more
This paper proposes a supervised multiscale Bayesian texture classifier. The classifier exploits the dual-tree complex wavelet transform (DT-CWT) to obtain complex-valued multiscale representations of training texture samples for each... more
The medical automatic annotation task issued by the cross language evaluation forum (CLEF) aims at a fair comparison of state-of-the art algorithms for medical content-based image retrieval (CBIR). The contribution of this work is... more
Methods for predicting protein function from structure are becoming more important as the rate at which structures are solved increases more rapidly than experimental knowledge. As a result, protein structures now frequently lack... more
This paper addresses the use of multichannel signal processing methods in analysis of heart rate changes during cycling using the global positioning system (GPS) to record the route conditions. The main objectives of this work are in... more
We present a rapid method for the identification of viruses using microfluidic chip gel electrophoresis (CGE) of highcopy number proteins to generate unique protein profiles. Viral proteins are solubilized by heating at 95°C in borate... more
In this paper, a real time application for visual inspection and classification of cork stoppers is presented. Each cork stopper is represented by a high dimensional set of characteristics corresponding to relevant visual features. We... more
Summary Precise classification of tumours is critical for the diagnosis and treatment of cancer. Diagnostic pathology has traditionally relied on macroscopic and microscopic histology and tumour morphology as the basis for the... more
This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY
This paper deals with new methods capable of solving the optimization problem concerning the allocation of DNA samples in plates in order to carry out the DNA sequencing with the Sanger technique. These methods make it possible to work... more
The conventional seismic inversion approach is practical for operational work, as it only uses simple linearized algorithms and assumptions, but may be less applicable when dealing with a complex geological setting, especially in the... more
In order to obtain potentially interesting patterns and relations from large, distributed, heterogeneous databases, it is essential to employ an intelligent and automated KDD (Knowledge Discovery in Databases) process. One of the most... more
In order to obtain potentially interesting patterns and relations from large, distributed, heterogeneous databases, it is essential to employ an intelligent and automated KDD (Knowledge Discovery in Databases) process. One of the most... more
Properly organizing knowledge so that it can be managed often requires the acquisition of patterns and relations from large, distributed, heterogeneous databases. The employment of an intelligent and automated KDD (Knowledge Discovery in... more
Positional accuracy of geographic objects is treated in this contribution. Two main sources of uncertainty that influences the positional accuracy are considered: • uncertainty of classification, • uncertainty of georeferencing (image... more
Risk approaches …………………………………………………………… EU Water Framework Directive (WFD) approach ……………….. International conservation status classification (IUCN) approach ... Key environmental correlates ……………………………………………... Physical variables... more
In this paper, we present the theoretical foundation for optimal classification using class-specific features and provide examples of its use. A new probability density function (PDF) projection theorem makes it possible to project... more
In this paper, we present a novel method for classification of cancer cell line images using complex waveletbased region covariance matrix descriptors. Microscopic images containing irregular carcinoma cell patterns are represented by... more
Eastern offshore basin (Fig. 1) is one of the most promising deep water block with several discoveries. After extensive exploration, it is now under development phase. Reservoirs in this area are sands within Godavari clay of... more
This paper describes AutoClass H, a program for automatically discovering (inducing) classes from a database, based on a Bayesian statistical technique which automatically determines the most probable number of classes, their... more
We describe AutoClass, an approach to unsupervised classi cation based upon the classical mixture model, supplemented by a B a yesian method for determining the optimal classes. We include a moderately detailed exposition of the... more
The task of inferring a set of classes and class descriptions most likely to explain a given data set can be placed on a firm theoretical foundation using Bayesian statistics. Within this framework, and using various mathematical and... more
^33_> \43a4`1 0> b1 [31 93[c[> d[\@[e]0[\f33_>\4> \[f5]e0^331 > @fbg1 0h > i[> ]\^32d[\@ej]k2_ lmnopqr st u v-wx'xx)-(.w,) (y0z.-{.z|xx|-}) .z) 1~x'-/. ,|zx.-.y,~.wx)-~1./~wx'zz|-./-)-wx/1x/-}) .,.) z~-w~wx0-{/~-{/.01x'zz)-0},) 6-v~... more
Manet security has a lot of open issues. Due to its characteristics, this kind of network needs preventive and corrective protection. In this paper, we focus on corrective protection proposing an anomaly IDS model for Manet. The design... more
The widely used Bayesian classifier is based on the assumption of equal prior probabilities for all the classes. However, inclusion of equal prior probabilities may not guarantee high classification accuracy for the individual classes.... more
Question: Is it possible to mathematically classify relevés into vegetation types on the basis of their average indicator values, including the uncertainty of the classification?Location: The Netherlands.Method: A large relevé database... more
Many approaches to image restoration are aimed at removing either Gaussian or uniform impulsive noise. This is because both types of degradation processes are distinct in nature, and hence they are easier to manage when considered... more
This study analyzes the PMB data in 2016. The PMB process at XYZ University has several paths, namely Regular, Transfer, and Bidikmisi. When the PMB process difficulties are encountered one of them is the number of prospective students... more
Data Mining refers to using a variety of techniques to identify suggest of information or decision making knowledge in the database and extracting these in a way that they can put to use in areas such as decision support, predictions,... more
This final report summarizes the kind of processing support provided to four Kansas ERTS investigations. Details of the processing results may be found in the respective final reports of these investigations.
Feature representation is a very important factor that has great effect on the performance of speech recognition systems. In this paper we focus on a feature generation process that is based on linear transformation of the original... more
An effective mask estimation scheme for missing-feature reconstruction is described that achieves robust speech recognition in the presence of unknown noise. In previous work on Bayesian classification for mask estimation, white noise and... more
Download research papers for free!