[IJCST-V10I1P5]:C. Sunitha Ram, Swetha Gayathri Kuchimanchi

IJCST Eighth Sense Research Group

Outline

Title

[IJCST-V10I1P5]:C. Sunitha Ram, Swetha Gayathri Kuchimanchi

IJCST Eighth Sense Research Group

visibility

…

description

8 pages

link

1 file

Sign up for access to the world's latest research

checkGet notified about relevant papers

checkSave papers to use in your research

checkJoin the discussion with peers

checkTrack your impact

Abstract

Computational intelligence poses Several possibilities in Bioinformatics, particularly by generating low-cost, lowprecision, good solutions. Rough sets promise to open up an important dimension in this direction. The present article surveys the role of artificial neural networks, fuzzy sets and genetic algorithms, with particular emphasis on rough sets, in Bioinformatics. Since the work entails processing huge amounts of incomplete or ambiguous biological data, the knowledge reduction capability of rough sets, Learning ability of neural networks, uncertainty handling capacity of fuzzy sets and searching potential of genetic algorithms are synergistically utilized.

IAEME Publication

IAEME PUBLICATION, 2013

Microarray gene expression datasets are more preferred form of disease diagnostic system than morphology. But some of the real world gene expression dataset consists of numerous missing values due to insufficient resolution, image corruption, dust or scratches on the slides or experimental error during the laboratory process. To apply any computational method for analyzing such missing valued data results in loss of biological meaning. To overcome this difficulty, this paper proposes a method “local least square imputation” for identification of missing values. This method uses local similarity structures and performs least square optimization for finding out the missing values. Further this paper uses multiclass gene expression data as they are rarely addressed in the literature. Fuzzy Rough set based f-information measure is used for gene selection and SVM is used for sample classification. From the simulation study, it is found that the proposed approach enhances the classification accuracy without the loss of its inherent biological meaning. Statistical analysis of the test result shows that the proposed method outperforms other approaches reported.

downloadDownload free PDF View PDFchevron_right

Rough Set Approach for Generation of Classification Rules of Breast Cancer Data

aboul ella hassanien

Informatica (lithuanian Academy of Sciences), 2004

Extensive amounts of knowledge and data stored in medical databases require the development of specialized tools for storing, accessing, analysis, and effectiveness usage of stored knowledge and data. Intelligent methods such as neural networks, fuzzy sets, decision trees, and expert systems are, slowly but steadily, applied in the medical fields. Recently, rough set theory is a new intelligent technique was used for the discovery of data dependencies, data reduction, approximate set classification, and rule induction from databases.

downloadDownload free PDF View PDFchevron_right

Rough Set Protein Classifier

Ramadevi Yellasiri

2005

Classification of voluminous protein data based on the structural and functional properties is a challenging task for researchers in bioinformatics field. In this paper a faster, accurate and efficient classification tool Rough Set Protein Classifier has been developed which has a classification accuracy of 97.7%. It is a hybridized tool comprising Sequence Arithmetic, Rough Set Theory and Concept Lattice. It reduces the domain search space to 9% without losing the potentiality of classification of proteins.

downloadDownload free PDF View PDFchevron_right

Applications of Rough Sets in Health Sciences and Disease Diagnosis

Zain Abbas

2015

Soft computing is a consortium of techniques that work together to setup flexible information processing capability for handling real-life ambiguous situations. It aims at solving problems involving uncertainty and imprecision mimicking the human like decision making. Fuzzy set theory is an approach that has been widely adopted in such situations. Rough Set Theory (RST) is another soft computing approach that uses sets to represent vague or incomplete knowledge and provide a framework for approximation of concepts. It has been widely used to deal with imprecision in health sciences such as in patient diagnosis and disease classification. In this paper we present a review of rough set theory and its applications in disease diagnosis with several examples using real data sets. Key-Words: Rough Set Theory, Soft Computing, Vague data, Imprecision, Health Sciences, Disease diagnosis

downloadDownload free PDF View PDFchevron_right

A Comparative Analysis of Rough Set Based Intelligent Techniques for Unsupervised Gene Selection

Nizar Banu P K

International Journal of System Dynamics Applications, 2013

As the micro array databases increases in dimension and results in complexity, identifying the most informative genes is a challenging task. Such difficulty is often related to the huge number of genes with very few samples. Research in medical data mining addresses this problem by applying techniques from data mining and machine learning to the micro array datasets. In this paper Unsupervised Tolerance Rough Set based Quick Reduct (U-TRS-QR), a diverse feature selection algorithm, which extends the existing equivalent rough sets for unsupervised learning, is proposed. Genes selected by the proposed method leads to a considerably improved class predictions in wide experiments on two gene expression datasets: Brain Tumor and Colon Cancer. The results indicate consistent improvement among 12 classifiers.

downloadDownload free PDF View PDFchevron_right

Rough Sets in Medical Informatics Applications

aboul ella hassanien

2009

Rough sets offer an effective approach of managing uncertainties and can be employed for tasks such as data dependency analysis, feature identification, dimensionality reduction, and pattern classification. As these tasks are common in many medical applications it is only natural that rough sets, despite their relative 'youth'compared to other techniques, provide a suitable method in such applications.

downloadDownload free PDF View PDFchevron_right

Fuzzy–Rough Sets for Information Measures and Selection of Relevant Genes From Microarray Data

Sankar Pal

IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 2010

Several information measures such as entropy, mutual information, and f-information have been shown to be successful for selecting a set of relevant and nonredundant genes from a high-dimensional microarray data set. However, for continuous gene expression values, it is very difficult to find the true density functions and to perform the integrations required to compute different information measures. In this regard, the concept of the fuzzy equivalence partition matrix is presented to approximate the true marginal and joint distributions of continuous gene expression values. The fuzzy equivalence partition matrix is based on the theory of fuzzy-rough sets, where each row of the matrix represents a fuzzy equivalence partition that can automatically be derived from the given expression values. The performance of the proposed approach is compared with that of existing approaches using the class separability index and the predictive accuracy of the support vector machine. An important finding, however, is that the proposed approach is shown to be effective for selecting relevant and nonredundant continuous-valued genes from microarray data.

downloadDownload free PDF View PDFchevron_right

Multiple DNA Sequence Alignment using a Hybrid Model of GA and Rough Sets

Sayed Fouad, Sara El-Metwally

Multiple DNA Sequence alignment is one of the most important bioinformatics problems where the search space is too large for exact algorithms to be possible. This paper proposes a new approach for multiple sequence alignment using a hybrid model of Genetic Algorithm and Rough Sets. Genetic Algorithms try to evolve initial alignment to find better one with better similarity score. The resulting alignment of genetic algorithm contain gaps in random locations of sequences and these gaps are translated to missing bases when the alignment is converted to suitable representation for Rough Sets analysis. Rough sets try to cluster DNA sequence in the presence of missing bases and build the Modified Similarity Relation that is dependent on the number of missing bases with respect to the number of the whole defined attributes for each sequence. The discernibility matrix has been constructed to compute the minimal sets of reducts, which used to extract the minimal sets of decision rules that describe similarity relations between sequences. These rules are used later as basis to compare between sequences, rule-to-rule rather than residue-to-residue comparing. Fragments of DNA sequences of swine influenza A (H1N1) virus are aligned using our hybrid model of Genetic Algorithm and Rough Sets.

downloadDownload free PDF View PDFchevron_right

Learning rough set classifiers from gene expressions and clinical data

Kristin Norsett

2002

Biological research is currently undergoing a revolution. With the advent of microarray technology the behavior of thousands of genes can be measured simultaneously. This capability opens a wide range of research opportunities in biology, but the technology generates a vast amount of data that cannot be handled manually. Computational analysis is thus a prerequisite for the success of this technology, and research and development of computational tools for microarray analysis are of great importance.

downloadDownload free PDF View PDFchevron_right

A Study on Hybridization of Intelligent Techniques in Bioinformatics

Peyakunta Bhargavi

Fuzzy Systems, 2017

This chapter aims to study the use of Hybridization of intelligent techniques in the areas of bioinformatics and computational molecular biology. These areas have risen from the needs of biologists to utilize and help interpret the vast amounts of data that are constantly being gathered in genomic research. Also describes the kind of methods which were developed by the research community in order to search, classify and mine different available biological databases and simulate biological experiments. This chapter also presents the hybridization of intelligent systems involving neural networks, fuzzy systems, neuro-fuzzy system, rough set theory, swam intelligence and genetic algorithm. The key idea was to demonstrate the evolution of intelligence in bioinformatics. The developed hybridization of intelligent techniques was applied to the real world applications. The hybridization of intelligent systems performs better than the individual approaches. Hence these approaches might be e...

downloadDownload free PDF View PDFchevron_right

Loading Preview

Sorry, preview is currently unavailable. You can download the paper by clicking the button above.

References (4)

© 2008 Computational Intelligence in Bioinformatics Editors: Kelemen,Arpad, Abraham, Ajith, Chen, Yuehui (Eds.)
Computational intelligence techniques in bioinformatics Aboul Ella Hassanien 1 , Eiman Tamah Al- Shammari, Neveen I Ghali
Computational Intelligence in Bioinformatics by Jean-Christophe Nebel
Computational Intelligence in Solving Bioinformatics Problems: Reviews, Perspectives, and Challenges by Aboul-Ella Hassanien1,2, Mariofanna G. Milanova3, Tomasz G. Smolinski4, and Ajith Abraham5

iir publications

Gene selection is a main procedure of discriminate analysis of microarray data which is the process of selecting most informative genes from the whole gene data base. This paper approach a method for selecting informative genes by using Rough Set Theory. Rough Set Theory is a effective mathematical tool for selecting informative genes. This paper describes basics of Rough Set Theory and Rough Set attribute reduction by Quick -Reduct based Genetic Algorithm.

downloadDownload free PDF View PDFchevron_right

Hybrid system based on rough sets and genetic algorithms for medical data classifications

Abeer Korany

2013

Computational intelligence provides the biomedical domain by a significant support. The application of machine learning techniques in medical applications have been evolved from the physician needs. Screening, medical images, pattern classification, prognosis are some examples of health care support systems. Typically medical data has its own characteristics such as huge size and features, continuous and real attributes that refer to patients' investigations. Therefore, discretization and feature selection process are considered a key issue in improving the extracted knowledge from patients' investigations records. In this paper, a hybrid system that integrates Rough Set (RS) and Genetic Algorithm (GA) is presented for the efficient classification of medical data sets of different sizes and dimensionalities. Genetic Algorithm is applied with the aim of reducing the dimension of medical datasets and RS decision rules were used for efficient classification. Furthermore, the proposed system applies the Entropy Gain Information (EI) for discretization process. Four biomedical data sets are tested by the proposed system (EI-GA-RS), and the highest score was obtained through three different datasets. Other different hybrid techniques shared the proposed technique the highest accuracy but the proposed system preserves its place as one of the highest results systems four three different sets. EI as discretization technique also is a common part for the best results in the mentioned datasets while RS as an evaluator realized the best results in three different data sets.

downloadDownload free PDF View PDFchevron_right

Computational intelligence in solving bioinformatics problems: Reviews, perspectives, and challenges

Tomasz Smolinski

… in Biomedicine and …, 2008

downloadDownload free PDF View PDFchevron_right

Computational intelligence in solving bioinformatics problems

Professor Aboul Ella Hassanien

Artificial Intelligence in Medicine, 2005

This chapter presents a broad overview of Computational Intelligence (CI) techniques including Artificial Neural Networks (ANN), Particle Swarm Optimization (PSO), Genetic Algorithms (GA), Fuzzy Sets (FS), and Rough Sets (RS). We review a number of applications of computational intelligence to problems in bioinformatics and computational biology, including gene expression, gene selection, cancer classification, protein function prediction, multiple sequence alignment, and DNA fragment assembly. We discuss some representative methods to provide inspiring examples to illustrate how CI could be applied to solve bioinformatic problems and how bioinformatics could be analyzed, processed, and characterized by computational intelligence. Challenges to be addressed and future directions of research are presented. An extensive bibliography is also included.

downloadDownload free PDF View PDFchevron_right

Neural Networks and Rough Sets: A comparative study on data classification

Renato Sassi

The 2006 International …, 2006

This paper addresses a contrastive study between Neural Networks and Rough Sets on data classification. The experiments were carried out using the Iris database, of public domain, to evaluate the classification. The confusion matrix method was used to evaluate the performance of these classifiers. With these contrastive experiments, we investigated the capacity of each classifier for application in a potential application on knowledge extraction in databases. In this experiment the results indicate that the Neural Networks classifier, except SLP, presents significant superiority on Rough Sets classifiers.

downloadDownload free PDF View PDFchevron_right

Gene Selection Using Multi-objective Genetic Algorithm Integrating Cellular Automata and Rough Set Theory

Arka Ghosh

Springer International Publishing Switzerland LNCS 8298, pp. 144–155, 2013., 2013

Feature selection is one of the most key problems in the field of machine learning and data mining. It can be done in mainly two different ways, namely, filter approach and wrapper approach. Filter approach is independent of underlying classifier logic and relatively less costly than the wrapper approach which is classifier dependent. Many researchers have applied Genetic algorithm (GA) as wrapper approach for feature selection. In the paper, a novel feature selection method is proposed based on the multi-objective genetic algorithm which is applied on population generated by non-linear uniform hybrid cellular automata. The fitness functions are defined one using set lower bound approximation of rough set theory and the other using Kullbak-Leibler divergence method. A comparative study between proposed method and some leading feature selection methods are given using some popular microarray cancer dataset to demonstrate the effectiveness of the method.

downloadDownload free PDF View PDFchevron_right

Introduction to the Special Issue on Rough Sets and Knowledge Discovery

Wojciech Ziarko

Computational Intelligence, 1995

downloadDownload free PDF View PDFchevron_right

Rough-Fuzzy C-Medoids Algorithm and Selection of Bio-Basis for Amino Acid Sequence Analysis

Sankar Pal

IEEE Transactions on Knowledge and Data Engineering, 2007

In most pattern recognition algorithms, amino acids cannot be used directly as inputs since they are nonnumerical variables. They, therefore, need encoding prior to input. In this regard, bio-basis function maps a nonnumerical sequence space to a numerical feature space. It is designed using an amino acid mutation matrix. One of the important issues for the bio-basis function is how to select the minimum set of bio-bases with maximum information. In this paper, we describe an algorithm, termed as rough-fuzzy c-medoids (RFCMdd) algorithm, to select the most informative bio-bases. It is comprised of a judicious integration of the principles of rough sets, fuzzy sets, the c-medoids algorithm, and the amino acid mutation matrix. While the membership function of fuzzy sets enables efficient handling of overlapping partitions, the concept of lower and upper bounds of rough sets deals with uncertainty, vagueness, and incompleteness in class definition. The concept of crisp lower bound and fuzzy boundary of a class, introduced in RFCMdd, enables efficient selection of the minimum set of the most informative bio-bases. Some new indices are introduced for evaluating quantitatively the quality of selected bio-bases. The effectiveness of the proposed algorithm, along with a comparison with other algorithms, has been demonstrated on different types of protein data sets.

downloadDownload free PDF View PDFchevron_right

A New Gene Selection Algorithm using Fuzzy-Rough Set Theory for Tumor Classification

Seyedeh Farahbakhshian, Milad Taleby Ahvanooey

CONTROL ENGINEERING AND APPLIED INFORMATICS, 2020

In statistics and machine learning, feature selection is the process of picking a subset of relevant attributes for utilizing in a predictive model. Recently, rough set-based feature selection techniques, that employ feature dependency to perform selection process, have been drawn attention. Classification of tumors based on gene expression is utilized to diagnose proper treatment and prognosis of the disease in bioinformatics applications. Microarray gene expression data includes superfluous feature genes of high dimensionality and smaller training instances. Since exact supervised classification of gene expression instances in such high-dimensional problems is very complex, the selection of appropriate genes is a crucial task for tumor classification. In this study, we present a new technique for gene selection using a discernibility matrix of fuzzy-rough sets. The proposed technique takes into account the similarity of those instances that have the same and different class labels to improve the gene selection results, while the state-of-the art previous approaches only address the similarity of instances with different class labels. To meet that requirement, we extend the Johnson reducer technique into the fuzzy case. Experimental results demonstrate that this technique provides better efficiency compared to the state-of-the-art approaches.

downloadDownload free PDF View PDFchevron_right

Rough Set-Based Neuro-Fuzzy System

Chai Quek

Encyclopedia of Artificial Intelligence, 2009

This paper presents a novel hybrid intelligent system which synergizes the concept of knowledge reduction in rough set theory with the human-like reasoning style of fuzzy systems and the learning and connectionist structure of neural networks. The proposed rough set-based neuro-fuzzy system (RNFS) incorporates a wrapper-based feature selection method that employs the mutual information maximization scheme which selects attributes with high relevance and the concept of knowledge reduction in rough set theory which selects attributes with low redundancy. Experimental results show that the proposed RNFS utilizes less computational effort and yielded promising results on feature selection as well as classification accuracy.

downloadDownload free PDF View PDFchevron_right

[IJCST-V10I1P5]:C. Sunitha Ram, Swetha Gayathri Kuchimanchi

Sign up for access to the world's latest research

Abstract

Related papers

References (4)

Related papers