Background: Microarray technology is a powerful methodology for identifying differentially expressed genes. However, when thousands of genes in a microarray data set are evaluated simultaneously by fold changes and significance tests, the... more
Background: An important use of data obtained from microarray measurements is the classification of tumor types with respect to genes that are either up or down regulated in specific cancer types. A number of algorithms have been proposed... more
Variation in cellular activity in a tissue induces changes in RNA concentration, which affects the validity of gene mRNA abundance analyzed by reverse transcription quantitative PCR (RT-qPCR). A common way of accounting for such variation... more
In the field of medical science diseases diagnosis by Tissue microarrays is one of the active areas of research .There are various gene selection techniques in the literature. Gene selection provides genes subsets that are capable to... more
Microarray technologies are rapidly becoming available for new species including teleost fishes. We constructed a rainbow trout cDNA microarray targeted at the identification of genes which are differentially expressed in response to... more
Kinesin super family protein 2A (KIF2A) is an ATP-dependent microtubule destabilizer, that belongs to the kinesin-13 family. It is highly expressed in juvenile brains, but its postnatal function has not been determined due to mortality in... more
Cancer has been suggested to result from interactions between genetic and environmental factors, and certain subgroups in the general population may be at increased risk because of their relatively higher susceptibility to environmental... more
Summary A serial analysis of gene expression (SAGE) library is a collection of thousands of small DNA “tags,” each of which represents a distinct messenger RNA (mRNA) transcript. Existing methods have been proposed for analyzing single... more
Microarray experiments generate large datasets with expression values for thousands of genes, but not more than a few dozens of samples. A challenging task with these data is to reveal groups of genes which act together and whose... more
Although X chromosome inactivation in female mammals evolved to balance the expression of X chromosome and autosomal genes in the two sexes, female embryos pass through developmental stages in which both X chromosomes are active in... more
Breast cancer is the second leading cause of female mortality globally. Effective diagnostic tools, such as biosensors that utilize reliable biomarkers, are essential for early detection, particularly in low-income countries. This study... more
Kinesin super family protein 2A (KIF2A) is an ATP-dependent microtubule destabilizer, that belongs to the kinesin-13 family. It is highly expressed in juvenile brains, but its postnatal function has not been determined due to mortality in... more
Napier grass (Pennisetum purpureum Schumach.) is a highly productive C4 tropical forage grass that has been targeted as a potential bioenergy crop. To further increase the efficiency of bioethanol production by molecular breeding, a... more
Cancer can be considered as one of the leading causes of death widely. One of the most effective tools to be able to handle cancer diagnosis, prognosis, and treatment is by using expression profiling technique which is based on microarray... more
In microarray studies, the number of samples is relatively small compared to the number of genes per sample. An important aspect of microarray studies is the prediction of patient survival based on their gene expression profile. This... more
Gene expression profiling has been widely used to study molecular signatures of many diseases and to develop molecular diagnostics for disease prediction. Gene selection, as an important step for improved diagnostics, screens tens of... more
It is crucial to accurately categorize cancers using microarray data. Researchers have employed a variety of computational intelligence approaches to analyze gene expression data. It is believed that the most difficult part of the problem... more
Selection of genes that are differentially expressed and critical to a particular biological process has been a major challenge in post-array analysis. Recent development in bioinformatics has made various data sources available such as... more
This paper presents the development of the N-Spherical Minimalist Machine Learning (MML) classifier, an innovative model within the Minimalist Machine Learning paradigm. Using N-spherical coordinates and concepts from metaheuristics and... more
The concept of linear separability of gene expression data sets with respect to two classes has been recently studied in the literature. The problem is to efficiently find all pairs of genes which induce a linear separation of the data.... more
It is known that breast cancer is not just one disease, but rather a collection of many different diseases occurring in one site that can be distinguished based in part on characteristic gene expression signatures. Appropriate diagnosis... more
Human brain development is a continuum governed by differential gene expression. Therefore, we proceeded to identify genes selectively expressed in the developing brain. Using differential display and library screening, a novel rat cDNA,... more
In this paper, we compare the accuracy of classification for different cancers, based on gene microarray expression data. For this reason, we have used a combination between filter selection methods and clustering algorithms to select... more
Kinesin super family protein 2A (KIF2A) is an ATP-dependent microtubule destabilizer, that belongs to the kinesin-13 family. It is highly expressed in juvenile brains, but its postnatal function has not been determined due to mortality in... more
pair linkage analysis of 24 families with pre-eclampsia, we confirm a susceptibility locus on chromosome 10q22.1 in Dutch females: a multipoint non-parametric linkage score of 3.6 near marker D10S1432 was obtained. Haplotype analysis... more
Nous avons proposé un algorithme original de Fouille de Données, LICORN, afin d'inférer des relations de régulation coopérative à partir de données d'expression. LICORN donne de bons résultats s'il est appliqué à des données de levure,... more
L'inférence de signatures de facteurs de transcription à partir des données puces à ADN a déjà été étudié dans la communauté bioinformatique. La principale difficulté à résoudre est de trouver un ensemble d'heuristiques pertinentes, afin... more
I would like to thank a number of people for their help, encouragement and support during the undertaking and writing up of this work. Firstly, my thanks go to both my supervisors Professor David Humber and Professor Robert Cheke. Without... more
Machine-learning algorithms have been widely used in breast cancer diagnosis to help pathologists and physicians in the decision-making process. However, the high dimensionality of genetic data makes the classification process a... more
Je tiens tout d’abord à remercier mes encadrants. Après avoir grandemment contribué à me faire venir enseigner à l’Université, Eric Matzner-Løber m’a fait confiance pour ce travail de recherche; je tiens à lui témoigner toute ma... more
Androgens play an important role in controlling the growth of the normal prostate gland and in the pathogenesis of benign prostate hyperplasia, and prostate cancer. Although testosterone is the main androgen secreted from the testes,... more
22q11 deletion syndrome (22q11DS) is a developmental anomaly caused by a microdeletion on human chromosome 22q11. Although mouse models indicate that Tbx1 is the gene responsible for the syndrome, the phenotypic spectrum of del22q11... more
Cystic fibrosis (CF) is caused by mutations in the gene encoding the cystic fibrosis conductance transmembrane regulator (CFTR). Symptoms are pancreatic insufficiency, chronic obstructive lung disease, liver disease, chronic sinusitis and... more
In the classification of Mass Spectrometry (MS) proteomics data, peak detection, feature selection, and learning classifiers are critical to classification accuracy. To better understand which methods are more accurate when classifying... more
This paper proposes an optimized classifier based on feature elimination (OCFE) for gene selection with combining two feature elimination methods, ReliefF and SVM-RFE. ReliefF algorithm is filter feature selection which rank the data by... more
Selecting the most relevant factors from genetic profiles that can optimally characterize cellular states is of crucial importance in identifying complex disease genes and biomarkers for disease diagnosis and assessing drug efficiency. In... more
Motivation: Pre-selection of informative features for supervised classification is a crucial, albeit delicate, task. It is desirable that feature selection provides the features that contribute most to the classification task per se and... more
Microarray technology today has the ability of having the whole genome spotted on a single chip. It allows the biologist to inspect thousands of gene activities simultaneously. Machine learning approaches are suited and used to... more
In gene expression microarray data analysis, selecting a small number of discriminative genes from thousands of genes is an important problem for accurate classification of diseases or phenotypes. The problem becomes particularly... more
Accuracy in quantitative real-time polymerase chain reaction (qPCR) requires the use of stable endogenous controls. Normalization with multiple reference genes is the gold standard, but their identification is a laborious task, especially... more
This paper presents an informative gene set selection approach to tumor diagnosis based on the Distance Sensitive Rival Penalized Competitive Learning (DSRPCL) algorithm and redundancy analysis. Since the DSRPCL algorithm can allocate an... more
Selection of significant genes via expression patterns is an important problem in microarray data processing. In this article, we propose and study a new method for selecting relevant genes obtained by spectral biclustering and based on... more
Selective Alzheimer's disease indicator-1 (seladin-1) is a novel gene with antiapoptotic activity that is down-regulated in vulnerable brain regions in Alzheimer's disease. This gene encodes 3--hydroxysterol ⌬-24-reductase (DHCR24),... more
Selective Alzheimer's disease indicator-1 (seladin-1) is a novel gene with antiapoptotic activity that is down-regulated in vulnerable brain regions in Alzheimer's disease. This gene encodes 3--hydroxysterol ⌬-24-reductase (DHCR24),... more
The unique characteristic of a biological system in nature is described by its DNA through the transcription of its genes. This is like a memory map that each cell of an organism contains. An artificial embryonic cell contains a similar... more
Smoking remains a global health crisis, contributing to addiction and diverse diseases through complex biological mechanisms. This study explores the hypothesis that smoking induces epigenetic modifications and alters bidirectional... more
The increasing dimensionality of gene expression data poses significant challenges in cancer classification, particularly in colon cancer. This study presents a novel filtering approach (FA) and a gene classifier (GC) to enhance gene... more