The fuzzy gene filter: A classifier performance assesment
Abstract
The Fuzzy Gene Filter (FGF) is an optimised Fuzzy Inference System designed to rank genes in order of differential expression, based on expression data generated in a microarray experiment. This paper examines the effectiveness of the FGF for feature selection using various classification architectures. The FGF is compared to three of the most common gene ranking algorithms: t-test, Wilcoxon test and ROC curve analysis. Four classification schemes are used to compare the performance of the FGF vis-a-vis the standard approaches: K Nearest Neighbour (KNN), Support Vector Machine (SVM), Naive Bayesian Classifier (NBC) and Artificial Neural Network (ANN). A nested stratified Leave-One-Out Cross Validation scheme is used to identify the optimal number top ranking genes, as well as the optimal classifier parameters. Two microarray data sets are used for the comparison: a prostate cancer data set and a lymphoma data set.
- Publication:
-
arXiv e-prints
- Pub Date:
- August 2011
- DOI:
- 10.48550/arXiv.1108.4545
- arXiv:
- arXiv:1108.4545
- Bibcode:
- 2011arXiv1108.4545P
- Keywords:
-
- Computer Science - Machine Learning;
- Computer Science - Computational Engineering;
- Finance;
- and Science
- E-Print:
- Intelligent Systems and Control / 742: Computational Bioscience (ISC 2011) July 11 - 13, 2011 Cambridge, United Kingdom Editor(s): J.F. Whidborne, P. Willis, G. Montana