Exploiting the Accumulated Evidence for Gene Selection in Microarray Gene Expression Data
Abstract
Machine learning methods have recently made significant contributions to solving multidisciplinary problems in cancer classification using microarray gene expression data. Feature subset selection methods can play an important role in the modeling process, since these tasks are characterized by a large number of features and only a few observations, which makes modeling a non-trivial undertaking. In this scenario, it is extremely important to select genes by taking into account their possible interactions with other gene subsets. This paper shows that, by accumulating the evidence in favour of (or against) each gene along the search process, the obtained gene subsets may constitute better solutions in terms of predictive accuracy, gene subset size, or both. The proposed technique is extremely simple and applicable at negligible additional cost.
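The abstract does not spell out the paper's two algorithms, so the following is only a minimal sketch of one possible reading of "accumulating evidence": a greedy forward search in which each candidate gene's gain is summed over every step at which it is evaluated, and the next gene is chosen by that running total rather than by the current step alone. The function name `select_with_accumulated_evidence`, the `score` callback, the stopping rule, and the toy weights are all assumptions made for illustration; they are not taken from the paper.

```python
from typing import Callable, Dict, FrozenSet, List


def select_with_accumulated_evidence(
    n_genes: int,
    score: Callable[[FrozenSet[int]], float],
    max_size: int,
) -> List[int]:
    """Forward selection that ranks candidates by accumulated gain.

    Hypothetical illustration only; not the algorithm from the paper.
    """
    selected: List[int] = []
    # Running evidence in favour of (positive) or against (negative) each gene.
    evidence: Dict[int, float] = {g: 0.0 for g in range(n_genes)}
    current = score(frozenset(selected))

    while len(selected) < max_size:
        candidates = [g for g in range(n_genes) if g not in selected]
        if not candidates:
            break
        for g in candidates:
            # Gain of adding g to the current subset, added to its history.
            gain = score(frozenset(selected + [g])) - current
            evidence[g] += gain
        best = max(candidates, key=lambda g: evidence[g])
        if evidence[best] <= 0.0:
            break  # assumed stopping rule: no candidate has net support
        selected.append(best)
        current = score(frozenset(selected))
    return selected


# Toy additive score (made-up weights) standing in for a classifier's accuracy.
weights = [0.5, 0.1, -0.2, 0.3]
toy_score = lambda subset: sum(weights[g] for g in subset)
print(select_with_accumulated_evidence(4, toy_score, max_size=4))  # prints [0, 3, 1]
```

In this sketch the only extra work per step is one addition per candidate, which is in the spirit of the abstract's claim of negligible overhead. In practice `score` would be something like a cross-validated classifier accuracy on the candidate gene subset.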
- Publication:
- arXiv e-prints
- Pub Date:
- March 2013
- DOI:
- 10.48550/arXiv.1303.0156
- arXiv:
- arXiv:1303.0156
- Bibcode:
- 2013arXiv1303.0156P
- Keywords:
- Computer Science - Computational Engineering, Finance, and Science;
- Computer Science - Machine Learning;
- Quantitative Biology - Quantitative Methods;
- I.5.2
- E-Print:
- 10 pages, 2 algorithms. A shorter version of this paper appeared in the Proceedings of the 19th European Conference on Artificial Intelligence (ECAI 2010).