Coupled two-way clustering analysis of gene microarray data
Abstract
We present a coupled two-way clustering approach to gene microarray data analysis. The main idea is to identify subsets of the genes and samples, such that when one of these is used to cluster the other, stable and significant partitions emerge. The search for such subsets is a computationally complex task. We present an algorithm, based on iterative clustering, that performs such a search. This analysis is especially suitable for gene microarray data, where the contributions of a variety of biological mechanisms to the gene expression levels are entangled in a large body of experimental data. The method was applied to two gene microarray data sets, on colon cancer and leukemia. By identifying relevant subsets of the data and focusing on them we were able to discover partitions and correlations that were masked and hidden when the full dataset was used in the analysis. Some of these partitions have clear biological interpretation; others can serve to identify possible directions for future research.
- Publication:
-
Proceedings of the National Academy of Science
- Pub Date:
- October 2000
- DOI:
- 10.1073/pnas.210134797
- arXiv:
- arXiv:physics/0004009
- Bibcode:
- 2000PNAS...9712079G
- Keywords:
-
- Cell Biology / Genetics;
- Physics - Biological Physics;
- Physics - Computational Physics;
- Physics - Data Analysis;
- Statistics and Probability;
- Quantitative Biology - Quantitative Methods
- E-Print:
- doi:10.1073/pnas.210134797