Population stratification using a statistical model on hypergraphs
Abstract
Population stratification is a problem encountered in several areas of natural science, engineering, and public health. We tackle this problem by mapping a population and its element attributes onto a hypergraph, a natural extension of the concept of graph or network to encode associations among any number of elements. On this hypergraph, we construct a statistical model reflecting our intuition about how the element attributes can emerge from a postulated population structure. Finally, we introduce the concept of stratification representativeness as a mean to identify the simplest stratification already containing most of the information about the population structure. We demonstrate the power of this framework stratifying an animal and a human population based on phenotypic and genotypic properties, respectively.
- Publication:
-
Physical Review E
- Pub Date:
- June 2008
- DOI:
- 10.1103/PhysRevE.77.066106
- arXiv:
- arXiv:0712.1365
- Bibcode:
- 2008PhRvE..77f6106V
- Keywords:
-
- 89.75.Hc;
- 89.75.Fb;
- 02.50.Tt;
- Networks and genealogical trees;
- Structures and organization in complex systems;
- Inference methods;
- Quantitative Biology - Populations and Evolution;
- Computer Science - Artificial Intelligence;
- Physics - Data Analysis;
- Statistics and Probability
- E-Print:
- 7 pages, 6 figures