Efficient Identification of Equivalences in Dynamic Graphs and Pedigree Structures
Abstract
We propose a new framework for designing test and query functions for complex structures that vary across a given parameter such as genetic marker position. The operations of interest include equality testing, set operations, isolating unique states, duplication counting, and finding equivalence classes under identifiability constraints. A motivating application is locating equivalence classes in identity-by-descent (IBD) graphs, which are graph structures in pedigree analysis that change over genetic marker location. The nodes of these graphs are unlabeled and identified only by their connecting edges, a constraint easily handled by our approach. The framework is general enough to build a range of testing functions for IBD graphs, dynamic populations, and other structures using a minimal set of operations. The theoretical and algorithmic properties of our approach are analyzed and proved. Computational results on several simulations demonstrate the effectiveness of our approach.
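To make the identifiability constraint concrete, the following is a minimal sketch of grouping marker positions into equivalence classes when nodes are unlabeled and only edge labels carry identity. It is not the paper's framework: it uses a plain canonical form (each node described by the sorted labels of its incident edges) rather than the hashing operations the paper develops. The function names, the input layout, and the toy data are all assumptions made for illustration, and isolated nodes are assumed not to occur.

```python
from collections import defaultdict


def canonical_form(edges):
    """Return a node-label-free description of a graph.

    `edges` maps each (unique) edge label to a pair of node ids.
    Node ids are arbitrary placeholders and never appear in the result.
    (Hypothetical helper; assumes no isolated nodes.)
    """
    incident = defaultdict(list)
    for label, (a, b) in edges.items():
        incident[a].append(label)
        incident[b].append(label)  # a self-loop lists its label twice
    # Each node becomes the sorted tuple of its incident edge labels;
    # sorting the tuples removes any dependence on node order.
    return tuple(sorted(tuple(sorted(labels)) for labels in incident.values()))


def equivalence_classes(graphs_by_position):
    """Group marker positions whose graphs share a canonical form."""
    classes = defaultdict(list)
    for position, edges in graphs_by_position.items():
        classes[canonical_form(edges)].append(position)
    return sorted(classes.values())


# Toy data (illustrative only): three marker positions, two individuals.
graphs = {
    0: {"ind1": (0, 1), "ind2": (1, 2)},  # ind1 and ind2 share a node
    1: {"ind1": (7, 5), "ind2": (5, 9)},  # same structure, nodes renamed
    2: {"ind1": (0, 1), "ind2": (2, 3)},  # no shared node
}
print(equivalence_classes(graphs))
```

In this toy input, positions 0 and 1 differ only in how their nodes are numbered, so they land in the same class, while position 2 forms its own class. Comparing full canonical forms is simple but costs time proportional to graph size at every position, which is the kind of overhead an incremental approach would aim to avoid.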
- Publication:
- arXiv e-prints
- Pub Date:
- January 2013
- DOI:
- 10.48550/arXiv.1301.3946
- arXiv:
- arXiv:1301.3946
- Bibcode:
- 2013arXiv1301.3946K
- Keywords:
- Computer Science - Data Structures and Algorithms;
- Quantitative Biology - Quantitative Methods;
- Statistics - Computation
- E-Print:
- Code for paper available at http://www.stat.washington.edu/~hoytak/code/hashreduce