Efficient Identification of Equivalences in Dynamic Graphs and Pedigree Structures

doi:10.48550/arXiv.1301.3946

We propose a new framework for designing test and query functions for complex structures that vary across a given parameter, such as genetic marker position. The operations of interest include equality testing, set operations, isolating unique states, duplication counting, and finding equivalence classes under identifiability constraints. A motivating application is locating equivalence classes in identity-by-descent (IBD) graphs, the graph structures in pedigree analysis that change with genetic marker location. The nodes of these graphs are unlabeled and identified only by their connecting edges, a constraint our approach handles easily. The framework is general enough to build a range of testing functions for IBD graphs, dynamic populations, and other structures from a minimal set of operations. We analyze the theoretical and algorithmic properties of the approach, with proofs, and demonstrate its effectiveness computationally on several simulations.
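To make the problem statement concrete, the following is a minimal sketch of what "finding equivalence classes" can mean for graphs whose nodes are unlabeled. It is not the method of the paper or of the hashreduce code; it is a naive baseline that recomputes a canonical form at every marker position. It assumes, for illustration only, that each edge carries a unique sortable label and that node identifiers are arbitrary. The names `canonical_form`, `equivalence_classes`, and `states` are hypothetical.

```python
from collections import defaultdict


def canonical_form(edges):
    """Describe a graph by its labeled edges only, discarding node ids.

    edges: dict mapping a unique edge label to a pair of node ids.
    Assumption (not from the paper): edge labels are unique and sortable.
    """
    incident = defaultdict(list)
    for label, (a, b) in edges.items():
        incident[a].append(label)
        incident[b].append(label)  # a self-loop lists its label twice
    # Each node becomes the sorted tuple of edge labels that touch it;
    # sorting the nodes removes any dependence on the original node ids.
    return tuple(sorted(tuple(sorted(labels)) for labels in incident.values()))


def equivalence_classes(states):
    """Group parameter values (e.g. marker positions) whose graphs match."""
    classes = defaultdict(list)
    for position, edges in states.items():
        classes[canonical_form(edges)].append(position)
    return sorted(sorted(positions) for positions in classes.values())


# Hypothetical data: three marker positions, two labeled edges each.
states = {
    10: {"A": (1, 2), "B": (2, 3)},
    20: {"A": (7, 5), "B": (5, 9)},  # same graph as position 10, nodes renamed
    30: {"A": (1, 2), "B": (3, 4)},  # edges A and B no longer share a node
}

print(equivalence_classes(states))
```

Positions 10 and 20 fall into the same class because their graphs differ only in node identifiers, while position 30 forms its own class. The cost of this baseline grows with the number of positions times the graph size, which is the kind of repeated work a framework for structures that vary across a parameter would aim to avoid.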

Publication: arXiv e-prints

Pub Date: January 2013

DOI: 10.48550/arXiv.1301.3946

arXiv: arXiv:1301.3946

Bibcode: 2013arXiv1301.3946K

Keywords: Computer Science - Data Structures and Algorithms; Quantitative Biology - Quantitative Methods; Statistics - Computation

E-Print: Code for paper available at http://www.stat.washington.edu/~hoytak/code/hashreduce

NASA/ADS
