Simulation data for the estimation of numerical constants for approximating pairwise evolutionary distances between amino acid sequences
Abstract
Estimating the number of substitution events per site that have occurred during the evolution of a pair of amino acid sequences is a common task in phylogenetics and comparative genomics that often requires quite slow maximum-likelihood procedures when taking into account explicit evolutionary models. Data presented in this article are large sets of numbers of substitution events and associated numbers of observed differences between pairs of aligned amino acid sequences that have been generated through a simulation procedure of sequence evolution under a broad range of evolutionary models. These data are available at https://zenodo.org/record/2653704.
- Publication:
-
Data in Brief
- Pub Date:
- August 2019
- DOI:
- Bibcode:
- 2019DIB....2504212B
- Keywords:
-
- Amino acid;
- Evolutionary model;
- Corrected distance;
- Uncorrected distance;
- Computer simulation;
- Nonlinear regression