New molecular sequence data and species trees for North American whipsnakes

In this data article we present species trees based on coalescent species delimitation results for North American whipsnakes, as well as metadata pertaining to the article "The effect of missing data on coalescent species delimitation and a taxonomic revision of whipsnakes (Colubridae: Masticophis)" (MPE-2017-76-R1). Species trees were constructed using SNP data generated from double-digest RADseq, filtered to 80% completeness between species. Tables correspond with the primary manuscript and serve as a repository of genetic sequence information for whipsnakes. These data can be downloaded and combined with future whipsnake datasets.


a b s t r a c t
In this data article we present species trees based on coalescent species delimitation results for North American whipsnakes, as well as metadata pertaining to the article "The effect of missing data on coalescent species delimitation and a taxonomic revision of whipsnakes (Colubridae: Masticophis)" (MPE-2017-76-R1). Species trees were constructed using SNP data generated from double-digest RADseq, filtered to 80% completeness between species. Tables correspond with the primary manuscript and serve as a repository of genetic sequence information for whipsnakes. These data can be downloaded and combined with future whipsnake datasets.
& 2018 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).

Subject area
Biology More specific subject area Phylogenetics, Herpetology Type of data Figure,

Value of the data
The mitochondrial phylogeny shows the phylogenetic structure of whipsnakes, with an emphasis on the Great Plains lineage of the United States.
The included species trees reveal previously unknown phylogenetic relationships among whipsnakes.
Supplementary Table 1 shows metadata for whipsnake sequence data that could be used in future studies. Table 1 explains missing data thresholds used for various analyses.

Data
The data included in this DIB article includes details on two species trees of North American whipsnakes, as well as the meta-data for all whipsnake sequence data used in the original article [1]. The species trees were estimated using coalescent methods from SNP data, with 20% missing data thresholds. Two species trees to encompass both species complexes included in this study. Meta-data includes information for both mitochondrial sequence data, and ddRADseq short sequence reads (Table 1).

Mitochondrial phylogenetic analysis
We aligned all sequences with the Geneious Aligner under default settings [2]. We calculated uncorrected average pairwise distance between lineages in Mega v7 [3]. We selected the most probable model of nucleotide evolution for Likelihood analyses using Bayesian information criteria implemented in PartitionFinder [4], partitioning by codon position. We estimated a maximum likelihood phylogeny using raxmlGUI v1.3 with 1000 rapid bootstrap iterations [5] and visualized our final phylogeny in FigTree v1.4.3 [6]. We considered nodes with bootstrap values Z70 as strongly supported (Fig. 1).

SNP-based species tree analysis
Species trees were estimated with r20% missing data using SNAPP v1.0. We assigned species identities based on the best supported model from our BFD* analyses. We allowed BEAUti to estimate the mutation rate, and confirmed that both U and V were approximately equal to one. We assigned a Gamma distribution to our Lambda prior, with an Alpha of 1 and a Beta of 77. On our Snap prior we assigned an Alpha of 1, a Beta of 100, and a Lambda of 77. We ran the analyses for 10,000,000 MCMC generations, sampling every 1000 generations. We visualized the complete tree sets in DENSITREE v1.0 [7], and removed the first 10% of trees as burn-in (Fig. 2).   Species trees generated using SNAPP based on the best-supported models from our Bayes Factor delimitation analysis from the primary manuscript for datasets C and E ( o 20% missing loci). Support values are labeled for each node that is not fully supported.