Attack of the PCR clones: Rates of clonality have little effect on RAD-seq genotype calls. Euclide PT, McKinney GJ, Bootsma M, Tarsa C, Meek MH, Larson WA. Molecular Ecology Resources, Volume 20, Issue 1, 2019, Pages 66-78.

Interpretation of high-throughput sequence data requires an understanding of how decisions made during bioinformatic data processing can influence results. One source of bias that is often cited is PCR clones (or PCR duplicates). PCR clones are common in restriction site-associated sequencing (RAD-seq) data sets, which are increasingly being used for molecular ecology. To determine the influence PCR clones and the bioinformatic handling of clones have on genotyping, we evaluate four RAD-seq data sets. Data sets were compared before and after clones were removed to estimate the number of clones present in RAD-seq data, quantify how often the presence of clones in a data set causes genotype calls to change compared to when clones were removed, investigate the mechanisms that lead to genotype call changes and test whether clones bias heterozygosity estimates. Our RAD-seq data sets contained 30%-60% PCR clones, but 95% of RAD-tags had five or fewer clones. Relatively few genotypes changed once clones were removed (5%-10%), and the vast majority of these changes (98%) were associated with genotypes switching from a called to no-call state or vice versa. PCR clones had a larger influence on genotype calls in individuals with low read depth but appeared to influence genotype calls at all loci similarly. Removal of PCR clones reduced the number of called genotypes by 2% but had almost no influence on estimates of heterozygosity. As such, while steps should be taken to limit PCR clones during library preparation, PCR clones are likely not a substantial source of bias for most RAD-seq studies.
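The before-and-after comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline. It assumes genotype calls are held in dictionaries keyed by (individual, locus), that a no-call is stored as `None`, and that the hypothetical helpers `classify_change` and `summarize_changes` are an acceptable stand-in for whatever tooling was actually used. The toy genotypes are invented for illustration only.

```python
from collections import Counter

NO_CALL = None  # assumption: a missing genotype is represented as None


def classify_change(before, after):
    """Classify how one genotype call differs between the two data sets."""
    if before == after:
        return "unchanged"
    if before is NO_CALL or after is NO_CALL:
        # called -> no-call, or no-call -> called
        return "call_state_switch"
    # both called, but with different genotypes
    return "genotype_switch"


def summarize_changes(with_clones, without_clones):
    """Tally change classes over the (individual, locus) keys in both dicts."""
    shared = with_clones.keys() & without_clones.keys()
    return Counter(
        classify_change(with_clones[key], without_clones[key]) for key in shared
    )


# Toy calls, invented for illustration (not data from the study).
with_clones = {
    ("ind1", "locA"): "A/A",
    ("ind1", "locB"): "A/G",
    ("ind2", "locA"): "A/G",
    ("ind2", "locB"): "G/G",
}
without_clones = {
    ("ind1", "locA"): "A/A",
    ("ind1", "locB"): None,
    ("ind2", "locA"): "A/A",
    ("ind2", "locB"): "G/G",
}

summary = summarize_changes(with_clones, without_clones)
changed = sum(n for kind, n in summary.items() if kind != "unchanged")
print(summary, changed)
```

Separating call-state switches from genotype switches mirrors the distinction the abstract draws between calls that move to or from a no-call state and calls that change genotype outright.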

Mol Ecol Resour. 2019 10;20(1):66-78