RADdesigner: a workflow to select the optimal sequencing methodology in genotyping experiments on woody plant species

0301 basic medicine 03 medical and health sciences 15. Life on land
DOI: 10.1007/s11295-019-1372-3 Publication Date: 2019-07-18T16:02:41Z
ABSTRACT
Polymorphism detection in woody plant species is important in phylogenetics and an essential stage in plant breeding programs. Genotyping has benefitted from the development of next-generation sequencing (NGS) techniques. Methodologies based upon single and double-digested DNA sequencing (RADseq and ddRADseq) allow the detection of polymorphisms on thousands of loci in non-model organisms for which a reference genome is not available, such as many woody plant species. However, experiments using these methodologies are not safe from technical errors, and require careful designs in order to reduce library preparation and sequencing costs at the same time unbiased results are obtained. In order to avoid undesired effects of technical errors on genotyping, it is advisable to incorporate error quantification through pilot sequencing of technical replicates, along with filtering strategies and parameter tuning, so the technical conditions of the whole experiment can be properly defined a priori. However, these kinds of approaches to account for technical errors have been frequently disregarded in the literature. Here, we present RADdesigner, a workflow that combines pilot sequencing of replicate samples with a bioinformatic pipeline to select the most suitable methodology by quantifying genotyping errors, to optimize polymorphism detection parameters, and to perform post-filtering of the results produced by the commonly used RADseq bioinformatic pipeline IpyRAD. The workflow was validated using a pilot study that analyzed a subset of replicate samples of Quercus suber, Q. ilex, and their hybrids.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (68)
CITATIONS (8)