reGenotyper: Detecting mislabeled samples in genetic data

C. ELEGANS 0301 basic medicine Nematoda QTL Gene Expression DISEASE NATURAL VARIATION DATA Q R Animal Models Genomics Phenotypes Phenotype Experimental Organism Systems Caenorhabditis Elegans Physical Sciences POPULATIONS Medicine Algorithms Research Article EXPRESSION 570 Genotype Permutation Science Quantitative Trait Loci Polymorphism, Single Nucleotide MIX-UPS 03 medical and health sciences Model Organisms Genome-Wide Association Studies Genetics Life Science Animals Humans Computer Simulation GENOME-WIDE ASSOCIATION Evolutionary Biology Population Biology IDENTIFICATION PERTURBATION Discrete Mathematics Gene Expression Profiling Organisms Biology and Life Sciences Computational Biology Reproducibility of Results Human Genetics Genome Analysis Invertebrates Genetic Loci Combinatorics Genetic Polymorphism Caenorhabditis Population Genetics Mathematics
DOI: 10.1371/journal.pone.0171324 Publication Date: 2017-02-13T19:44:20Z
ABSTRACT
In high-throughput molecular profiling studies, genotype labels can be wrongly assigned at various experimental steps; the resulting mislabeled samples seriously reduce the power to detect the genetic basis of phenotypic variation. We have developed an approach to detect potential mislabeling, recover the "ideal" genotype and identify "best-matched" labels for mislabeled samples. On average, we identified 4% of samples as mislabeled in eight published datasets, highlighting the necessity of applying a "data cleaning" step before standard data analysis.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (36)
CITATIONS (18)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....