NAToRA, a relatedness-pruning method to minimize the loss of dataset size in genetic and omics analyses
Pruning
Heuristics
DOI:
10.1016/j.csbj.2022.04.009
Publication Date:
2022-04-09T15:17:41Z
AUTHORS (16)
ABSTRACT
Genetic and omics analyses frequently require independent observations, which is not guaranteed in real datasets. When relatedness cannot be accounted for, solutions involve removing related individuals (or observations) and, consequently, a reduction of available data. We developed network-based relatedness-pruning method that minimizes dataset while unwanted relationships dataset. It uses node degree centrality metric to identify highly connected nodes individuals) implements heuristics approximate the minimal allow its application complex compared with two other popular population genetics methodologies (PLINK KING), NAToRA shows best combination all relatives keeping largest possible number datasets tested also, similar effects on allele frequency spectrum Principal Component Analysis than PLINK KING. freely available, both as standalone tool can easily incorporated part pipeline, graphical web allows visualization networks. also accepts variety relationship metrics input, facilitates use. release genealogies simulator software used for different tests performed this study.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (20)
CITATIONS (15)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....