
Neptune: A Bioinformatics Tool for Rapid Discovery of Genomic Variation in Bacterial Populations

0303 health sciences 03 medical and health sciences

DOI: 10.1101/032227

Publication Date: 2015-11-19T06:12:33Z


AUTHORS (12)

Eric Marinier

Rahat Zaheer

Chrystal Berry

Kelly Weedmark

Michael Domaratzki

Philip Mabon

Natalie Knox

Aleisha Reimer

Morag Graham

Linda Chui

Gary Van Domselaar

ABSTRACT

The ready availability of vast amounts of genomic sequence data has created the need to rethink comparative genomics algorithms using “big data” approaches. Neptune is an efficient system for rapidly locating differentially abundant genomic content in bacterial populations using an exact k-mer matching strategy, while accommodating k-mer mismatches. Neptune’s loci discovery process identifies sequences that are sufficiently common to a group of target sequences and sufficiently absent from non-targets using probabilistic models. Neptune uses parallel computing to efficiently identify and extract these loci from draft genome assemblies without requiring multiple sequence alignments or other computationally expensive comparative sequence analyses. Tests on simulated and real data sets showed that Neptune rapidly identifies regions that are both sensitive and specific. We demonstrate that this system can identify trait-specific loci from different bacterial lineages. Neptune is broadly applicable for comparative bacterial analyses, yet will particularly benefit pathogenomic applications, owing to efficient and sensitive discovery of differentially abundant genomic loci.
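The idea of finding sequence content that is common to targets and absent from non-targets can be illustrated with a minimal sketch. This is not Neptune's implementation: the function names, the fixed fraction thresholds (`min_target_frac`, `max_non_target_frac`), and the toy sequences are assumptions made for illustration. The sketch uses exact k-mer matching only and leaves out the mismatch tolerance, probabilistic models, reverse complements, and parallelism described in the abstract.

```python
from collections import Counter


def kmers(sequence, k):
    """Return the set of all exact k-mers (substrings of length k) in a sequence."""
    return {sequence[i:i + k] for i in range(len(sequence) - k + 1)}


def signature_kmers(targets, non_targets, k,
                    min_target_frac=1.0, max_non_target_frac=0.0):
    """Hypothetical helper: k-mers frequent in targets and rare in non-targets.

    The fixed thresholds are an illustrative stand-in, not the probabilistic
    models the abstract refers to.
    """
    # Count in how many target sequences each k-mer occurs.
    target_counts = Counter()
    for seq in targets:
        target_counts.update(kmers(seq, k))

    # Count in how many non-target sequences each k-mer occurs.
    non_target_counts = Counter()
    for seq in non_targets:
        non_target_counts.update(kmers(seq, k))

    selected = set()
    for kmer, count in target_counts.items():
        target_frac = count / len(targets)
        if non_targets:
            non_target_frac = non_target_counts[kmer] / len(non_targets)
        else:
            non_target_frac = 0.0
        if target_frac >= min_target_frac and non_target_frac <= max_non_target_frac:
            selected.add(kmer)
    return selected


if __name__ == "__main__":
    # Toy sequences, invented for this example.
    targets = ["ACGTACGGA", "TTACGGATT"]
    non_targets = ["ACGTACGTT"]
    print(sorted(signature_kmers(targets, non_targets, k=4)))
```

With the default thresholds, a k-mer is kept only if it occurs in every target sequence and in no non-target sequence. Loosening either threshold trades specificity for sensitivity.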


