LotuS2: an ultrafast and highly accurate tool for amplicon sequencing analysis
0301 basic medicine
0303 health sciences
670
10203 Bioinformatics (Computational Biology) (applications to be 10610)
QR100-130
Short read
Reproducibility of Results
High-Throughput Nucleotide Sequencing
Sequence Analysis, DNA
Amplicon data analysis
Amplicon sequencing
Microbial ecology
Soil
03 medical and health sciences
RNA, Ribosomal, 16S
Bioinformatics and Systems Biology (methods development to be 10203)
Microbiome
16S rRNA
Sequence Analysis
Long read
Software
DOI:
10.1186/s40168-022-01365-1
Publication Date:
2022-10-19T02:03:32Z
AUTHORS (7)
ABSTRACT
Abstract
Background
Amplicon sequencing is an established and cost-efficient method for profiling microbiomes. However, many available tools to process this data require both bioinformatics skills and high computational power to process big datasets. Furthermore, there are only few tools that allow for long read amplicon data analysis. To bridge this gap, we developed the LotuS2 (less OTU scripts 2) pipeline, enabling user-friendly, resource friendly, and versatile analysis of raw amplicon sequences.
Results
In LotuS2, six different sequence clustering algorithms as well as extensive pre- and post-processing options allow for flexible data analysis by both experts, where parameters can be fully adjusted, and novices, where defaults are provided for different scenarios.
We benchmarked three independent gut and soil datasets, where LotuS2 was on average 29 times faster compared to other pipelines, yet could better reproduce the alpha- and beta-diversity of technical replicate samples. Further benchmarking a mock community with known taxon composition showed that, compared to the other pipelines, LotuS2 recovered a higher fraction of correctly identified taxa and a higher fraction of reads assigned to true taxa (48% and 57% at species; 83% and 98% at genus level, respectively). At ASV/OTU level, precision and F-score were highest for LotuS2, as was the fraction of correctly reported 16S sequences.
Conclusion
LotuS2 is a lightweight and user-friendly pipeline that is fast, precise, and streamlined, using extensive pre- and post-ASV/OTU clustering steps to further increase data quality. High data usage rates and reliability enable high-throughput microbiome analysis in minutes.
Availability
LotuS2 is available from GitHub, conda, or via a Galaxy web interface, documented at http://lotus2.earlham.ac.uk/.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (58)
CITATIONS (54)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....