Characteristics of 454 pyrosequencing data—enabling realistic simulation with flowsim

Pyrosequencing; Benchmarking; Sanger sequencing; Sequence (biology); MIT License
DOI: 10.1093/bioinformatics/btq365 Publication Date: 2010-09-07T17:41:46Z
ABSTRACT
Motivation: The commercial launch of 454 pyrosequencing in 2005 was a milestone in genome sequencing in terms of performance and cost. Throughout the three available releases, average read lengths have increased to ∼500 base pairs and are thus approaching those obtained from traditional Sanger sequencing. Study design of sequencing projects would benefit from being able to simulate experiments.
Results: We explore raw data to investigate its characteristics and derive empirical distributions for the flow values generated by pyrosequencing. Based on our findings, we implement Flowsim, a simulator that generates realistic files of arbitrary size from a given set of input DNA sequences. We finally use it to examine the impact of sequence lengths on the results of concrete whole-genome assemblies, and suggest its use in planning of sequencing projects, benchmarking of assembly methods and other fields.
Availability: Flowsim is freely available under the General Public License from http://blog.malde.org/index.php/flowsim/
Contact: susanne.balzer@imr.no; ketil.malde@imr.no
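To illustrate what simulating flow values from an input DNA sequence can look like, the following is a minimal sketch and not Flowsim's implementation. It assumes a cyclic flow order of TACG and replaces the empirical distributions described above with a simple Gaussian whose spread grows with homopolymer length. The function names and the parameters `sigma_base` and `sigma_slope` are hypothetical and chosen only for illustration.

```python
import random

FLOW_ORDER = "TACG"  # assumed cyclic nucleotide flow order


def ideal_flowgram(seq, flow_order=FLOW_ORDER):
    """Return the noise-free homopolymer length observed in each flow."""
    if any(base not in flow_order for base in seq):
        raise ValueError("sequence contains a base outside the flow order")
    flows = []
    pos = 0
    flow_index = 0
    while pos < len(seq):
        base = flow_order[flow_index % len(flow_order)]
        run = 0
        # Consume the whole homopolymer run that matches the current flow.
        while pos < len(seq) and seq[pos] == base:
            run += 1
            pos += 1
        flows.append(run)
        flow_index += 1
    return flows


def noisy_flowgram(flows, rng, sigma_base=0.05, sigma_slope=0.03):
    """Perturb ideal flow values with Gaussian noise (an illustrative
    stand-in for empirically derived distributions)."""
    noisy = []
    for n in flows:
        sigma = sigma_base + sigma_slope * n  # hypothetical noise model
        noisy.append(max(0.0, rng.gauss(n, sigma)))
    return noisy


def call_bases(flow_values, flow_order=FLOW_ORDER):
    """Round each flow value to the nearest integer and rebuild the read."""
    read = []
    for i, value in enumerate(flow_values):
        read.append(flow_order[i % len(flow_order)] * int(round(value)))
    return "".join(read)


if __name__ == "__main__":
    rng = random.Random(42)
    ideal = ideal_flowgram("TTACGGGA")
    simulated = noisy_flowgram(ideal, rng)
    print(ideal)
    print([f"{v:.2f}" for v in simulated])
    print(call_bases(simulated))
```

Because the noise in this sketch widens with the homopolymer length, longer runs are more likely to be rounded to the wrong length when the flow values are converted back to bases. That is the kind of over- and undercall behaviour a realistic simulator would need to capture with distributions fitted to real data.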