NFDI4DS | UHH-SEMS - Publication Details

Fast Statistical Alignment

Multiple sequence alignment Alignment-free sequence analysis Computational statistics Benchmark (surveying) Centroid Smith–Waterman algorithm Structural alignment

DOI: 10.1371/journal.pcbi.1000392 Publication Date: 2009-05-28T21:27:18Z

Abstract Supplemental Material References Cited by

AUTHORS (8)

Robert K. Bradley

Adam Roberts

Michael Smoot

Sudeep Juvekar

Jaeyoung Do

Colin Dewey

Ian Holmes

Lior Pachter

ABSTRACT

We describe a new program for the alignment of multiple biological sequences that is both statistically motivated and fast enough problem sizes arise in practice. Our Fast Statistical Alignment based on pair hidden Markov models which approximate an insertion/deletion process tree uses sequence annealing algorithm to combine posterior probabilities estimated from these into alignment. FSA its explicit statistical model produce alignments are accompanied by estimates accuracy uncertainty every column character alignment--previously available only with programs use computationally-expensive Chain Monte Carlo approaches--yet can align thousands long sequences. Moreover, utilizes unsupervised query-specific learning procedure parameter estimation leads improved benchmark reference comparison existing programs. The centroid approach taken FSA, combination procedure, drastically reduces amount false-positive data given other methods. companion visualization tool exploring be used via web interface at http://orangutan.math.berkeley.edu/fsa/, source code http://fsa.sourceforge.net/.

SUPPLEMENTAL MATERIAL

Coming soon ....

REFERENCES (66)

CITATIONS (301)

EXTERNAL LINKS

OPENAIRE - Products CROSSREF - Publications OPENALEX - Publications

PlumX Metrics

Fast Statistical Alignment

RECOMMENDATIONS

FAIR ASSESSMENT

Coming soon ....

JUPYTER LAB

Coming soon ....