probalign¶

Description¶

Probalign uses partition function posterior probability estimates to compute maximum expected accuracy multiple sequence alignments. It performs statistically significantly better than the leading alignment programs Probcons v1.1, MAFFT v5.851, and MUSCLE v3.6 on BAliBASE 3.0, HOMSTRAD, and OXBENCH benchmarks. Probalign improvements are largest on datasets containing N/C terminal extensions and on datasets with long and heterogeneous length sequences. On heteregeneous length datasets containing repeats Probalign alignment accuracy is 10% and 15% than the other three methods when standard deviation of length is at least 300 and 400.

Environment Modules¶

Run module spider probalign to find out what environment modules are available for this application.

Environment Variables¶

HPC_PROBALIGN_DIR - installation directory
HPC_PROBALIGN_BIN - executable directory
HPC_PROBALIGN_DOC - documentation directory

Citation¶

If you publish research that uses probalign you have to cite it as follows:

Chikkagoudar, S., Roshan, U., & Livesay, D. R. (2007). eProbalign: generation and manipulation of multiple sequence alignments using partition function posterior probabilities. Nucleic Acids Research, 35(Web Server issue), W675-W677. https://doi.org/10.1093/nar/gkm267

Categories¶

biology, ngs