PALADIN¶

Description¶

PALADIN is a protein sequence alignment tool designed for the accurate functional characterization of metagenomes.

PALADIN is based on BWA, and aligns sequences via read-mapping using BWT. PALADIN, however, offers the novel approach of aligning in the protein space. During the index phase, it processes the reference genome's nucleotide sequences and GTF/GFF annotation containing CDS entries, first converting these transcripts into the corresponding protein sequences, then creating the BWT and suffix array from these proteins. The process of translatation is skiped when providing a protein reference file (e.g., UniProt) for mapping. During the alignment phase, it attempts to find ORFs in the read sequences, then converts these to protein sequences, and aligns to the reference protein sequences.

Environment Modules¶

Run module spider paladin to find out what environment modules are available for this application.

Environment Variables¶

HPC_PALADIN_DIR - installation directory
HPC_PALADIN_BIN - executable directory
HPC_PALADIN_EXE - examples directory

Citation¶

If you publish research that uses paladin you have to cite it as follows:

Westbrook, A., Ramsdell, J., Schuelke, T., Normington, L., Bergeron, R. D., Thomas, W. K., & MacManes, M. D. (2017). PALADIN: protein alignment for functional profiling using protein family templates. Bioinformatics, 33(10), 1495-1497. https://doi.org/10.1093/bioinformatics/btx021

Categories¶

biology, phylogenetics