MOSAIK¶
Description¶
MOSAIK is a reference-guided assembler comprising of four main modular programs:
MosaikBuild converts various sequence formats into Mosaik’s native read format. MosaikAligner pairwise aligns each read to a specified series of reference sequences. MosaikSort resolves paired-end reads and sorts the alignments by the reference sequence coordinates. Finally, MosaikText converts alignments to different text-based formats.
At this time, the workflow consists of supplying sequences in FASTA, FASTQ, Illumina Bustard & Gerald, or SRF file formats and producing results in the BLAT axt, the BAM/SAM, the UCSC Genome Browser bed, or the Illumina ELAND formats.
Environment Modules¶
Run module spider mosaik to find out what environment modules are available for this application.
Environment Variables¶
- HPC_MOSAIK_DIR - installation directory
- HPC_MOSAIK_BIN - executable directory
- HPC_MOSAIK_NET - network file directory
- MOSAIK_TMP - directory for temporary files
Additional Usage Information¶
If you would like to use the provided network files with MosaikAligner use the $HPC_MOSAIK_NET variable for the location of network file directory. E.g.
MosaikAligner -annpe $HPC_MOSAIK_NET/2.1.26.pe.100.0065.ann \
-annse $HPC_MOSAIK_NET/2.1.26.se.100.005.ann -in ...
Citation¶
If you publish research that uses Mosaik you have to cite it as follows:
Lee, W.-P., Stromberg, M. P., Ward, A., Stewart, C., Garrison, E. P., & Marth, G. T. (2014). MOSAIK: A hash-based algorithm for accurate next-generation sequencing short-read mapping. PLoS ONE, 9(3), e90581. https://doi.org/10.1371/journal.pone.0090581 
Categories¶
biology, alignment