Canu¶

Description¶

Canu is a fork of the Celera Assembler designed for high-noise single-molecule sequencing (such as the PacBio RSII or Oxford Nanopore MinION).

Environment Modules¶

Run module spider canu to find out what environment modules are available for this application.

Environment Variables¶

HPC_CANU_DIR - installation directory
HPC_CANU_BIN - executable directory
HPC_CANU_OBJ - object directory

Additional Usage Information¶

When running Canu you should specify Memory and CPU arguments that match the requested resources in your command. e.g.

#SBATCH --cpus-per-task=8 
#SBATCH --mem=10g

-maxThreads=8
-maxMemory=10g

If your canu jobs run out of memory make sure to set scheduled job resources in gridOptions and merylMemory respectively to make sure meryl does not exceed the requested memory. See the Canu Parameter Reference Page for more detailed information..

Citation¶

If you publish research that uses canu you have to cite it as follows:

Koren S, Walenz BP, Berlin K, Miller JR, Phillippy AM. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Research. (2017). http://doi.org/10.1101/gr.215087.116

Categories¶

biology, sequencing