Skip to content

ARC

Description

arc website

ARC is a pipeline which facilitates iterative, reference guided 'de nono' assemblies with the intent of reducing time in analysis and increasing accuracy of results by only considering those reads which should assemble together and reducing reference bias as compared to mapping based approaches. The software is designed to work in situations where a whole-genome assembly is not the objective, but rather when the researcher wishes to assemble discreet 'targets' contained within next-generation shotgun sequence data. ARC reduces the complexity of the traditionally difficult problem of assembly by subsetting the reads into small, manageable pieces which can then be assembled quickly and efficiently. Applications include those in which the researcher wishes to de novo assemble specific content and a set of semi-similar reference targets is available to initialize the assembly process. working directory and adjust it for your analysis.

Environment Modules

Run module spider arc to find out what environment modules are available for this application.

Environment Variables

To run ARC copy the $HPC_ARC_CONF/ARC_config.txt file to your current * HPC_ARC_DIR - installation directory * HPC_ARC_BIN - executable directory * HPC_ARC_CONF - sample configuration file directory * HPC_ARC_TEST - test data directory

Additional Usage Information

In preparing your ARC_config.txt file, be careful in how things are named. Do not start a non-numerical parameter with a number. ARC checks if the parameter starts with a number, and if it does, assumes the whole parameter is a number and tries to convert to integer.

For example a reference file that is named 5_genes.fasta would throw an error because ARC sees the 5, assumes the parameter is a number and tries to convert "5_genes.fasta" into an integer.

Categories

biology, ngs