VIRCLUST¶
Description¶
VirClust is a tool for protein-based virus clustering.
Environment Modules¶
Run module spider virclust
to find out what environment modules are available for this application.
Environment Variables¶
- HPC_VIRCLUST_DIR - installation directory
- HPC_VIRCLUST_BIN - executable directory
- HPC_VIRCLUST_BLASTDB - blast db directory
- HPC_VIRCLUST_IPRSCANDB - InterProScan directory and database
- HPC_VIRCLUST_DB - virclust db directory for Efam, Efam-XC, PHROGS, pVOGSs, and VOGDB databases
Additional Usage Information¶
To run Virclust, use the following command: Rscript $HPC_VIRCLUST_BIN/VirClust_MASTER.R sing=conda condaenvpath=$HPC_VIRCLUST_DIR [...options]
interproscan=$HPC_VIRCLUST_IPRSCANDB #when annotating against InterProScan db
blastdb=$HPC_VIRCLUST_BLASTDB #when annotating against NR blast db
databases=$HPC_VIRCLUST_DB #when annotating against other db
Citation¶
If you publish research that uses virclust you have to cite it as follows:
Categories¶
biology, annotation