NCBI BLAST+¶
Description¶
The Basic Local Alignment Search Tool (BLAST) is the most widely used sequence similarity tool. There are versions of BLAST that compare protein queries to protein databases, nucleotide queries to nucleotide databases, as well as versions that translate nucleotide queries or databases in all six frames and compare to protein databases or queries. PSI-BLAST produces a position-specific-scoring-matrix (PSSM) starting with a protein query, and then uses that PSSM to perform further searches. It is also possible to compare a protein or nucleotide query to a database of PSSM’s.
- BLASTDB - BLAST database location
Environment Modules¶
Run module spider ncbi_blast to find out what environment modules are available for this application.
Environment Variables¶
- HPC_NCBI_BLAST_DIR - installation directory
- HPC_NCBI_BLAST_BIN - executable directory
- HPC_NCBI_BLAST_DOC - documents directory
Additional Usage Information¶
BLASTDB Location¶
All databases are located in /data/reference/blast/db. The default database directory is a symlink to the latest release, but the older releases are still available. Its location is set automatically by the ncbi_blast module via the $BLASTDB environment variable. To override the default database location to perhaps use an older release or your own directory run export BLASTDB=/some/path either on the command line or in the job script as appropriate. E.g. export BLASTDB=/data/reference/blast/201506. Afterwards, continue calling databases by name without the full path. For example, blastx -db nr ....
Hosted BLAST Databases¶
NCBI Databases¶
We periodically mirror the entire NCBI BLAST reference databases. Below is a list of the current BLAST databases, please note this list is to subject to change with updates.
28S_fungal_sequences16S_ribosomal_RNAITS_RefSeq_FungiBetacoronavirus18S_fungal_sequencesITS_eukaryote_sequencesLSU_eukaryote_rRNALSU_prokaryote_rRNASSU_eukaryote_rRNAenv_ntenv_nrhuman_genomelandmarkmitomouse_genomenrnt_eukntnt_othersnt_proknt_virusespataapatntpdbaapdbntref_euk_rep_genomesref_prok_rep_genomesref_viroids_rep_genomesref_viruses_rep_genomesrefseq_select_rnarefseq_select_protrefseq_proteinrefseq_rnaswissprottsa_nrtsa_nttaxdbcore_nt
Custom Databases¶
We can also build custom databases upon request. As an added benefit, when we host a custom database and make it available to all HiPerGator users; it won't consume your group's blue quota. Please submit a support request for more information.
Alligator.miss.v0.2- Alligator mississippiensis v. 0.2 buildaliivibrio_fischeri- Aliivibrio Fischeri (ASM1180v1)a_baumannii-AB0057- A. baumannii str. AB0057aplCal3- A. californica 3.0 WGS assembly, 4331 contigsa_thaliana_tair10_ufrcbaciSubtcamelus_dromedarius_ufrc- Camelus Dromedarius` - JDVD01000001.1CFSAN001992_ufrcchlaCavGPIC- Chlamydia psittaci (GPIC)chlaPneumAR39- Chlamydia PneumoniaechlaTracA- Chlamydia trachomatis serovar AchlaTracD- Chlamydia trachomatis serovar DchlaTracL2- Chlamydia trachomatis serovar L2chlaTracMurNigg- Chlamydia muridarumcp003200_1_ufrccp012090_ufrcdanio_rerio_grcz10_ufrcdanio_rerio_zv9_75_rm_toplevel_ufrcDanio Rerio(GRCZ10)Danio Rerio(zv9)daphnia_pulex_alldmel-r6DROME_prot- Deep Metazoan Project protein databasee_faecalis_GCA_000172575Escherichia_colieucalyptus_grandis_2Eucalyptus Grandis- Eucalyptus Grandis v2.0fa_fb_1fa_rr_1GCF_000001405GCF_000001635GCF_000007585grch38GRCh38.p11- Human Genome assembly GRCh38.p11hg18hg19hhv8gk18hhv8jsc1klebsiella_pneumoniae_CAV1596Klebsiella Pneumoniae(CAV1596)Klebsiella Pneumoniae(HS11286)Klebsiella Pneumoniae(MGH78578)Klebsiella_pneumoniae_CAV1596-97klebsiella_pneumoniae_mgh_78578_ufrclsu108- LSUReflsu111- LSURefLSUParc_115- [1]LSURef_115- [2]md5nr- A comprehensive non-redundant protein databaseMHV4_NC_001826_2mm10mm9MRSAplasmids2014m_tuberculosis-CDC1551- M. tuberculosis str CDC1551m_tuberculosis_H37Rv_ufrcmusa_acuminata_v2_ufrcn_meningitides_cfam18n_meningitidis_serB_MC58Oreochromis_niloticus_oreNil2_ufrcoryx_leucoryx_ufrc- Oryx Leucoryx - oryxL1s1Phum_Geneious_APhum_Geneious_BPhumU1_USDA_sc- Pediculus humanus USDA suupercontigsPhytophthora_parasitica_ufrcp_schaeffi_v0_1_bboyd- Bret Boyd's buildp_tremula_alba_717_v1P Tremula x Alba 717-1B4 v1.1p_trichocarpa_v3_0rfam_10_1- release 10.1rfam_11- release 11rn5- Ratticus norvegicus rn5 releasern6_draft_73_smSalmonella enterica subsp. enterica serovar Javiana str. CFSAN001992salmTyph14028Ss_aureuss_bicolor_v2s_entericashigella_flexneri_2a_2457T_ufrcshigella_phage_POCJ13_ufrcSilba_123_SSURefSilva_128_ss_nr99_ufrcs_mutans-UA159-NC_004350Solanum_pimpinellifolium_ufrcSorbi1Sorbi1.21ssu108nr- SSURef NRssu111nr- SSURef NRSSUParc_115- [3]SSURef_NR99_115- [4]taxonomy4blastt_manatus_29620vcarteri_317_v2v_cholerae_o1_2010el_1786vibrChol1- Vibrio cholerae O1 biovar eltor str. N16961vibrChol_O395_1- Vibrio cholerae O395Vibrio_phage_CTX_HQ224500vibrVuln_CMCP6_1- Vibrio vulnificus CMCP6x_perforanszea_mayszea_mays_AGPv4_ensembl_ufrczea_mays_AGPv4_sm_enseml_ufrczika_kx_922705_ufrcZmB73_RefGen_v2_ufrc
Categories¶
biology