alterlab-blast

Solid

Runs NCBI BLAST+ 2.17.0 sequence searches from the command line: makeblastdb (with -parse_seqids), blastn/blastp/blastx/tblastn with tabular -outfmt 6/7 for parsing, correct -task choice (megablast vs blastn vs blastn-short), -taxids/-negative_taxids taxonomic scoping, and -mt_mode multithreading; plus a DIAMOND blastp --ultra-sensitive path for large protein searches. Warns that -max_target_seqs is a heuristic keep-count, not a top-N best-hits filter. Use when the user wants command-line BLAST, makeblastdb, a local BLAST database, blastn/blastp/blastx/tblastn searches, or DIAMOND protein search. For the Bio.Blast web NCBIWWW API prefer alterlab-biopython; for quick one-liner database lookups prefer alterlab-gget. Part of the AlterLab Academic Skills suite.

AI & Automation 27 stars 4 forks Updated today MIT

Install

View on GitHub

Quality Score: 87/100

Stars 20%
48
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# BLAST+ — Command-Line Sequence Search Run local NCBI **BLAST+ 2.17.0** searches end-to-end: build a database with `makeblastdb`, search it with `blastn` / `blastp` / `blastx` / `tblastn`, emit machine-parseable tabular output, and scope by taxonomy. For very large protein searches, hand off to **DIAMOND** `blastp --ultra-sensitive` (100x–10,000x the speed of BLAST, per the DIAMOND project). This is the **CLI / local-database** skill; it is deliberately distinct from the Biopython web API and the gget one-liner (see routing table below). > Bulk DB builds and large searches are CPU/IO-heavy and fully offline — good > candidates to run on local compute rather than burning API calls. ## When to Use This Skill Use this skill when the request involves any of: - "BLAST these sequences", "run blastn/blastp/blastx/tblastn", "command-line BLAST" - "build a local BLAST database", "makeblastdb", "index this FASTA for BLAST" - "search my reads against a local nt/nr database", "get tabular BLAST hits I can parse" - "scope the BLAST search to a taxon" (`-taxids` / `-negative_taxids`) - "BLAST is too slow on millions of proteins" → DIAMOND `blastp` - retrieving sequences out of a BLAST DB (`blastdbcmd`, requires `-parse_seqids`) ### Does NOT Trigger Route adjacent requests to the right sibling skill instead of forcing BLAST+: | The request is really about… | Route to | |------------------------------|----------| | The **web** BLAST API (`Bio.Blast.NCBIWWW.qblast`), or scripting BLA...

Details

Author
AlterLab-IEU
Repository
AlterLab-IEU/AlterLab-Academic-Skills
Created
2 months ago
Last Updated
today
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Solid

blast-sequence-search

BLAST skill for sequence similarity searching, homology detection, and database querying

1,313 Updated today
a5c-ai
AI & Automation Solid

alterlab-biopython

Manipulate biological sequences, parse FASTA/GenBank/PDB files, run phylogenetics, and access NCBI/PubMed programmatically via Biopython (Bio.SeqIO, Bio.Entrez, Bio.PDB, Bio.Blast). Use when scripting custom bioinformatics pipelines, batch-processing sequence files, automating BLAST, or fetching records from Entrez — for quick one-off database lookups use gget, for unified multi-service integration use bioservices. Part of the AlterLab Academic Skills suite.

27 Updated today
AlterLab-IEU
AI & Automation Solid

alterlab-qiime2-amplicon

Runs 16S/ITS amplicon (microbiome) analysis with the QIIME 2 amplicon distribution (2026.1; renamed to "qiime2" in 2026.4) in the correct order: manifest import, cutadapt trim-paired primer removal BEFORE dada2 denoise-paired (trunc-len chosen from the demux quality .qzv), feature-classifier classify-sklearn against a version-matched SILVA 138 or Greengenes2 classifier, and diversity core-metrics-phylogenetic — teaching the .qza/.qzv artifact-and-provenance model and the 2026.1 feature-table summarize change (the former summarize_plus). Use when the request mentions QIIME2, QIIME 2, qiime, 16S, 18S, ITS, amplicon, microbiome, ASV, DADA2 denoising, feature table, taxonomic classification, or core-metrics diversity. For downstream alpha/beta diversity, PCoA, and PERMANOVA on the exported feature table prefer alterlab-scikit-bio; this is conda-only (no pip install). Part of the AlterLab Academic Skills suite.

27 Updated today
AlterLab-IEU