ena-database

Solid

ENA REST API for sequences, reads, assemblies, and annotations. Portal API search, Browser API retrieval (XML/FASTA/EMBL), file reports for FASTQ/BAM URLs, taxonomy, cross-refs. For multi-DB Python use bioservices; for NCBI-only use pubmed-database or Biopython Entrez.

Data & Documents 286 stars 26 forks Updated 4 days ago NOASSERTION

Install

View on GitHub

Quality Score: 82/100

Stars 20%

Recency 20%

100

Frontmatter 20%

Documentation 15%

100

Issue Health 10%

License 10%

100

Description 5%

100

Skill Content

# ENA Database — European Nucleotide Archive Programmatic Access ## Overview The European Nucleotide Archive (ENA) is EMBL-EBI's comprehensive nucleotide sequence database, encompassing raw sequencing reads, genome assemblies, annotated sequences, and associated metadata. It mirrors and extends INSDC data (GenBank, DDBJ). All access is via REST APIs with no authentication required. ## When to Use - Searching for sequencing studies, samples, or experiments by organism, project, or keyword - Downloading raw FASTQ/BAM files for reanalysis of public sequencing datasets - Retrieving genome assemblies with quality statistics (N50, contig count, genome size) - Fetching nucleotide sequences in FASTA or EMBL flat-file format by accession - Exploring taxonomic lineage and finding organisms by partial name - Cross-referencing ENA records with external databases (ArrayExpress, UniProt, PDB) - Building bulk download lists for large-scale sequencing projects - For **multi-database Python queries** (ENA + UniProt + KEGG), prefer `bioservices` instead - For **NCBI-specific queries** (PubMed literature, GenBank records), use `pubmed-database` or Biopython Entrez ## Prerequisites ```bash pip install requests ``` **API constraints**: - **Rate limit**: 50 requests per second across all ENA APIs - **No authentication** required - **Large result sets**: use pagination (`limit` + `offset`) or streaming (`limit=0` for TSV download) - Portal API base: `https://www.ebi.ac.uk/ena/portal/api` - B...

Details

Author: jaechang-hits
Repository: jaechang-hits/SciAgent-Skills
Created: 5 months ago
Last Updated: 4 days ago
Language: Python
License: NOASSERTION

Integrates with

REST API · API

Bundled in these plugins

sciagent-skills

Similar Skills

Semantically similar based on skill content — not just same category

API & Backend Solid

ensembl-database

Ensembl REST API for gene/transcript/variant annotations in 300+ species. Gene info by symbol/ID, sequence, cross-refs (HGNC, RefSeq, UniProt), regulatory features. For bulk local use pyensembl; for pathways use kegg-database.

286 Updated 4 days ago

jaechang-hits

API & Backend Solid

encode-database

ENCODE Portal REST API for regulatory genomics: TF ChIP-seq, ATAC-seq/DNase-seq peaks, histone marks, and RNA-seq across 1000+ cell types. Search experiments by assay/biosample/target; download BED/bigWig; retrieve SCREEN cCREs by region or gene. Use to annotate variants with regulatory tracks, find open chromatin in a cell type, or fetch peak files for ChIP/ATAC analysis. For regulatory variant scoring use regulomedb-database; for GWAS associations use gwas-database.

286 Updated 4 days ago

jaechang-hits

API & Backend Solid

gene-database

NCBI Gene via E-utilities: curated records across 1M+ taxa. Official symbols, aliases, RefSeq IDs, summaries, coordinates, GO, interactions. Use for gene ID resolution and cross-species function queries. For sequences use Ensembl; for expression use geo-database.

286 Updated 4 days ago

jaechang-hits