biopython-molecular-biology

Solid

Molecular biology toolkit: sequence manipulation, FASTA/GenBank/PDB I/O, NCBI Entrez, BLAST automation, pairwise/MSA alignment, Bio.PDB, phylogenetic trees. Use for batch processing, custom pipelines, format conversion, PubMed/GenBank queries. For quick gene lookups use gget; for multi-service REST APIs use bioservices.

AI & Automation 286 stars 26 forks Updated 4 days ago NOASSERTION

Install

View on GitHub

Quality Score: 82/100

Stars 20%

Recency 20%

100

Frontmatter 20%

Documentation 15%

100

Issue Health 10%

License 10%

100

Description 5%

100

Skill Content

# Biopython: Computational Molecular Biology Toolkit ## Overview Biopython is the standard open-source Python library for computational molecular biology, providing modular APIs for sequence handling, biological file parsing, NCBI database access, BLAST searches, protein structure analysis, and phylogenetics. It supports Python 3 and requires NumPy. ## When to Use - Parse and convert biological file formats (FASTA, GenBank, FASTQ, PDB, mmCIF, PHYLIP) - Fetch sequences or publications from NCBI databases (GenBank, PubMed, Protein) programmatically - Run and parse BLAST searches (remote NCBI or local BLAST+) - Perform pairwise or multiple sequence alignments with custom scoring - Analyze 3D protein structures — distances, angles, DSSP, superimposition - Build and visualize phylogenetic trees from sequence alignments - Calculate sequence statistics (GC content, molecular weight, melting temperature) - Batch-process thousands of sequences with custom filtering logic - Use `pysam` instead for reading SAM/BAM/CRAM alignment files and working with mapped reads; use `scikit-bio` instead for advanced ecological diversity metrics ## Prerequisites - **Python packages**: `biopython`, `numpy`, `matplotlib` (for tree visualization) - **Data requirements**: Sequence files (FASTA, GenBank, FASTQ) or accession IDs for NCBI access - **Environment**: Python 3.8+; NCBI Entrez requires email registration ```bash pip install biopython numpy matplotlib ``` ## Quick Start ```python from Bio...

Details

Author: jaechang-hits
Repository: jaechang-hits/SciAgent-Skills
Created: 5 months ago
Last Updated: 4 days ago
Language: Python
License: NOASSERTION

Integrates with

REST API · API

Bundled in these plugins

sciagent-skills

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Featured

biopython

Comprehensive molecular biology toolkit. Use for sequence manipulation, file parsing (FASTA/GenBank/PDB), phylogenetics, and programmatic NCBI/PubMed access (Bio.Entrez). Best for batch processing, custom bioinformatics pipelines, BLAST automation. For quick lookups use gget; for multi-service integration use bioservices.

726 Updated 1 weeks ago

LeonChaoX

Data & Documents Featured

biopython

31,973 Updated today

K-Dense-AI

Data & Documents Listed

biopython

13 Updated yesterday

tassiovale