← ClaudeAtlas

histone-aggregationlisted

Build comprehensive histone mark maps by aggregating narrowPeak data across multiple ENCODE experiments, donors, and labs. Use when the user wants to answer "where is this histone mark present in my tissue?" by combining peak calls from multiple studies into a union peak set with confidence annotations. Handles cross-lab batch effects, broad vs narrow marks, and ENCODE blocklist filtering.
ammawla/encode-toolkit · ★ 35 · AI & Automation · score 79
Install: claude install-skill ammawla/encode-toolkit
# Aggregate Histone ChIP-seq Peaks Across Studies ## When to Use - User wants to combine histone ChIP-seq peaks across multiple ENCODE experiments for a tissue or cell type - User asks "where is H3K27ac in pancreas?" or "build a histone mark map for liver" - User needs a union peak set from multiple donors, labs, or replicates - User wants to create a consensus binding map from multiple ChIP-seq datasets - Example queries: "aggregate all H3K4me3 peaks in brain", "combine histone marks across donors", "build enhancer map from H3K27ac data" Build a comprehensive map of histone mark binding for a tissue/cell type by merging narrowPeak files from multiple ENCODE experiments into a union peak set. ## Scientific Rationale **The question**: "Does my tissue have this histone mark, and at what genomic locations?" This is a **detection/cataloging** question, not a differential one. Once a histone mark passes noise thresholds (ENCODE IDR, quality metrics), detection is binary — the mark is either bound or not. If detected in one donor but not another, that region is still a real binding site. Individual variation and technical differences (lab, depth, antibody lot) explain *absence*, not that *presence* is spurious. **Therefore: we want the UNION of all detections, not a consensus.** ### Literature Support - **ChIP-Atlas** (Oki et al. 2018, EMBO Reports, 597 citations): Integrated >70,000 public ChIP-seq datasets using union of all peak calls - **ENCODE Phase 3** (Gorkin et al.