digital-archive

Solid

Digital archiving workflows with AI enrichment, entity extraction, and knowledge graph construction. Use when building content archives, implementing AI-powered categorization, extracting entities and relationships, or integrating multiple data sources. Covers patterns from the Jay Rosen Digital Archive project.

Code & Development 233 stars 44 forks Updated today MIT

Install

View on GitHub

Quality Score: 89/100

Stars 20%
79
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Digital archive methodology Patterns for building production-quality digital archives with AI-powered analysis and knowledge graph construction. ## Archive architecture ### Multi-source integration pattern ``` ┌─────────────────┐ ┌──────────────────┐ ┌────────────────┐ │ OCR Pipeline │ │ Web Scraping │ │ Social Media │ │ (newspapers) │ │ (articles) │ │ (transcripts) │ └────────┬────────┘ └────────┬─────────┘ └───────┬────────┘ │ │ │ └──────────────────────┼──────────────────────┘ │ ┌───────────▼───────────┐ │ Unified Schema │ │ (35+ fields) │ └───────────┬───────────┘ │ ┌──────────────────────┼──────────────────────┐ │ │ │ ┌────────▼────────┐ ┌──────────▼──────────┐ ┌───────▼───────┐ │ AI Enrichment │ │ Entity Extraction │ │ PDF Archive │ │ (Gemini) │ │ (Knowledge Graph) │ │ (WCAG 2.1) │ └────────┬────────┘ └──────────┬──────────┘ └───────┬───────┘ │ │ │ └──────────────────────┼──────────────────────┘ │ ┌───────────▼───────────┐ │ Google Sheets │ │ (primary database) │...

Details

Author
jamditis
Repository
jamditis/claude-skills-journalism
Created
5 months ago
Last Updated
today
Language
HTML
License
MIT

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Solid

archive

Archive session learnings, debugging solutions, and deployment logs to .archive/yyyy-mm-dd/ as indexed markdown with searchable tags. Use when completing a significant task, resolving a tricky bug, deploying, or when the user says "archive this". Maintains .archive/MEMORY.md index for cross-session knowledge reuse.

903 Updated yesterday
ReScienceLab
AI & Automation Listed

ai-data-engineering

Data pipelines, feature stores, and embedding generation for AI/ML systems. Use when building RAG pipelines, ML feature serving, or data transformations. Covers feature stores (Feast, Tecton), embedding pipelines, chunking strategies, orchestration (Dagster, Prefect, Airflow), dbt transformations, data versioning (LakeFS), and experiment tracking (MLflow, W&B).

368 Updated 5 months ago
ancoleman
AI & Automation Solid

digital-engagement-strategy

Develop digital content strategies including virtual exhibitions, online programming, social media campaigns, and digital collection access

1,034 Updated today
a5c-ai
Web & Frontend Solid

web-archiving

Web page archiving and retrieval from cached/deleted sources. Use when accessing unavailable pages, preserving web content, creating legal evidence archives, or building redundant archival workflows. Covers Wayback Machine, Archive.today, ArchiveBox, and evidence preservation tools.

233 Updated today
jamditis
DevOps & Infrastructure Listed

engineering-advanced

Advanced engineering patterns for AI-native products. Use when the user mentions agent design, RAG architecture, AI pipelines, MCP servers, API design best practices, CI/CD pipeline architecture, system design interviews, observability, infrastructure as code, or advanced engineering topics. Also triggers on: agent, RAG, retrieval augmented generation, MCP, API design, REST, GraphQL, CI/CD, GitHub Actions, Docker, Kubernetes, microservices architecture, event-driven, message queues, caching strategies, database design, system design.

1 Updated yesterday
ceoimperiumprojects