rag-architect

Solid

Designs and implements production-grade RAG systems by chunking documents, generating embeddings, configuring vector stores, building hybrid search pipelines, applying reranking, and evaluating retrieval quality. Use when building RAG systems, vector databases, or knowledge-grounded AI applications requiring semantic search, document retrieval, context augmentation, similarity search, or embedding-based indexing.

AI & Automation · 9,440 stars · 802 forks · Updated 1 week ago · MIT


Quality Score: 94/100

Score components (weight, with the sub-score where the page shows one): Stars 20% (100), Recency 20%, Frontmatter 20%, Documentation 15% (100), Issue Health 10%, License 10% (100), Description 5% (100)

Skill Content

# RAG Architect

## Core Workflow

1. **Requirements Analysis**: Identify retrieval needs, latency constraints, accuracy requirements, and scale
2. **Vector Store Design**: Select database, schema design, indexing strategy, sharding approach
3. **Chunking Strategy**: Document splitting, overlap, semantic boundaries, metadata enrichment
4. **Retrieval Pipeline**: Embedding selection, query transformation, hybrid search, reranking
5. **Evaluation & Iteration**: Metrics tracking, retrieval debugging, continuous optimization

For each step, validate before moving on (see checkpoints below).

## Reference Guide

Load detailed guidance based on context:

| Topic | Reference | Load When |
|-------|-----------|-----------|
| Vector Databases | `references/vector-databases.md` | Comparing Pinecone, Weaviate, Chroma, pgvector, Qdrant |
| Embedding Models | `references/embedding-models.md` | Selecting embeddings, fine-tuning, dimension trade-offs |
| Chunking Strategies | `references/chunking-strategies.md` | Document splitting, overlap, semantic chunking |
| Retrieval Optimization | `references/retrieval-optimization.md` | Hybrid search, reranking, query expansion, filtering |
| RAG Evaluation | `references/rag-evaluation.md` | Metrics, evaluation frameworks, debugging retrieval |

## Implementation Examples

### 1. Chunking Documents

```python
from langchain.text_splitter import RecursiveCharacterTextSplitter

# Evaluate chunk_size on your domain data; never use 512 blindly
spl...
```
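Steps 4 and 5 of the Core Workflow mention hybrid search and metrics tracking without showing what either might look like. The following is a minimal, library-free sketch, not the skill's own implementation. It assumes reciprocal rank fusion (RRF) as the way to merge keyword and vector results, which the skill preview does not name, and recall@k as the retrieval metric. The function names, the document IDs, and the constant `k=60` (a commonly used RRF default) are all illustrative assumptions.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Merge several ranked lists of document IDs into one ranked list.

    Each document earns 1 / (k + rank) from every list it appears in,
    where rank starts at 1. k=60 is an assumed, conventional default.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by ID so output is deterministic.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


def recall_at_k(retrieved, relevant, k):
    """Fraction of the relevant document IDs that appear in the top-k results."""
    relevant = set(relevant)
    if not relevant:
        return 0.0
    hits = len(set(retrieved[:k]) & relevant)
    return hits / len(relevant)


# Hypothetical result lists from a keyword retriever and a vector retriever.
keyword_hits = ["doc_a", "doc_b", "doc_c"]
vector_hits = ["doc_c", "doc_a", "doc_d"]

fused = reciprocal_rank_fusion([keyword_hits, vector_hits])

# Hypothetical ground-truth labels for this query.
relevant_docs = {"doc_a", "doc_d"}
score = recall_at_k(fused, relevant_docs, k=2)
```

Tracking a metric such as `recall_at_k` over a fixed set of labelled queries is one way to run the "validate before moving on" check after changing chunk size, embedding model, or fusion settings. A reranking stage, if used, would reorder `fused` before the metric is computed.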

Details

Author: Jeffallan
Repository: Jeffallan/claude-skills
Created: 7 months ago
Last Updated: 1 week ago
Language: Python
License: MIT

Integrates with

LangChain (AI), Pinecone (Database), Weaviate (Database), Qdrant (Database)
