cocoindex

Solid

Comprehensive toolkit for developing with the CocoIndex library. Use when users need to create data transformation pipelines (flows), write custom functions, or operate flows via CLI or API. Covers building ETL workflows for AI data processing, including embedding documents into vector databases, building knowledge graphs, creating search indexes, or processing data streams with incremental updates.

Data & Documents 27,984 stars 2901 forks Updated today MIT

Install

View on GitHub

Quality Score: 93/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# CocoIndex ## Overview CocoIndex is an ultra-performant real-time data transformation framework for AI with incremental processing. This skill enables building **indexing flows** that extract data from sources, apply transformations (chunking, embedding, LLM extraction), and export to targets (vector databases, graph databases, relational databases). **Core capabilities:** 1. **Write indexing flows** - Define ETL pipelines using Python 2. **Create custom functions** - Build reusable transformation logic 3. **Operate flows** - Run and manage flows using CLI or Python API **Key features:** - Incremental processing (only processes changed data) - Live updates (continuously sync source changes to targets) - Built-in functions (text chunking, embeddings, LLM extraction) - Multiple data sources (local files, S3, Azure Blob, Google Drive, Postgres) - Multiple targets (Postgres+pgvector, Qdrant, LanceDB, Neo4j, Kuzu) **For detailed documentation:** <https://cocoindex.io/docs/> **Search documentation:** <https://cocoindex.io/docs/search?q=url%20encoded%20keyword> ## When to Use This Skill Use when users request: - "Build a vector search index for my documents" - "Create an embedding pipeline for code/PDFs/images" - "Extract structured information using LLMs" - "Build a knowledge graph from documents" - "Set up live document indexing" - "Create custom transformation functions" - "Run/update my CocoIndex flow" ## Flow Writing Workflow ### Step 1: Understand Requirements As...

Details

Author
davila7
Repository
davila7/claude-code-templates
Created
11 months ago
Last Updated
today
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category

Data & Documents Listed

cocoindex

Comprehensive toolkit for developing with the CocoIndex library. Use when users need to create data transformation pipelines (flows), write custom functions, or operate flows via CLI or API. Covers building ETL workflows for AI data processing, including embedding documents into vector databases, building knowledge graphs, creating search indexes, or processing data streams with incremental updates.

353 Updated today
aiskillstore
Data & Documents Listed

cocoindex

Comprehensive toolkit for developing with the CocoIndex library. Use when users need to create data transformation pipelines (flows), write custom functions, or operate flows via CLI or API. Covers building ETL workflows for AI data processing, including embedding documents into vector databases, building knowledge graphs, creating search indexes, or processing data streams with incremental updates.

69 Updated 2 weeks ago
cocoindex-io
AI & Automation Listed

couchbase-ai-applications

Design and build AI-powered applications on Couchbase, including RAG pipelines, vector search architecture, embedding strategies, and AI agent data patterns. Use whenever the user asks about RAG, retrieval-augmented generation, vector search for AI, Hyperscale Vector Index (HVI), Composite Vector Index (CVI), Search Vector Index (SVI), embedding pipelines, semantic search, AI agent memory, grounding LLMs with Couchbase, agentic data patterns, billion-scale vector search, multi-vector search, AI application architecture, or 'how do I build an AI app with Couchbase.' Distinct from couchbase-fts (which covers FTS index mechanics and query syntax) — this skill is about end-to-end AI application design: the data model, embedding pipeline, index type selection, retrieval strategy, and integration with LLM frameworks. Use proactively when the user is building AI features or has a use case involving language models, embeddings, or semantic retrieval.

1 Updated 2 weeks ago
celticht32