cocoindex

Solid

Comprehensive toolkit for developing with the CocoIndex library. Use when users need to create data transformation pipelines (flows), write custom functions, or operate flows via CLI or API. Covers building ETL workflows for AI data processing, including embedding documents into vector databases, building knowledge graphs, creating search indexes, or processing data streams with incremental updates.

Data & Documents 27,984 stars 2901 forks Updated today MIT

Install

View on GitHub

Quality Score: 93/100

Stars 20%

100

Recency 20%

100

Frontmatter 20%

Documentation 15%

100

Issue Health 10%

License 10%

100

Description 5%

100

Skill Content

# CocoIndex ## Overview CocoIndex is an ultra-performant real-time data transformation framework for AI with incremental processing. This skill enables building **indexing flows** that extract data from sources, apply transformations (chunking, embedding, LLM extraction), and export to targets (vector databases, graph databases, relational databases). **Core capabilities:** 1. **Write indexing flows** - Define ETL pipelines using Python 2. **Create custom functions** - Build reusable transformation logic 3. **Operate flows** - Run and manage flows using CLI or Python API **Key features:** - Incremental processing (only processes changed data) - Live updates (continuously sync source changes to targets) - Built-in functions (text chunking, embeddings, LLM extraction) - Multiple data sources (local files, S3, Azure Blob, Google Drive, Postgres) - Multiple targets (Postgres+pgvector, Qdrant, LanceDB, Neo4j, Kuzu) **For detailed documentation:** <https://cocoindex.io/docs/> **Search documentation:** <https://cocoindex.io/docs/search?q=url%20encoded%20keyword> ## When to Use This Skill Use when users request: - "Build a vector search index for my documents" - "Create an embedding pipeline for code/PDFs/images" - "Extract structured information using LLMs" - "Build a knowledge graph from documents" - "Set up live document indexing" - "Create custom transformation functions" - "Run/update my CocoIndex flow" ## Flow Writing Workflow ### Step 1: Understand Requirements As...

Details

Author: davila7
Repository: davila7/claude-code-templates
Created: 11 months ago
Last Updated: today
Language: Python
License: MIT

Integrates with

Anthropic · AI Azure · Cloud PostgreSQL · Database Qdrant · Database

Similar Skills

Semantically similar based on skill content — not just same category

Data & Documents Listed

cocoindex

353 Updated today

aiskillstore

Data & Documents Listed

cocoindex

69 Updated 2 weeks ago

cocoindex-io

AI & Automation Listed

couchbase-ai-applications

Design and build AI-powered applications on Couchbase, including RAG pipelines, vector search architecture, embedding strategies, and AI agent data patterns. Use whenever the user asks about RAG, retrieval-augmented generation, vector search for AI, Hyperscale Vector Index (HVI), Composite Vector Index (CVI), Search Vector Index (SVI), embedding pipelines, semantic search, AI agent memory, grounding LLMs with Couchbase, agentic data patterns, billion-scale vector search, multi-vector search, AI application architecture, or 'how do I build an AI app with Couchbase.' Distinct from couchbase-fts (which covers FTS index mechanics and query syntax) — this skill is about end-to-end AI application design: the data model, embedding pipeline, index type selection, retrieval strategy, and integration with LLM frameworks. Use proactively when the user is building AI features or has a use case involving language models, embeddings, or semantic retrieval.

1 Updated 2 weeks ago

celticht32