langchain-data-handling

Featured

Implement LangChain RAG pipelines with document loaders, text splitters, embeddings, and vector stores (Chroma, Pinecone, FAISS). Trigger: "langchain RAG", "langchain documents", "langchain vector store", "langchain embeddings", "document loaders", "text splitters", "retrieval".

AI & Automation | 2,249 stars | 312 forks | Updated today | MIT

Quality Score: 99/100

Stars (weight 20%): 100
Recency (weight 20%): 100
Frontmatter (weight 20%): 70
Documentation (weight 15%): 100
Issue Health (weight 10%): 50
License (weight 10%): 100
Description (weight 5%): 100

Skill Content

# LangChain Data Handling: RAG & Document Processing

## Overview

Build Retrieval-Augmented Generation (RAG) pipelines: load documents, split them into chunks, embed the chunks with OpenAI or Cohere, store the vectors in a vector database (FAISS, Chroma, Pinecone), and query them with retrieval chains.

## Prerequisites

- `@langchain/core` and `@langchain/openai` installed
- For vector stores: `npm install @langchain/community` (FAISS) or `npm install @langchain/pinecone @pinecone-database/pinecone`
- The loader and splitter examples below also import from `langchain` and `@langchain/textsplitters`

## Step 1: Document Loaders

```typescript
import { TextLoader } from "langchain/document_loaders/fs/text";
import { PDFLoader } from "@langchain/community/document_loaders/fs/pdf";
import { DirectoryLoader } from "langchain/document_loaders/fs/directory";
import { CSVLoader } from "@langchain/community/document_loaders/fs/csv";

// Note: PDFLoader relies on the `pdf-parse` peer dependency and CSVLoader on `d3-dsv`.

// Load a single file
const textDocs = await new TextLoader("./data/readme.md").load();
const pdfDocs = await new PDFLoader("./data/manual.pdf").load();

// Load an entire directory, routing each file to a loader by extension
const dirLoader = new DirectoryLoader("./data/", {
  ".txt": (path) => new TextLoader(path),
  ".pdf": (path) => new PDFLoader(path),
  ".csv": (path) => new CSVLoader(path),
});
const allDocs = await dirLoader.load();
console.log(`Loaded ${allDocs.length} documents`);
```

## Step 2: Text Splitting

```typescript
import { RecursiveCharacterTextSplitter } from "@langchain/textsplitters";

const splitter = new RecursiveCharacterTextSplitter({
  chunkSize: 1000, // max chars per chunk
  c...
```
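The stages named in the Overview (split, embed, store, retrieve) can be illustrated with a small dependency-free sketch. This is not LangChain code: `splitText`, `toyEmbed`, `cosineSimilarity`, and `ToyVectorStore` are hypothetical names, the splitter is a plain fixed-size character splitter with an assumed overlap parameter rather than the recursive strategy, and the letter-frequency "embedding" only stands in for a real model such as the OpenAI or Cohere embeddings mentioned above.

```typescript
// Toy sketch of the RAG data flow: split -> embed -> store -> retrieve.
// Every name here is hypothetical; none of it is LangChain API.

interface Chunk {
  text: string;
  vector: number[];
}

// Fixed-size character splitter with overlap between consecutive chunks.
function splitText(text: string, chunkSize: number, chunkOverlap: number): string[] {
  if (chunkSize <= 0 || chunkOverlap < 0 || chunkOverlap >= chunkSize) {
    throw new Error("require chunkSize > 0 and 0 <= chunkOverlap < chunkSize");
  }
  const chunks: string[] = [];
  const step = chunkSize - chunkOverlap;
  for (let start = 0; start < text.length; start += step) {
    chunks.push(text.slice(start, start + chunkSize));
    // Stop once the current chunk reaches the end of the text.
    if (start + chunkSize >= text.length) break;
  }
  return chunks;
}

// Stand-in embedding: counts of the letters a-z (assumption, not a real model).
function toyEmbed(text: string): number[] {
  const vector: number[] = [];
  for (let i = 0; i < 26; i++) vector.push(0);
  const lower = text.toLowerCase();
  for (let i = 0; i < lower.length; i++) {
    const idx = lower.charCodeAt(i) - 97;
    if (idx >= 0 && idx < 26) vector[idx] += 1;
  }
  return vector;
}

// Cosine similarity; returns 0 when either vector has zero length.
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// In-memory store that ranks chunks by similarity to the query.
class ToyVectorStore {
  private chunks: Chunk[] = [];

  add(texts: string[]): void {
    for (const text of texts) {
      this.chunks.push({ text, vector: toyEmbed(text) });
    }
  }

  search(query: string, k: number): string[] {
    const queryVector = toyEmbed(query);
    return this.chunks
      .map((c) => ({ text: c.text, score: cosineSimilarity(queryVector, c.vector) }))
      .sort((x, y) => y.score - x.score)
      .slice(0, k)
      .map((r) => r.text);
  }
}
```

In a real pipeline the splitter, embedding model, and store would be replaced by the LangChain components this skill covers; the sketch only shows how chunk size, overlap, and similarity ranking relate to each other.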

Details

Author: jeremylongshore
Repository: jeremylongshore/claude-code-plugins-plus-skills
Created: 7 months ago
Last Updated: today
Language: Python
License: MIT
