glean-migration-deep-dive

Migrate from Elasticsearch or Algolia to Glean: 1) export all documents from the source, 2) transform them to the Glean document schema (id, title, url, body, permissions), 3) create a datasource with `adddatasource`, 4) bulk index with `bulkindexdocuments`, 5) validate search quality with test queries, 6) switch the search UI to the Glean Client API. Trigger: "glean migration deep dive", "migration-deep-dive".
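Step 5 (validating search quality with test queries) could be approached by comparing the top results of the old and new engines for a fixed query set. The following is a minimal sketch under that assumption, not the skill's own method: `overlapAtK`, `flagRegressions`, and the `QueryRun` shape are hypothetical names, and the ranked ID lists are assumed to have been collected separately from each engine.

```typescript
// Fraction of the old engine's top-k result IDs that the new engine also returns in its top-k.
function overlapAtK(oldIds: string[], newIds: string[], k: number): number {
  const oldTop = oldIds.slice(0, k);
  if (oldTop.length === 0) return 1; // nothing to compare, treat as a match
  const newTop = new Set(newIds.slice(0, k));
  const shared = oldTop.filter((id) => newTop.has(id)).length;
  return shared / oldTop.length;
}

// Hypothetical record of one test query run against both engines.
interface QueryRun {
  query: string;
  oldIds: string[]; // ranked IDs from Elasticsearch or Algolia
  newIds: string[]; // ranked IDs from Glean
}

// Return the queries whose overlap falls below the threshold, for manual review.
function flagRegressions(runs: QueryRun[], k: number, threshold: number): string[] {
  return runs
    .filter((run) => overlapAtK(run.oldIds, run.newIds, k) < threshold)
    .map((run) => run.query);
}

// Illustrative data only: the IDs below are made up.
const runs: QueryRun[] = [
  { query: 'vacation policy', oldIds: ['a', 'b', 'c'], newIds: ['b', 'a', 'c'] },
  { query: 'vpn setup', oldIds: ['d', 'e', 'f'], newIds: ['x', 'y', 'd'] },
];
console.log(flagRegressions(runs, 3, 0.5));
```

Overlap is a coarse signal: a low score may simply mean the new engine ranks differently, so flagged queries are candidates for manual review rather than confirmed regressions.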

AI & Automation 2,266 stars 315 forks Updated today MIT

Quality Score: 99/100

Stars (20%): 100
Recency (20%): 100
Frontmatter (20%): 70
Documentation (15%): 100
Issue Health (10%): 50
License (10%): 100
Description (5%): 100

Skill Content

# Glean Migration Deep Dive

## Overview

A guide to migrating enterprise search from Elasticsearch or Algolia to Glean. It covers connector migration (replacing custom crawlers with Glean's push indexing API), permission model changes (mapping ACLs to Glean's datasource-level permissions), and full index rebuilds using `bulkindexdocuments`. A typical timeline is 2-4 weeks for a mid-size deployment with 100K-1M documents across multiple datasources.

## Migration Assessment

```typescript
import { Client } from '@elastic/elasticsearch';

// Assumption: the source is an Elasticsearch cluster whose URL is supplied via ELASTICSEARCH_URL.
const sourceClient = new Client({ node: process.env.ELASTICSEARCH_URL ?? 'http://localhost:9200' });

// Inventory the source indices and their document counts before migrating
const assessment = {
  source: process.env.SEARCH_PROVIDER ?? 'elasticsearch',
  indices: await sourceClient.cat.indices({ format: 'json' }),
  totalDocs: 0,
  connectors: [] as string[],
  permissionModel: '',
};

for (const idx of assessment.indices) {
  assessment.totalDocs += parseInt(idx['docs.count'] ?? '0', 10);
  // Skip records without an index name so `connectors` stays a string[]
  if (idx.index) assessment.connectors.push(idx.index);
}

assessment.permissionModel = assessment.source === 'elasticsearch' ? 'index-level' : 'api-key';

console.log(`Source: ${assessment.source}`);
console.log(`Indices: ${assessment.connectors.length} | Total docs: ${assessment.totalDocs}`);
console.log(`Permission model: ${assessment.permissionModel} → Glean datasource ACLs`);
```

## Step-by-Step Migration

### Phase 1: Prepare

Export all documents from the current search provider and map them to Glean's document schema. Each document needs `id`, `title`, `url`, `body`, and `permissions`.

```typescript
interface Gle...
```
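
The Phase 1 mapping onto `id`, `title`, `url`, `body`, and `permissions` might look roughly like the sketch below. It is an illustration under stated assumptions, not the skill's elided interface: `SourceRecord`, `MigrationDoc`, `toMigrationDoc`, and `chunk` are hypothetical names, the source field names (`content`, `allowed_users`) are invented, and the `permissions` shape is a simplified stand-in rather than Glean's actual permission schema.

```typescript
// Hypothetical shape of one exported source record (for example, an Elasticsearch hit).
interface SourceRecord {
  _id: string;
  _source: { title?: string; url?: string; content?: string; allowed_users?: string[] };
}

// Simplified stand-in for the five required fields; not Glean's exact schema.
interface MigrationDoc {
  id: string;
  title: string;
  url: string;
  body: string;
  permissions: { allowedUsers: string[] };
}

// Map one record to the target shape, returning null when a required field is missing.
function toMigrationDoc(rec: SourceRecord): MigrationDoc | null {
  const { title, url, content, allowed_users } = rec._source;
  if (!rec._id || !title || !url || !content) return null;
  return {
    id: rec._id,
    title,
    url,
    body: content,
    permissions: { allowedUsers: allowed_users ?? [] },
  };
}

// Split documents into fixed-size batches, e.g. one batch per bulk upload request.
function chunk<T>(items: T[], size: number): T[][] {
  if (size < 1) throw new Error('size must be >= 1');
  const batches: T[][] = [];
  for (let i = 0; i < items.length; i += size) batches.push(items.slice(i, i + size));
  return batches;
}

// Illustrative records only.
const exported: SourceRecord[] = [
  { _id: '1', _source: { title: 'Handbook', url: 'https://example.com/1', content: 'hello' } },
  { _id: '2', _source: { title: 'No body', url: 'https://example.com/2' } },
];
const docs = exported
  .map(toMigrationDoc)
  .filter((d): d is MigrationDoc => d !== null);
console.log(`${docs.length} of ${exported.length} records are ready to index`);
```

Returning `null` for incomplete records, instead of throwing, lets the export run to completion and leaves a count of skipped records to investigate before the bulk upload.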

Details

Author: jeremylongshore
Repository: jeremylongshore/claude-code-plugins-plus-skills
Created: 7 months ago
Last Updated: today
Language: Python
License: MIT

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Featured

glean-upgrade-migration

Check Glean developer changelog for API changes. Trigger: "glean upgrade migration", "upgrade-migration".

2,266 Updated today
jeremylongshore
AI & Automation Featured

algolia-migration-deep-dive

Migrate to Algolia from Elasticsearch, Typesense, or Meilisearch. Covers data migration, query translation, replaceAllObjects zero-downtime swap, and strangler fig traffic shifting. Trigger: "migrate to algolia", "switch to algolia", "algolia migration", "elasticsearch to algolia", "replace search engine", "algolia replatform".

2,266 Updated today
jeremylongshore
AI & Automation Featured

glean-core-workflow-b

Execute Glean secondary workflow: bulk document indexing, custom datasource connectors, and content lifecycle management via the Indexing API. Trigger: "glean bulk index", "glean custom connector", "glean datasource", "glean indexing".

2,266 Updated today
jeremylongshore
AI & Automation Featured

glean-hello-world

Index documents into Glean and search them back using the Indexing and Client APIs. Use when starting a new Glean custom connector, testing search quality, or learning the index/search pattern. Trigger: "glean hello world", "glean example", "glean index document", "glean search".

2,266 Updated today
jeremylongshore
AI & Automation Featured

glean-deploy-integration

Deploy Glean custom connectors as scheduled jobs on Cloud Run, Lambda, or Fly.io. Trigger: "deploy glean connector", "glean connector hosting", "schedule glean indexing".

2,266 Updated today
jeremylongshore