palantir-incident-runbook

Featured

Execute Palantir Foundry incident response with triage, mitigation, and postmortem. Use when responding to Foundry-related outages, API failures, or build pipeline incidents. Trigger with phrases like "palantir incident", "foundry outage", "palantir down", "foundry emergency", "palantir broken".

AI & Automation 2,266 stars 315 forks Updated today MIT

Install

View on GitHub

Quality Score: 99/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Palantir Incident Runbook ## Overview Rapid incident response for Foundry-related outages: API failures, transform build failures, authentication issues, and data pipeline stalls. ## Prerequisites - Access to application logs and Foundry build history - Foundry service user credentials for health checks - On-call escalation path defined ## Instructions ### Step 1: Triage (First 5 Minutes) ```bash set -euo pipefail echo "=== Foundry Incident Triage ===" echo "Time: $(date -u)" # 1. Check if Foundry itself is down curl -s -o /dev/null -w "Foundry API: HTTP %{http_code}\n" \ -H "Authorization: Bearer $FOUNDRY_TOKEN" \ "https://$FOUNDRY_HOSTNAME/api/v2/ontologies" || echo "FOUNDRY UNREACHABLE" # 2. Check our app health curl -s http://localhost:8080/health | python -m json.tool # 3. Check recent error logs grep -c "ApiError\|status_code.*[45][0-9][0-9]" /var/log/app/app.log | tail -1 ``` ### Step 2: Classify Severity | Severity | Criteria | Response Time | |----------|----------|---------------| | P1 Critical | Foundry API completely unreachable, all operations failing | Immediate | | P2 High | Intermittent 429/5xx errors, degraded performance | 15 minutes | | P3 Medium | Single transform failing, non-critical pipeline stalled | 1 hour | | P4 Low | Deprecation warnings, performance degradation | Next business day | ### Step 3: Common Incident Playbooks **Playbook A: Authentication Failure (401/403)** ```bash # 1. Verify token is set echo "Token set: ${FOUNDRY_TOKEN...

Details

Author
jeremylongshore
Repository
jeremylongshore/claude-code-plugins-plus-skills
Created
7 months ago
Last Updated
today
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Featured

palantir-common-errors

Diagnose and fix Palantir Foundry common errors and API exceptions. Use when encountering Foundry errors, debugging failed API calls, or troubleshooting transform build failures. Trigger with phrases like "palantir error", "fix palantir", "foundry not working", "debug foundry", "palantir 401 403".

2,266 Updated today
jeremylongshore
AI & Automation Featured

apollo-incident-runbook

Apollo.io incident response procedures. Use when handling Apollo outages, debugging production issues, or responding to integration failures. Trigger with phrases like "apollo incident", "apollo outage", "apollo down", "apollo production issue", "apollo emergency".

2,266 Updated today
jeremylongshore
AI & Automation Featured

figma-incident-runbook

Respond to Figma API outages, auth failures, and rate limit incidents. Use when Figma integration is down, experiencing errors, or running post-incident reviews for Figma-related failures. Trigger with phrases like "figma incident", "figma outage", "figma down", "figma broken", "figma emergency".

2,266 Updated today
jeremylongshore
AI & Automation Featured

fireflies-incident-runbook

Execute Fireflies.ai incident response with triage, remediation, and postmortem. Use when responding to Fireflies.ai API outages, auth failures, or webhook delivery problems. Trigger with phrases like "fireflies incident", "fireflies outage", "fireflies down", "fireflies on-call", "fireflies emergency", "fireflies broken".

2,266 Updated today
jeremylongshore
AI & Automation Featured

palantir-prod-checklist

Execute Palantir Foundry production deployment checklist and rollback procedures. Use when deploying Foundry integrations to production, preparing for launch, or implementing go-live procedures. Trigger with phrases like "palantir production", "deploy foundry", "palantir go-live", "foundry launch checklist".

2,266 Updated today
jeremylongshore