← ClaudeAtlas

autoresearchlisted

Autonomous skill-prompt optimization — Karpathy-style mutate/score/keep loop on SKILL.md. Triggers "autoresearch", "optimize skill", "tune", "evolve" a skill, "prompt optimization".
darkroomengineering/cc-settings · ★ 29 · AI & Automation · score 85
Install: claude install-skill darkroomengineering/cc-settings
# AutoResearch Autonomous skill optimization. You modify a skill's prompt, test it, keep improvements, revert failures. Repeat forever. Adapted from [Karpathy's autoresearch](https://github.com/karpathy/autoresearch). Same method: single editable file, single metric, git-based keep/revert, autonomous loop. The only difference: `SKILL.md` replaces `train.py`, checklist pass rate replaces `val_bpb`. **NEVER STOP.** Once the loop begins, do NOT pause to ask the human if you should continue. The human might be away and expects you to work indefinitely until manually interrupted. If you run out of ideas, think harder — re-read failing outputs, try combining near-misses, try more radical prompt rewrites. The loop runs until the human interrupts you, period. --- ## Setup Work with the user to configure, then go autonomous. 1. **Parse target skill**: Get `<skill-name>` from `$ARGUMENTS`. Validate `skills/<skill-name>/SKILL.md` exists. 2. **Load or create RESEARCH.md**: Check for `skills/<skill-name>/RESEARCH.md`. If it exists, read it. If not, generate one: - Read the target SKILL.md - Derive 3 test inputs from its description and use cases - Derive 5-7 checklist items from its workflow steps and output format - Write the generated RESEARCH.md and show it to the user for confirmation 3. **Parse config from RESEARCH.md**: - `## Test Inputs` — each `### Test N:` heading is one test case (the text below is the prompt) - `## Checklist` — each `- [ ]` line is a