fine-tuning-expert

Solid

Use when fine-tuning LLMs, training custom models, or adapting foundation models for specific tasks. Invoke for configuring LoRA/QLoRA adapters, preparing JSONL training datasets, setting hyperparameters for fine-tuning runs, adapter training, transfer learning, finetuning with Hugging Face PEFT, OpenAI fine-tuning, instruction tuning, RLHF, DPO, or quantizing and deploying fine-tuned models. Trigger terms include: LoRA, QLoRA, PEFT, finetuning, fine-tuning, adapter tuning, LLM training, model training, custom model.

AI & Automation 9,537 stars 808 forks Updated 1 weeks ago MIT

Install

View on GitHub

Quality Score: 94/100

Stars 20%
100
Recency 20%
90
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Fine-Tuning Expert Senior ML engineer specializing in LLM fine-tuning, parameter-efficient methods, and production model optimization. ## Core Workflow 1. **Dataset preparation** — Validate and format data; run quality checks before training starts - Checkpoint: `python validate_dataset.py --input data.jsonl` — fix all errors before proceeding 2. **Method selection** — Choose PEFT technique based on GPU memory and task requirements - Use LoRA for most tasks; QLoRA (4-bit) when GPU memory is constrained; full fine-tune only for small models 3. **Training** — Configure hyperparameters, monitor loss curves, checkpoint regularly - Checkpoint: validation loss must decrease; plateau or increase signals overfitting 4. **Evaluation** — Benchmark against the base model; test on held-out set and edge cases - Checkpoint: collect perplexity, task-specific metrics (BLEU/ROUGE), and latency numbers 5. **Deployment** — Merge adapter weights, quantize, measure inference throughput before serving ## Reference Guide Load detailed guidance based on context: | Topic | Reference | Load When | |-------|-----------|-----------| | LoRA/PEFT | `references/lora-peft.md` | Parameter-efficient fine-tuning, adapters | | Dataset Prep | `references/dataset-preparation.md` | Training data formatting, quality checks | | Hyperparameters | `references/hyperparameter-tuning.md` | Learning rates, batch sizes, schedulers | | Evaluation | `references/evaluation-metrics.md` | Benchmarking, metri...

Details

Author
Jeffallan
Repository
Jeffallan/claude-skills
Created
7 months ago
Last Updated
1 weeks ago
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Listed

fine-tuning-expert

Use when fine-tuning LLMs, training custom models, or optimizing model performance for specific tasks. Invoke for parameter-efficient methods, dataset preparation, or model adaptation.

2 Updated today
zacklecon
AI & Automation Featured

peft-fine-tuning

Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Use when fine-tuning large models (7B-70B) with limited GPU memory, when you need to train <1% of parameters with minimal accuracy loss, or for multi-adapter serving. HuggingFace's official library integrated with transformers ecosystem.

27,705 Updated today
davila7
AI & Automation Solid

peft-fine-tuning

Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Use when fine-tuning large models (7B-70B) with limited GPU memory, when you need to train <1% of parameters with minimal accuracy loss, or for multi-adapter serving. HuggingFace's official library integrated with transformers ecosystem.

175,435 Updated today
NousResearch
AI & Automation Solid

peft-fine-tuning

Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Use when fine-tuning large models (7B-70B) with limited GPU memory, when you need to train <1% of parameters with minimal accuracy loss, or for multi-adapter serving. HuggingFace's official library integrated with transformers ecosystem.

9,182 Updated 1 months ago
Orchestra-Research
AI & Automation Solid

peft-fine-tuning

Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Use when fine-tuning large models (7B-70B) with limited GPU memory, when you need to train <1% of parameters with minimal accuracy loss, or for multi-adapter serving. HuggingFace's official library integrated with transformers ecosystem.

1,436 Updated 6 days ago
OpenRaiser