fine-tuning-expert

Solid

Use when fine-tuning LLMs, training custom models, or adapting foundation models for specific tasks. Invoke for configuring LoRA/QLoRA adapters, preparing JSONL training datasets, setting hyperparameters for fine-tuning runs, adapter training, transfer learning, finetuning with Hugging Face PEFT, OpenAI fine-tuning, instruction tuning, RLHF, DPO, or quantizing and deploying fine-tuned models. Trigger terms include: LoRA, QLoRA, PEFT, finetuning, fine-tuning, adapter tuning, LLM training, model training, custom model.

AI & Automation 9,537 stars 808 forks Updated 1 weeks ago MIT

Install

View on GitHub

Quality Score: 94/100

Stars 20%

100

Recency 20%

Frontmatter 20%

Documentation 15%

100

Issue Health 10%

License 10%

100

Description 5%

100

Skill Content

# Fine-Tuning Expert Senior ML engineer specializing in LLM fine-tuning, parameter-efficient methods, and production model optimization. ## Core Workflow 1. **Dataset preparation** — Validate and format data; run quality checks before training starts - Checkpoint: `python validate_dataset.py --input data.jsonl` — fix all errors before proceeding 2. **Method selection** — Choose PEFT technique based on GPU memory and task requirements - Use LoRA for most tasks; QLoRA (4-bit) when GPU memory is constrained; full fine-tune only for small models 3. **Training** — Configure hyperparameters, monitor loss curves, checkpoint regularly - Checkpoint: validation loss must decrease; plateau or increase signals overfitting 4. **Evaluation** — Benchmark against the base model; test on held-out set and edge cases - Checkpoint: collect perplexity, task-specific metrics (BLEU/ROUGE), and latency numbers 5. **Deployment** — Merge adapter weights, quantize, measure inference throughput before serving ## Reference Guide Load detailed guidance based on context: | Topic | Reference | Load When | |-------|-----------|-----------| | LoRA/PEFT | `references/lora-peft.md` | Parameter-efficient fine-tuning, adapters | | Dataset Prep | `references/dataset-preparation.md` | Training data formatting, quality checks | | Hyperparameters | `references/hyperparameter-tuning.md` | Learning rates, batch sizes, schedulers | | Evaluation | `references/evaluation-metrics.md` | Benchmarking, metri...

Details

Author: Jeffallan
Repository: Jeffallan/claude-skills
Created: 7 months ago
Last Updated: 1 weeks ago
Language: Python
License: MIT

Integrates with

OpenAI · AI Hugging Face · AI

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Listed

fine-tuning-expert

Use when fine-tuning LLMs, training custom models, or optimizing model performance for specific tasks. Invoke for parameter-efficient methods, dataset preparation, or model adaptation.

2 Updated today

zacklecon

AI & Automation Featured

peft-fine-tuning

Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Use when fine-tuning large models (7B-70B) with limited GPU memory, when you need to train <1% of parameters with minimal accuracy loss, or for multi-adapter serving. HuggingFace's official library integrated with transformers ecosystem.

27,705 Updated today

davila7

AI & Automation Solid