ray-distributed-trainer

Solid

Distributed computing skill using Ray for parallel training, hyperparameter search, and resource management.

AI & Automation 814 stars 53 forks Updated today MIT

Install

View on GitHub

Quality Score: 95/100

Stars 20%

Recency 20%

100

Frontmatter 20%

Documentation 15%

100

Issue Health 10%

License 10%

100

Description 5%

100

Skill Content

# ray-distributed-trainer ## Overview Distributed computing skill using Ray for parallel training, hyperparameter search, and resource management across clusters. ## Capabilities - Ray Train for distributed training - Ray Tune for hyperparameter search at scale - Cluster resource management - Fault tolerance and checkpointing - Actor-based parallelism - Integration with PyTorch and TensorFlow - Elastic training support - Multi-node orchestration ## Target Processes - Distributed Training Orchestration - AutoML Pipeline Orchestration - Model Training Pipeline ## Tools and Libraries - Ray - Ray Train - Ray Tune - Ray Cluster ## Input Schema ```json { "type": "object", "required": ["mode", "config"], "properties": { "mode": { "type": "string", "enum": ["train", "tune", "cluster"], "description": "Ray operation mode" }, "config": { "type": "object", "properties": { "numWorkers": { "type": "integer" }, "useGpu": { "type": "boolean" }, "resourcesPerWorker": { "type": "object", "properties": { "cpu": { "type": "number" }, "gpu": { "type": "number" } } } } }, "trainConfig": { "type": "object", "properties": { "trainerPath": { "type": "string" }, "framework": { "type": "string", "enum": ["pytorch", "tensorflow", "xgboost"] }, "scalingConfig": { "type": "object" } } }, "tuneConfig"...

Details

Author: a5c-ai
Repository: a5c-ai/babysitter
Created: 4 months ago
Last Updated: today
Language: JavaScript
License: MIT

Related Skills

AI & Automation Featured

videodb

See, Understand, Act on video and audio. See- ingest from local files, URLs, RTSP/live feeds, or live record desktop; return realtime context and playable stream links. Understand- extract frames, build visual/semantic/temporal indexes, and search moments with timestamps and auto-clips. Act- transcode and normalize (codec, fps, resolution, aspect ratio), perform timeline edits (subtitles, text/image overlays, branding, audio overlays, dubbing, translation), generate media assets (image, audio, video), and create real time alerts for events from live streams or desktop capture.

196,640 Updated 2 days ago

affaan-m

AI & Automation Featured

ck

Persistent per-project memory for Claude Code. Auto-loads project context on session start, tracks sessions with git activity, and writes to native memory. Commands run deterministic Node.js scripts — behavior is consistent across model versions.

196,640 Updated 2 days ago

affaan-m

AI & Automation Featured

browser

Web browser automation with AI-optimized snapshots for claude-flow agents

55,973 Updated today

ruvnet