SkyPilot: Manage all your AI compute#
SkyPilot is a system to run, manage, and scale AI workloads on any AI infrastructure.
SkyPilot gives AI teams a simple interface to run jobs on any infra. Infra teams get a unified control plane to manage any AI compute — with advanced scheduling, scaling, and orchestration.
Why SkyPilot#
SkyPilot is easy to use for AI users:
Quickly spin up compute on your own infra
Environment and job as code — simple and portable
Easy job management: queue, run, and auto-recover many jobs
SkyPilot makes Kubernetes easy for AI & Infra teams:
Slurm-like ease of use, cloud-native robustness
Local dev experience on K8s: SSH into pods, sync code, or connect IDE
Turbocharge your clusters: gang scheduling, multi-cluster, and scaling
SkyPilot unifies multiple clusters, clouds, and hardware:
One interface to use reserved GPUs, Kubernetes clusters, Slurm clusters, or 20+ clouds
Flexible provisioning of GPUs, TPUs, CPUs, with smart failover
Team deployment and resource sharing
SkyPilot maximizes GPU fleet utilization:
Autostop: automatic cleanup of idle resources
Binpacking: workload binpacking on shared clusters
Intelligent scheduler: automatically schedule on the most available infra
SkyPilot supports your existing GPU, TPU, and CPU workloads, with no code changes.
Current supported infra: Kubernetes, Slurm, AWS, GCP, Azure, OCI, CoreWeave, Nebius, Lambda Cloud, RunPod, Fluidstack, Cudo, Digital Ocean, Paperspace, Cloudflare, Samsung, IBM, Vast.ai, VMware vSphere, Seeweb, Prime Intellect, Shadeform, Verda Cloud, VastData, Crusoe.
Getting started#
Install SkyPilot in 1 minute. Then, launch your first cluster in 2 minutes in Quickstart.
SkyPilot is BYOC: Everything is launched within your cloud accounts, VPCs, and clusters.
Benefits of SkyPilot on Kubernetes#
SkyPilot makes Kubernetes AI-native.
It turbocharges your existing Kubernetes clusters by accelerating AI/ML velocity:
AI-friendly interface to launch jobs and deployments
Much simplified interactive dev for K8s (SSH / sync code / connect IDE to pods)
…and optimizing GPU scheduling, utilization, and scaling:
Advanced scheduling: Gang scheduling, multi-node jobs, and queueing
Multi-cluster support: Bring all your clusters under one control plane
Multi-cloud support: One consistent interface to manage many providers
See SkyPilot vs Vanilla Kubernetes and this blog post for more details.
Contact the SkyPilot team#
You can chat with the SkyPilot team and community on the SkyPilot Slack.
Learn more#
To learn more, see SkyPilot Overview and SkyPilot blog.
SkyPilot adopters: Testimonials and Case Studies
Partners and integrations: Community Spotlights
Follow updates: