Serving User Guides

Autoscaling
Updating a Service
- Rolling update
  - Example
- Blue-green update
  - Example
Authorization
- Setup API keys
Using Spot Instances for Serving
HTTPS Encryption
- HTTPS encrypted endpoint
High Availability Controller

© Copyright 2026, SkyPilot Team.