Skip to main content
Ctrl+K
👋 Join us for the SkyPilot AI Infra Meetup in San Francisco on August 14! Register here
You are viewing the latest developer preview docs. Click here to view docs for the latest stable release.

Site Navigation

  • Docs
  • Blog
  • Community
  • Slack
  • Twitter
  • GitHub

Site Navigation

  • Docs
  • Blog
  • Community
  • Slack
  • Twitter
  • GitHub

Getting Started

  • Overview
  • Installation
  • Quickstart
  • Examples
    • Quickstart: PyTorch
    • Training
      • Axolotl
      • DeepSpeed
      • Distributed PyTorch
      • Distributed TensorFlow
      • Finetuning GPT-OSS
      • Finetuning Llama 4
      • Finetuning Llama 3
      • Finetuning Llama 2
      • NeMo
      • Ray
      • Training on TPUs
      • Unsloth
      • Verl (RLHF)
      • Vertex AI
    • Serving
      • vLLM
      • SGLang
      • Ollama
      • Hugging Face TGI
      • LoRAX
      • Cog
    • Models
      • OpenAI gpt-oss
      • DeepSeek-R1
      • DeepSeek-R1 Distilled
      • DeepSeek-Janus
      • Gemma 3
      • Llama 4
      • Llama 3.2
      • Llama 3.1
      • Llama 3
      • Llama 2
      • CodeLlama
      • Pixtral
      • Mixtral
      • Mistral 7B
      • Qwen 2.5
      • Yi
      • Gemma
      • DBRX
      • GPT-2 via llm.c
      • Vicuna
    • AI Applications
      • DeepSeek-R1 for RAG
      • Large-Scale Batch Inference
      • Image Vector Database
      • Tabby: Coding Assistant
      • LocalGPT: Chat with PDF
      • Stable Diffusion
    • AI Performance
      • AWS EFA
      • GCP/GKE GPUDirect
      • Nebius with InfiniBand
    • Orchestrators
      • Airflow
      • Cron
      • Github Actions
    • Other Frameworks
      • Cross-cloud data transfer
      • DVC
      • Jupyter
      • MLFlow
      • MPI
  • Concept: Sky Computing

Clusters

  • Start a Development Cluster
  • Cluster Jobs
  • Provisioning Compute
  • Autostop and Autodown

Jobs

  • Managed Jobs
  • Multi-Node Jobs
  • Many Parallel Jobs
  • Model Training Guide

Model Serving

  • Getting Started
  • Serving User Guides
    • Autoscaling
    • Updating a Service
    • Authorization
    • Using Spot Instances for Serving
    • HTTPS Encryption

Infra Choices

  • Using Kubernetes
    • Getting Started
    • Kubernetes Cluster Setup
      • Deployment Guides
      • Exposing Services
    • Priority and Preemption
    • Multiple Kubernetes Clusters
    • SkyPilot vs. Vanilla Kubernetes
    • Examples
      • Kueue
      • Dynamic Workload Scheduler
      • Kueue with GKE DWS
      • Multi-region Kubernetes
    • Kubernetes Troubleshooting
  • Using Existing Machines
  • Using Reservations
  • Using Cloud VMs
    • Requesting Quota Increase
  • GPUs and Accelerators
    • Using Google TPUs
    • Using AMD GPUs

Data

  • Cloud Buckets
  • Volumes
  • Syncing Code, Git, and Files

User Guides

  • Asynchronous Execution
  • Environment Variables and Secrets
  • Docker Containers
  • Opening Ports
  • Usage Collection
  • Frequently Asked Questions

Administrator Guides

  • API Server Deployment
    • Deploying API Server
      • API server metrics monitoring
      • GPU metrics monitoring
      • Advanced: Cross-Cluster State Persistence
      • Example: Deploy on GKE, GCP, and Nebius with Okta
      • Example: Deploy SkyPilot API Server in Docker
      • Example: Deploy on GKE with Cloud SQL
    • Upgrading API Server
    • Performance Best Practices
    • Troubleshooting
    • Helm Chart Reference
  • Authentication and RBAC
  • Workspaces: Isolating Teams
  • Cloud Accounts and Permissions
    • AWS
    • GCP
    • Nebius
    • vSphere
    • Kubernetes
  • Admin Policies
  • External Logging Storage

References

  • SkyPilot YAML
  • CLI
  • Python SDK
  • Advanced Configuration
    • Configuration Sources
  • SkyPilot State
  • Developer Guides
    • Contributing to SkyPilot
    • Guide: Adding a New Cloud

Orchestrators#

  • Airflow
  • Cron
  • Github Actions

previous

Using InfiniBand in Nebius with SkyPilot

next

Running SkyPilot tasks in Airflow with the SkyPilot API Server

Edit on GitHub

© Copyright 2025, SkyPilot Team.