Skip to main content
Ctrl+K

Site Navigation

  • Docs
  • AI Gallery
  • Blog
  • Community
  • Slack
  • Twitter
  • GitHub

Site Navigation

  • Docs
  • AI Gallery
  • Blog
  • Community
  • Slack
  • Twitter
  • GitHub

Getting Started

  • Overview
  • Installation
  • Quickstart
  • Example: AI Training
  • Concept: Sky Computing

Clusters

  • Start a Cluster
  • Provisioning Compute
  • Autostop and Autodown

Jobs

  • Cluster Jobs
  • Managed Jobs
  • Multi-Node Jobs
  • Many Parallel Jobs

Model Serving

  • Getting Started
  • Serving User Guides
    • Autoscaling
    • Updating a Service
    • Authorization
    • Using Spot Instances for Serving
    • HTTPS Encryption

Infra Choices

  • GPUs and Accelerators
    • Using Google TPUs
  • Using Cloud VMs
  • Using Kubernetes
    • Getting Started
    • Kubernetes Cluster Setup
      • Deployment Guides
      • Exposing Services
    • Kubernetes Troubleshooting
    • Multiple Kubernetes Clusters
    • SkyPilot vs. Vanilla Kubernetes
  • Using Existing Machines
  • Using Reservations

Data

  • Cloud Buckets
  • Syncing Code and Artifacts

User Guides

  • Secrets and Environment Variables
  • Docker Containers
  • Opening Ports
  • Usage Collection
  • Frequently Asked Questions

Administrator Guides

  • Minimal Cloud Permissions
    • AWS
    • GCP
    • vSphere
    • Kubernetes
  • Cloud Authentication
  • Requesting Quota Increase
  • Admin Policies

References

  • SkyPilot YAML
  • CLI
  • Python API
  • Advanced Configurations
  • Developer Guides
    • Contributing to SkyPilot
    • Guide: Adding a New Cloud

Serving User Guides#

  • Autoscaling
    • Fixed replicas
    • Enabling autoscaling
    • Scaling delay
    • Scale-to-zero
  • Updating a Service
    • Rolling update
      • Example
    • Blue-green update
      • Example
  • Authorization
    • Setup API keys
  • Using Spot Instances for Serving
    • Base on-demand fallback
    • Dynamic on-demand fallback
    • Example
  • HTTPS Encryption
    • HTTPS encrypted endpoint

previous

Serving Models

next

Autoscaling

Edit on GitHub

© Copyright 2024, SkyPilot Team.