# Vast.ai Documentation – Affordable GPU Cloud Marketplace ## Docs - [create api-key](https://docs.vast.ai/api-reference/accounts/create-api-key.md): Creates a new API key with specified permissions for the authenticated user. - [create env-var](https://docs.vast.ai/api-reference/accounts/create-env-var.md): Creates a new encrypted environment variable for the authenticated user. Keys are automatically converted to uppercase. Values are encrypted before storage. There is a limit on the total number of environment variables per user. - [create ssh-key](https://docs.vast.ai/api-reference/accounts/create-ssh-key.md): Creates a new SSH key and associates it with your account. The key will be automatically added to all your current instances. - [create subaccount](https://docs.vast.ai/api-reference/accounts/create-subaccount.md): Creates either a standalone user account or a subaccount under a parent account. Subaccounts can be restricted to host-only functionality. - [delete api key](https://docs.vast.ai/api-reference/accounts/delete-api-key.md): Deletes an existing API key belonging to the authenticated user. The API key is soft-deleted by setting a deleted_at timestamp. - [delete env var](https://docs.vast.ai/api-reference/accounts/delete-env-var.md): Deletes an environment variable associated with the authenticated user. The variable must exist and belong to the requesting user. - [delete ssh key](https://docs.vast.ai/api-reference/accounts/delete-ssh-key.md): Removes an SSH key from the authenticated user's account - [set user](https://docs.vast.ai/api-reference/accounts/set-user.md): Updates the user data for the authenticated user. - [show api keys](https://docs.vast.ai/api-reference/accounts/show-api-keys.md): Retrieves all API keys associated with the authenticated user. - [show connections](https://docs.vast.ai/api-reference/accounts/show-connections.md): Retrieves the list of cloud connections associated with the authenticated user. - [show env vars](https://docs.vast.ai/api-reference/accounts/show-env-vars.md): Retrieve a list of environment variables (secrets) for the authenticated user. - [show ipaddrs](https://docs.vast.ai/api-reference/accounts/show-ipaddrs.md): This endpoint retrieves the history of IP address accesses for the authenticated user. - [show ssh keys](https://docs.vast.ai/api-reference/accounts/show-ssh-keys.md): Retrieve a list of SSH keys associated with the authenticated user's account. - [show subaccounts](https://docs.vast.ai/api-reference/accounts/show-subaccounts.md): Retrieve a list of subaccounts associated with the authenticated user's account. - [show team role](https://docs.vast.ai/api-reference/accounts/show-team-role.md): Retrieve details of a specific team role by its name. - [show user](https://docs.vast.ai/api-reference/accounts/show-user.md): Retrieve information about the current authenticated user, excluding the API key. - [transfer credit](https://docs.vast.ai/api-reference/accounts/transfer-credit.md): Transfers specified amount of credits from the authenticated user's account to another user's account. - [update env var](https://docs.vast.ai/api-reference/accounts/update-env-var.md): Updates the value of an existing environment variable for the authenticated user. - [update ssh key](https://docs.vast.ai/api-reference/accounts/update-ssh-key.md): Updates the specified SSH key with the provided value. - [search invoices](https://docs.vast.ai/api-reference/billing/search-invoices.md): This endpoint allows users to search and retrieve invoices based on specified filters. - [show deposit](https://docs.vast.ai/api-reference/billing/show-deposit.md): Retrieves the deposit details for a specified instance. - [show earnings](https://docs.vast.ai/api-reference/billing/show-earnings.md): Retrieves the earnings history for a specified time range and optionally per machine. - [show invoices](https://docs.vast.ai/api-reference/billing/show-invoices.md): Gets your invoices within given timerange. - [Creating and Using Templates with API](https://docs.vast.ai/api-reference/creating-and-using-templates-with-api.md) - [Creating Instances with the API](https://docs.vast.ai/api-reference/creating-instances-with-api.md) - [attach ssh-key](https://docs.vast.ai/api-reference/instances/attach-ssh-key.md): Attaches an SSH key to the specified instance, allowing SSH access using the provided key. - [cancel copy](https://docs.vast.ai/api-reference/instances/cancel-copy.md): Cancel a remote copy operation specified by the destination ID (dst_id). - [cancel sync](https://docs.vast.ai/api-reference/instances/cancel-sync.md): Cancels an in-progress remote sync operation identified by the destination instance ID. This operation cannot be resumed once canceled and must be restarted if needed. - [change bid](https://docs.vast.ai/api-reference/instances/change-bid.md): Change the current bid price of an instance to a specified price. - [cloud copy](https://docs.vast.ai/api-reference/instances/cloud-copy.md): Starts a cloud copy operation by sending a command to the remote server. The operation can transfer data between an instance and a cloud service. - [copy](https://docs.vast.ai/api-reference/instances/copy.md): Initiate a remote copy operation to transfer data from one instance to another or between an instance and the local machine. - [create instance](https://docs.vast.ai/api-reference/instances/create-instance.md): Creates a new instance by accepting an "ask" contract from a provider. - [destroy instance](https://docs.vast.ai/api-reference/instances/destroy-instance.md): Destroys/deletes an instance permanently. This is irreversible and will delete all data. - [detach ssh-key](https://docs.vast.ai/api-reference/instances/detach-ssh-key.md): Detaches an SSH key from a specified instance, removing SSH access for that key. - [execute](https://docs.vast.ai/api-reference/instances/execute.md): Executes a constrained remote command on a specified instance. The command output can be retrieved from the returned result URL. - [manage instance](https://docs.vast.ai/api-reference/instances/manage-instance.md): Manage instance state and labels. The operation is determined by the request body parameters. - [prepay instance](https://docs.vast.ai/api-reference/instances/prepay-instance.md): Deposit credits into a reserved instance to receive usage discounts. The discount rate is calculated based on how many months of usage the prepaid amount covers. Maximum discount is typically 40%. - [reboot instance](https://docs.vast.ai/api-reference/instances/reboot-instance.md): Stops and starts a container without losing GPU priority. Updates container status to 'rebooting' and executes docker stop/start commands on the host machine. - [recycle instance](https://docs.vast.ai/api-reference/instances/recycle-instance.md): Destroys and recreates container in place (from newly pulled image) without losing GPU priority. Updates container status to 'recycling' and executes docker stop/remove commands on the host machine. - [show instance](https://docs.vast.ai/api-reference/instances/show-instance.md): Retrieves the details of a specific instance for the authenticated user. - [show instances](https://docs.vast.ai/api-reference/instances/show-instances.md): > ⚠️ **Deprecated:** This endpoint has been superseded by [`/api/v1/instances/`](#tag/Instances/GET/api/v1/instances/), which adds keyset pagination, filtering, column selection, and sorting. > Please migrate to v1. - [show instances (v1)](https://docs.vast.ai/api-reference/instances/show-instances-v1.md): Retrieve a paginated list of instances for the authenticated user. Supports keyset pagination (max 25 per page), filtering, column selection, and sorting. - [show logs](https://docs.vast.ai/api-reference/instances/show-logs.md): Request logs from a specific instance. The logs will be uploaded to S3 and can be retrieved from a generated URL. Supports both container logs and daemon system logs. - [show ssh-keys](https://docs.vast.ai/api-reference/instances/show-ssh-keys.md): Retrieves the SSH keys associated with a specific instance. - [API Introduction](https://docs.vast.ai/api-reference/introduction.md) - [cancel maint](https://docs.vast.ai/api-reference/machines/cancel-maint.md): Cancel a scheduled maintenance window for a specified machine. - [cleanup machine](https://docs.vast.ai/api-reference/machines/cleanup-machine.md): This endpoint removes expired contracts on a specified machine, freeing up space. - [list machine](https://docs.vast.ai/api-reference/machines/list-machine.md): Creates or updates ask contracts for a machine to list it for rent on the vast.ai platform. Allows setting pricing, minimum GPU requirements, end date and discount rates. - [remove defjob](https://docs.vast.ai/api-reference/machines/remove-defjob.md): Deletes the default job (background instances) for a specified machine. - [schedule maint](https://docs.vast.ai/api-reference/machines/schedule-maint.md): Schedules a maintenance window for a specified machine and notifies clients. - [set defjob](https://docs.vast.ai/api-reference/machines/set-defjob.md): Creates default jobs (background instances) for a specified machine with the given parameters. - [set min-bid](https://docs.vast.ai/api-reference/machines/set-min-bid.md): Sets the minimum bid price for a specified machine. - [show machines](https://docs.vast.ai/api-reference/machines/show-machines.md): Fetches data for multiple machines associated with the authenticated user. - [show reports](https://docs.vast.ai/api-reference/machines/show-reports.md): Retrieves a list of the most recent reports for a given machine. Each report includes details such as the problem identified, a message describing the issue, and the timestamp when the report was created. - [unlist machine](https://docs.vast.ai/api-reference/machines/unlist-machine.md): Removes all 'ask' type offer contracts for a specified machine, effectively unlisting it from being available for rent. - [add network-disk](https://docs.vast.ai/api-reference/network-volumes/add-network-disk.md): Adds a network disk to be used to create network volume offers, or adds machines to an existing network disk. - [create network-volume](https://docs.vast.ai/api-reference/network-volumes/create-network-volume.md): Creates a network volume from an offer. - [list network-volume](https://docs.vast.ai/api-reference/network-volumes/list-network-volume.md): Lists a network disk for rent as network volumes, or updates an existing listing with a new price/size/end date/discount. - [search network volumes](https://docs.vast.ai/api-reference/network-volumes/search-network-volumes.md): Search for available network volume offers with advanced filtering and sorting. - [unlist network-volume](https://docs.vast.ai/api-reference/network-volumes/unlist-network-volume.md): Unlists a network volume for rent. - [Permissions and Authorization](https://docs.vast.ai/api-reference/permissions-and-authorization.md) - [Rate Limits and Errors](https://docs.vast.ai/api-reference/rate-limits-and-errors.md) - [search benchmarks](https://docs.vast.ai/api-reference/search/search-benchmarks.md): Retrieve benchmark data based on search parameters. - [search offers](https://docs.vast.ai/api-reference/search/search-offers.md): Search for available GPU machine offers with advanced filtering and sorting. - [search templates](https://docs.vast.ai/api-reference/search/search-templates.md): Searches for templates using filter-based queries. - [create endpoint](https://docs.vast.ai/api-reference/serverless/create-endpoint.md): This endpoint creates a new job processing endpoint with specified parameters. - [create workergroup](https://docs.vast.ai/api-reference/serverless/create-workergroup.md): Creates a new workergroup configuration that manages worker instances for a serverless endpoint. - [delete endpoint](https://docs.vast.ai/api-reference/serverless/delete-endpoint.md): Deletes an endpoint group by ID. Associated workergroups will also be deleted. - [delete workergroup](https://docs.vast.ai/api-reference/serverless/delete-workergroup.md): Deletes an existing workergroup. - [get endpoint logs](https://docs.vast.ai/api-reference/serverless/get-endpoint-logs.md): Retrieves logs for a specific endpoint by name. - [get endpoint workers](https://docs.vast.ai/api-reference/serverless/get-endpoint-workers.md): Retrieves the current list and status of workers for a specific endpoint. Useful for monitoring, debugging connectivity issues, and understanding resource usage. - [get workergroup logs](https://docs.vast.ai/api-reference/serverless/get-workergroup-logs.md): Retrieves logs for a specific workergroup by ID. - [get workergroup workers](https://docs.vast.ai/api-reference/serverless/get-workergroup-workers.md): Retrieves the current list and status of workers for a specific workergroup. Useful for monitoring, debugging connectivity issues, and understanding resource usage within a workergroup. - [route](https://docs.vast.ai/api-reference/serverless/route.md): Calls on the serverless engine to retrieve a GPU instance address within your endpoint for processing a request. The engine will return either a ready worker URL or status information if no workers are available. - [show endpoints](https://docs.vast.ai/api-reference/serverless/show-endpoints.md): Retrieve a list of endpoint jobs for the authenticated user. - [show workergroup](https://docs.vast.ai/api-reference/serverless/show-workergroup.md): Retrieves the list of workergroups associated with the authenticated user. - [update endpoint CLI Usage: vastai update endpoint ID [OPTIONS] ](https://docs.vast.ai/api-reference/serverless/update-endpointcli-usage:-vastai-update-endpoint-id-[options].md): Updates the specified endpoint group with the provided parameters. - [update workergroup](https://docs.vast.ai/api-reference/serverless/update-workergroup.md): Updates the properties of an existing workergroup based on the provided parameters. - [create team](https://docs.vast.ai/api-reference/team/create-team.md): Creates a new [team](https://docs.vast.ai/documentation/teams/teams-overview) with given name and following default roles: - **Owner**: Full access to all team resources, settings, and member management. The team owner is the user who creates the team. - **Manager**: All permissions of owner except… - [create team role](https://docs.vast.ai/api-reference/team/create-team-role.md): Creates a new role within a team. Only team owners or managers with the appropriate permissions can perform this operation. - [destroy team](https://docs.vast.ai/api-reference/team/destroy-team.md): Deletes a team and all associated data including API keys, rights, invitations, memberships and metadata. The team owner's master API key is converted to a normal client key. - [invite team member](https://docs.vast.ai/api-reference/team/invite-team-member.md): Sends an invitation email to the specified user to join the team with the given role. - [remove team member](https://docs.vast.ai/api-reference/team/remove-team-member.md): Removes a member from the team by revoking their team-related API keys and updating membership status. Cannot remove the team owner. - [remove team role](https://docs.vast.ai/api-reference/team/remove-team-role.md): Removes a role from the team. Cannot remove the team owner role. - [show team members](https://docs.vast.ai/api-reference/team/show-team-members.md): Retrieve a list of team members associated with the authenticated user's team. - [show team roles](https://docs.vast.ai/api-reference/team/show-team-roles.md): Retrieve a list of all roles for a team, excluding the owner' role. - [update team role](https://docs.vast.ai/api-reference/team/update-team-role.md): Update an existing team role with new name and permissions. - [create template](https://docs.vast.ai/api-reference/templates/create-template.md): Creates a new template for launching instances. - [delete template](https://docs.vast.ai/api-reference/templates/delete-template.md): Deletes an existing template. - [edit template](https://docs.vast.ai/api-reference/templates/edit-template.md): Edits an existing template in place. - [delete volume](https://docs.vast.ai/api-reference/volumes/delete-volume.md): Delete a volume by its ID. - [list volumes](https://docs.vast.ai/api-reference/volumes/list-volumes.md): Retrieve information about all volumes rented by you. - [rent volume](https://docs.vast.ai/api-reference/volumes/rent-volume.md): Rent/create a new volume with specified parameters. - [search volumes](https://docs.vast.ai/api-reference/volumes/search-volumes.md): Search for available volumes based on specified criteria. - [unlist volume](https://docs.vast.ai/api-reference/volumes/unlist-volume.md): Remove a volume listing from the marketplace. - [Blender Batch Rendering](https://docs.vast.ai/blender-batch-rendering.md) - [Blender in the Cloud](https://docs.vast.ai/blender-in-the-cloud.md) - [Commands](https://docs.vast.ai/cli/commands.md) - [Overview & quickstart](https://docs.vast.ai/cli/get-started.md) - [CUDA](https://docs.vast.ai/cuda.md) - [Disco Diffusion](https://docs.vast.ai/disco-diffusion.md) - [Welcome to Vast.ai](https://docs.vast.ai/documentation/get-started/index.md): Step-by-step Vast.ai developer documentation with examples, guides, and API references. - [QuickStart](https://docs.vast.ai/documentation/get-started/quickstart.md) - [Clusters](https://docs.vast.ai/documentation/host/clusters.md) - [Datacenter Status](https://docs.vast.ai/documentation/host/datacenter-status.md) - [Earning](https://docs.vast.ai/documentation/host/earning.md) - [Tax Guide for Hosts](https://docs.vast.ai/documentation/host/guide-to-taxes.md) - [Hosting Overview](https://docs.vast.ai/documentation/host/hosting-overview.md) - [How to Self-Test](https://docs.vast.ai/documentation/host/how-to-self-test.md) - [Host Payouts](https://docs.vast.ai/documentation/host/payment.md) - [Understanding Verification](https://docs.vast.ai/documentation/host/understanding-verification.md) - [Verification Stages](https://docs.vast.ai/documentation/host/verification-stages.md) - [VMs](https://docs.vast.ai/documentation/host/vms.md) - [Finding & Renting Instances](https://docs.vast.ai/documentation/instances/choosing/find-and-rent.md): Find and rent GPU instances on Vast.ai. Learn how to search, filter, understand offer cards, and configure your instance. - [Instance Types](https://docs.vast.ai/documentation/instances/choosing/instance-types.md): Understand Vast.ai instance types - On-demand, Reserved, and Interruptible. Learn how each type works, their differences, and when to use each. - [Choosing Instances Overview](https://docs.vast.ai/documentation/instances/choosing/overview.md): Learn the complete process of selecting and renting a GPU instance on Vast.ai, from choosing templates to configuring and launching. - [Reserved Instances](https://docs.vast.ai/documentation/instances/choosing/reserved-instances.md): Save up to 50% on GPU costs by pre-paying for reserved instances. Learn how to convert on-demand instances to reserved pricing. - [Choosing a Template](https://docs.vast.ai/documentation/instances/choosing/templates.md): Select the right template for your Vast.ai instance. Templates define your Docker image, launch mode, and initialization settings. - [Instance Portal](https://docs.vast.ai/documentation/instances/connect/instance-portal.md) - [Jupyter](https://docs.vast.ai/documentation/instances/connect/jupyter.md): Run Jupyter on Vast.ai with proxy or direct HTTPS. Learn setup, TLS certificate installation, and secure connections for smooth AI/ML development. - [Networking & Ports](https://docs.vast.ai/documentation/instances/connect/networking.md): Understand how Vast.ai handles networking, port mapping, and environment variables for Docker instances. - [Overview](https://docs.vast.ai/documentation/instances/connect/overview.md): Learn about Vast.ai connection methods—SSH, Jupyter, and Entrypoint—and how each controls instance access and workflow. - [SSH Connection](https://docs.vast.ai/documentation/instances/connect/ssh.md): Learn how to securely connect to Vast.ai instances using SSH. Generate keys, establish connections, use port forwarding, and integrate with VS Code. - [Windows SSH Guide](https://docs.vast.ai/documentation/instances/connect/windows-guide.md): Learn how to securely connect to Vast.ai instances using SSH on Windows. Understand the basics of SSH, how to generate and add keys, and how to use PuTTY and MobaXterm for GUI-based connections. - [Docker Execution Environment](https://docs.vast.ai/documentation/instances/docker-environment.md): Learn how Vast.ai Docker instances handle resource allocation, environment variables, networking, ports, and CLI usage. - [Managing Instances](https://docs.vast.ai/documentation/instances/manage-instances.md): Learn how to manage running instances - start, stop, destroy, monitor status, and handle common operational tasks. - [Instances Overview](https://docs.vast.ai/documentation/instances/overview.md): Instances are Docker containers that give you exclusive GPU access for training, inference, and development. Pay by the second, connect via SSH or Jupyter. - [Pricing](https://docs.vast.ai/documentation/instances/pricing.md): Understand Vast.ai's marketplace pricing model, rental types, reserved discounts, and costs for GPU instances. - [Scheduled Cloud Backups](https://docs.vast.ai/documentation/instances/storage/cloud-backups.md): Learn how to set up and schedule automated Vast.ai cloud backups using CLI or cron. Keep your data safe with best practices and easy management. - [Cloud Sync](https://docs.vast.ai/documentation/instances/storage/cloud-sync.md): Learn how to connect Vast.ai instances with cloud storage providers like Google Drive, S3, Backblaze, and Dropbox for secure data sync. - [Data Movement](https://docs.vast.ai/documentation/instances/storage/data-movement.md): Learn how to move data on Vast.ai using Cloud Sync, instance-to-instance transfers, CLI copy, VM migration, scp, and other efficient methods. - [Storage Types](https://docs.vast.ai/documentation/instances/storage/types.md): Understand the different storage options available on Vast.ai instances, including container storage and volumes. - [Volumes](https://docs.vast.ai/documentation/instances/storage/volumes.md) - [Virtual Machines](https://docs.vast.ai/documentation/instances/virtual-machines.md): Learn how Vast.ai virtual machines (VMs) provide full Linux instances with init manager support, process tracing, and nested containerization for advanced workloads. - [Account Settings](https://docs.vast.ai/documentation/reference/account-settings.md) - [Billing](https://docs.vast.ai/documentation/reference/billing.md) - [Billing](https://docs.vast.ai/documentation/reference/faq/billing.md): Questions about billing and credits - [General FAQ](https://docs.vast.ai/documentation/reference/faq/general.md): Basic questions about the Vast.ai platform - [FAQ Overview](https://docs.vast.ai/documentation/reference/faq/index.md): Find answers to common questions about Vast.ai - [Instances FAQ](https://docs.vast.ai/documentation/reference/faq/instances.md): Questions about creating and managing instances - [Jupyter & SSH FAQ](https://docs.vast.ai/documentation/reference/faq/jupyter-ssh.md): Connecting to instances via Jupyter and SSH - [Networking](https://docs.vast.ai/documentation/reference/faq/networking.md) - [Rental Types FAQ](https://docs.vast.ai/documentation/reference/faq/rental-types.md): Understanding on-demand vs interruptible instances - [Security FAQ](https://docs.vast.ai/documentation/reference/faq/security.md): Data protection and platform security - [Technical FAQ](https://docs.vast.ai/documentation/reference/faq/technical.md): Docker configuration, performance, and advanced topics - [Keys](https://docs.vast.ai/documentation/reference/keys.md) - [Referral Program](https://docs.vast.ai/documentation/reference/referral-program.md) - [Troubleshooting](https://docs.vast.ai/documentation/reference/troubleshooting.md) - [The SDK](https://docs.vast.ai/documentation/serverless/SDKoverview.md): Learn about the Serverless SDK, the primary method of interacting with Vast Serverless. - [Architecture Overview](https://docs.vast.ai/documentation/serverless/architecture.md): Learn how Vast Serverless operates and understand its major components. - [Automated Performance Testing](https://docs.vast.ai/documentation/serverless/automatedperformancetesting.md): Learn about the performance testing process in Vast.ai Serverless. - [Comfy UI](https://docs.vast.ai/documentation/serverless/comfy-ui.md): Learn how to use Comfy UI with Vast.ai Serverless for image generation workflows. - [ComfyUI ACE Step](https://docs.vast.ai/documentation/serverless/comfyui-acestep.md): Learn how to use ComfyUI with ACE Step v1 3.5B on Vast.ai Serverless for text-to-music generation. - [ComfyUI Wan 2.2](https://docs.vast.ai/documentation/serverless/comfyui-wan-2.2.md): Learn how to use ComfyUI with Wan 2.2 T2V A14B on Vast.ai Serverless for text-to-video generation. - [Creating Custom PyWorkers](https://docs.vast.ai/documentation/serverless/creating-new-pyworkers.md): Learn how to implement worker.py for Vast.ai Serverless using the Worker / WorkerConfig interface, including handlers, benchmarks, and log-based readiness. - [Architecture Overview](https://docs.vast.ai/documentation/serverless/deployments/architecture.md): Understand how Vast Deployments work under the hood, including deploy mode, serve mode, and update tiers. - [Configuring Deployments](https://docs.vast.ai/documentation/serverless/deployments/configuration.md): How to configure your deployment's image, packages, GPU requirements, autoscaling, environment variables, and more. - [@context Classes](https://docs.vast.ai/documentation/serverless/deployments/context.md): Load models and heavy resources once at worker startup and share them across all remote function calls. - [Example: Square Function](https://docs.vast.ai/documentation/serverless/deployments/examples/square.md): A minimal deployment that squares a number on a remote GPU. - [Example: MNIST Training & Inference](https://docs.vast.ai/documentation/serverless/deployments/examples/train-mnist.md): A deployment that trains an MNIST classifier at startup and serves inference requests. - [Example: vLLM Text Generation](https://docs.vast.ai/documentation/serverless/deployments/examples/vllm.md): A deployment that serves an LLM using vLLM's async engine. - [Deployments Overview](https://docs.vast.ai/documentation/serverless/deployments/index.md): An overview of Vast Deployments, the quickest way to run GPU code and set up endpoints in the Vast Cloud. - [@remote Functions](https://docs.vast.ai/documentation/serverless/deployments/remote-functions.md): Define, call, and benchmark async Python functions that execute on remote GPU workers. - [Serverless Overview](https://docs.vast.ai/documentation/serverless/index.md): An introduction to how Vast serverless compute works and how it's different from other serverless offerings. - [Endpoint and Worker Logs](https://docs.vast.ai/documentation/serverless/logging.md): Learn how to access Vast serverless logs - [Managing Scale](https://docs.vast.ai/documentation/serverless/managing-scale.md): Learn how to configure your Serverless endpoint for different load scenarios - [OpenAI API-compatible Interface](https://docs.vast.ai/documentation/serverless/openai-compatible-api.md): Use Vast.ai Serverless endpoints with the standard OpenAI API client by swapping your API key and base URL. - [The PyWorker](https://docs.vast.ai/documentation/serverless/overview.md): Learn about the Vast PyWorker and how it integrates with model instances. - [Pricing](https://docs.vast.ai/documentation/serverless/pricing.md): Learn how Vast.ai Serverless pricing works and when resources are billed - [Quickstart](https://docs.vast.ai/documentation/serverless/quickstart.md): Deploy your first vLLM endpoint - [Endpoint Parameters](https://docs.vast.ai/documentation/serverless/serverless-parameters.md): Learn about the parameters that control your scaling and best practices for setting them. - [Text Generation Inference (TGI)](https://docs.vast.ai/documentation/serverless/text-generation-inference-tgi.md): Learn how to use Text Generation Inference (TGI) with Vast.ai Serverless for text generation models. - [vLLM](https://docs.vast.ai/documentation/serverless/vllm.md): Learn how to use vLLM with Vast.ai Serverless for large language model inference. - [Worker States](https://docs.vast.ai/documentation/serverless/worker-states.md): Learn about the different worker states - [Workergroup Parameters](https://docs.vast.ai/documentation/serverless/workergroup-parameters.md): Learn about the parameters that control which workers get recruited. - [Zero Downtime Worker Update](https://docs.vast.ai/documentation/serverless/zero-downtime-worker-update.md): Update your Serverless template or model without dropping in-flight requests. - [Legacy Teams](https://docs.vast.ai/documentation/teams/legacy-teams.md) - [Managing Your Team](https://docs.vast.ai/documentation/teams/managing-teams.md) - [Teams Overview](https://docs.vast.ai/documentation/teams/teams-overview.md) - [Teams Quickstart](https://docs.vast.ai/documentation/teams/teams-quickstart.md) - [Teams Roles](https://docs.vast.ai/documentation/teams/teams-roles.md) - [Advanced Setup](https://docs.vast.ai/documentation/templates/advanced-setup.md) - [Creating Templates](https://docs.vast.ai/documentation/templates/creating-templates.md) - [Creating Templates for GROBID](https://docs.vast.ai/documentation/templates/examples/grobid.md) - [Templates](https://docs.vast.ai/documentation/templates/introduction.md) - [Managing Templates](https://docs.vast.ai/documentation/templates/managing-templates.md) - [Quick Start](https://docs.vast.ai/documentation/templates/quickstart.md) - [Template Settings](https://docs.vast.ai/documentation/templates/template-settings.md) - [Autonomous AI Research with Autoresearch on Vast.ai](https://docs.vast.ai/examples/ai-agents/autoresearch.md) - [BrowseSafe Prompt Injection Detection](https://docs.vast.ai/examples/ai-agents/browsesafe.md) - [OpenClaw AI Assistant with vLLM on Vast.ai](https://docs.vast.ai/examples/ai-agents/openclaw.md) - [Budget-Friendly Alternative to Claude Code - Overnight Ralph Loop Guide](https://docs.vast.ai/examples/ai-agents/overnight-ralph-loop.md) - [Unsloth Studio](https://docs.vast.ai/examples/ai-ml-frameworks/unsloth-studio.md) - [Serving Rerankers with vLLM](https://docs.vast.ai/examples/embeddings/serving-rerankers-vllm.md) - [DR-Tulu Research Agent](https://docs.vast.ai/examples/mcp/dr-tulu.md) - [Migrate from Runpod to Vast.ai](https://docs.vast.ai/examples/migrations/runpod-to-vast.md) - [GLiNER2](https://docs.vast.ai/examples/ner/gliner2.md) - [Running RolmOCR](https://docs.vast.ai/examples/ocr/rolmocr.md) - [dstack + vLLM](https://docs.vast.ai/examples/serving-infrastructure/dstack-vllm.md) - [SGLang Router](https://docs.vast.ai/examples/serving-infrastructure/sglang-router-vast.md) - [GLM-4.7-Flash](https://docs.vast.ai/examples/text-generation/glm-47-flash.md) - [MiniMax-M2](https://docs.vast.ai/examples/text-generation/minimax-m2.md) - [Speaker Diarization with Pyannote](https://docs.vast.ai/examples/transcription/speaker-diarization-pyannote.md) - [Google Colab](https://docs.vast.ai/google-colab.md) - [Huggingface TGI with LLama3](https://docs.vast.ai/huggingface-tgi-with-llama3.md) - [Image Generation](https://docs.vast.ai/image-generation.md) - [Infinity Embeddings](https://docs.vast.ai/infinity-embeddings.md) - [Langflow + Ollama](https://docs.vast.ai/langflow-ollama.md) - [Linux Virtual Desktop](https://docs.vast.ai/linux-virtual-desktop.md) - [Linux Virtual Machines](https://docs.vast.ai/linux-virtual-machines.md) - [Mining on Bittensor](https://docs.vast.ai/mining-on-bittensor.md) - [Multi-Node training using Torch + NCCL](https://docs.vast.ai/multi-node-training-using-torch-nccl.md) - [Ollama + Webui](https://docs.vast.ai/ollama-webui.md) - [Oobabooga (LLM webui)](https://docs.vast.ai/oobabooga-llm-webui.md) - [PyTorch](https://docs.vast.ai/pytorch.md) - [Quantized GGUF models (cloned)](https://docs.vast.ai/quantized-gguf-models-cloned.md) - [RTX 5 Series](https://docs.vast.ai/rtx-5-series.md): Optimize your GPU experience with our comprehensive guide on RTX 5 Series GPUs (5090/5080/5070) and CUDA 12.8 compatibility. Learn how to rent an RTX 5090 on Vast.ai, select the right templates, and customize your storage while ensuring optimal performance. - [Python SDK Usage](https://docs.vast.ai/sdk/python/quickstart.md) - [Stable Diffusion](https://docs.vast.ai/stable-diffusion.md) - [TTS with Nari Labs Dia](https://docs.vast.ai/tts-with-nari-labs-dia.md) - [Video Generation](https://docs.vast.ai/video-generation.md) - [vLLM (LLM inference and serving)](https://docs.vast.ai/vllm-llm-inference-and-serving.md) - [Whisper ASR Guide](https://docs.vast.ai/whisper-asr-guide.md) ## OpenAPI Specs - [openapi](https://docs.vast.ai/api-reference/openapi.json) Built with [Mintlify](https://mintlify.com).