is a general-purpose speech recognition model trained on a large dataset of diverse audio. Go through the 

The template you selected will give your instance access to both Jupyter and SSH. Additionally the Open button will connect you to the instance portal web interface. 

4. HTTP and token-based auth are both enabled by default. To avoid certificate errors in your browser, please follow the instructions for installing the TLS certificate 

 to allow secure HTTPS connections to your instance via its IP. 

5. Use the open button to open up the instance, if you are not using the open button the default username will be: vastai , and the password will be the value of the environment variable:

. You can also find the token value by accessing the terminal and executing this command: 

6. After accessing the SwaggerUi by clicking the triangle button first then waiting for the page to load, then clicking into the link aligning with SwaggerUI you should see the page below. (note: usually loads fast but can take 5-10 minutes) 


Two POST endpoints are exposed in this template:

Use this endpoint to automatically detect the spoken language in a given audio file.

Use this endpoint for both transcription and translation of audio files.

Both of these endpoints are documented using the OpenAPI standard and can be tested in a web browser. 

 If you look in the response body (see below) you can see it was able to detect the language was English.

Note: If you are getting an internal 500 error its most likely the file you selected to upload is to large. 

For more information and specifics on things such as but not limited to Configuration, Additional Functionality, Instance Logs, Cloudflared, Api request, ssh tunnels and port reference mapping, and Caddy you can visit the

Google Colab

Whisper ASR Guide

A lower level organization that lives within an Endpoint. It consists of a template (with extra filters for search), a set of GPU instances (workers) created from that template, and hyperparameters.

Worker_Groups

The highest level clustering of instances for the autoscaler, consisting of a named endpoint string, a collection of Worker groups, and hyperparameters.

Endpoints

The Vast PyWorker is a Python web server designed to run alongside a machine learning model instance, providing autoscaler compatibility.

PyWorker

Worker_Group

Endpoint

The minimum number of workers you want to keep "cold" (meaning stopped and fully loaded) when your group has no load.

min_cold_workers

The maximum number of workers your router group can have.

max_workers

Teams Overview

Guides

Introduction

QuickStart

Instances Help

Billing Help

Networking

Troubleshooting

Data Movement

overview

Creating Templates for GROBID

Creating a Custom Template

PyTorch

AI/ML Frameworks

CUDA

GPU Programming

Linux Virtual Desktop

Linux Virtual Machines

Virtual Computing

TTS with Nari Labs Dia

AI Audio Generation

Ollama + Webui

Oobabooga (LLM webui)

Huggingface TGI with LLama3

Quantized GGUF models (cloned)

vLLM (LLM inference and serving)

AI Text Generation

Image Generation

Stable Diffusion

Disco Diffusion

AI Image Generation

Video Generation

AI Video Generation

Infinity Embeddings

Text Embeddings

Blender in the Cloud

Blender Batch Rendering

3D Rendering

Mining on Bittensor

Cryptocurrency

Development Tools

Audio-to-Text

Use Cases

Edit team

Teams Quickstart

Team Creation

Teams Invitations

Teams Roles

Transfer Team Ownership

team

Overview

Guide to Taxes

Datacenter Status

Payment

Verification Stages

Clusters

hosting

Multi-Node training using Torch + NCCL

distributed-computing

Settings

Keys

Instance Portal

Templates

Volumes

Billing

Earning

Instances Guide

Search

Referral Program

Members

console

RTX 5 Series

Specific GPUs