> ## Documentation Index
> Fetch the complete documentation index at: https://docs.vast.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Generating Video with LTX-2.3 and ComfyUI

<script
  type="application/ld+json"
  dangerouslySetInnerHTML={{
__html: JSON.stringify({
"@context": "https://schema.org",
"@type": "HowTo",
"name": "How to Generate Video and Audio with LTX-2.3 on Vast.ai",
"description": "Deploy Lightricks LTX-2.3, a 22B parameter joint audio-video model, on Vast.ai using the pre-built ComfyUI template. Generate text-to-video, image-to-video, audio-conditioned video, and keyframe interpolation through a visual interface.",
"step": [
  {
    "@type": "HowToStep",
    "name": "Deploy the LTX-2.3 Template",
    "text": "Go to the LTX-2.3 model page on Vast.ai and click Deploy Now. Select a GPU instance and click Rent. The template automatically downloads all model weights during provisioning."
  },
  {
    "@type": "HowToStep",
    "name": "Open ComfyUI",
    "text": "Once the instance finishes provisioning, click the Open button to launch ComfyUI in your browser. Four pre-configured LTX-2.3 workflows are available in the sidebar."
  },
  {
    "@type": "HowToStep",
    "name": "Run a Workflow",
    "text": "Select a workflow from the sidebar: text-to-video, image-to-video, image+audio-to-video, or first-last-frame interpolation. Enter a prompt, optionally upload input media, and click Run to generate video."
  }
]
})
}}
/>

[LTX-2.3](https://huggingface.co/Lightricks/LTX-2.3) is the first open-source model that generates synchronized video and audio together in a single pass. Built by Lightricks, this 22B-parameter Diffusion Transformer doesn't just create video — it produces matching audio at the same time, whether that's dialogue with lip-synced characters, ambient soundscapes, or music-driven visuals.

What makes it stand out:

* **Joint audio-video generation** — no need to generate video and audio separately and try to sync them
* **Multiple generation modes** — text-to-video, image-to-video, audio-conditioned video, and keyframe interpolation
* **Fast inference** — the distilled pipeline generates in just 8 denoising steps
* **Open source** — full model weights available on HuggingFace, trainable with LoRA in under an hour

Running LTX-2.3 requires a powerful GPU. Vast.ai gives you on-demand access to the hardware you need, and the pre-built ComfyUI template means you can go from zero to generating videos in minutes — no CLI, SSH, or manual setup required.

## Find and rent your GPU

1. **Setup your Vast account and add credit:** Review the [quickstart guide](/documentation/get-started/quickstart) if you do not have an account with credits loaded.
2. **Deploy the LTX-2.3 template:** Go to the [LTX-2.3 model page](https://vast.ai/model/ltx-2-3) and click **Deploy Now**. This takes you to the Vast console with the LTX-2.3 ComfyUI template pre-selected.
3. **Select a GPU:** Choose an instance from the list and click **Rent**.

<Frame caption="Vast.ai console showing available RTX 5090 instances with the LTX-2.3 ComfyUI template">
  <img src="https://mintcdn.com/vastai-80aa3a82/szHd1B3Sawrb9biP/images/ltx-2-3/02-select-gpu.jpg?fit=max&auto=format&n=szHd1B3Sawrb9biP&q=85&s=bb87176568ffab33da89e690ed86003d" alt="Vast.ai console showing available RTX 5090 instances with the LTX-2.3 ComfyUI template" width="2354" height="1764" data-path="images/ltx-2-3/02-select-gpu.jpg" />
</Frame>

## Wait for provisioning

After renting, the instance automatically downloads all required model weights. You'll see a loading screen while models download. On a fast connection this takes just a few minutes. Once complete, the instance status shows a green **Running** indicator.

<Frame caption="Running instance with Open button ready">
  <img src="https://mintcdn.com/vastai-80aa3a82/szHd1B3Sawrb9biP/images/ltx-2-3/03-instance-running.jpg?fit=max&auto=format&n=szHd1B3Sawrb9biP&q=85&s=10311cf4837b589ddfb9318be29f840f" alt="Running instance showing verified status" width="1958" height="892" data-path="images/ltx-2-3/03-instance-running.jpg" />
</Frame>

## Open ComfyUI

Click the **Open** button on your instance to launch the Instance Portal. Click **Launch Application** under **ComfyUI** to open the visual workflow editor.

<Frame caption="Instance Portal with ComfyUI and other available services">
  <img src="https://mintcdn.com/vastai-80aa3a82/szHd1B3Sawrb9biP/images/ltx-2-3/04-instance-portal.jpg?fit=max&auto=format&n=szHd1B3Sawrb9biP&q=85&s=e81b8e0d57876f2cf832ba3c076e16d0" alt="Instance Portal showing ComfyUI, API Wrapper, Jupyter, and other services" width="2830" height="1060" data-path="images/ltx-2-3/04-instance-portal.jpg" />
</Frame>

In the left sidebar under **Workflows > Browse**, you'll see four pre-loaded workflows:

| Workflow             | Description                      |
| -------------------- | -------------------------------- |
| `video_ltx2_3_t2v`   | Text-to-Video                    |
| `video_ltx2_3_i2v`   | Image-to-Video                   |
| `video_ltx2_3_ia2v`  | Image + Audio-to-Video           |
| `video_ltx2_3_flf2v` | First & Last Frame Interpolation |

## Text-to-Video

Select **video\_ltx2\_3\_t2v** from the sidebar. Enter a descriptive prompt in the **Video Generation** node — describe camera angles, lighting, and motion cinematically. Adjust width, height, and frame count if desired (defaults: 1280x720, 121 frames, 25 fps). Click **Run**.

<Frame caption="Text-to-Video workflow generating a video of high-tech machinery">
  <img src="https://mintcdn.com/vastai-80aa3a82/szHd1B3Sawrb9biP/images/ltx-2-3/05-workflow-t2v.jpg?fit=max&auto=format&n=szHd1B3Sawrb9biP&q=85&s=80c5177c5ec74cf2785ffb3e9086728e" alt="Text-to-Video workflow in ComfyUI" width="3356" height="1798" data-path="images/ltx-2-3/05-workflow-t2v.jpg" />
</Frame>

The workflow includes automatic prompt enhancement powered by Gemma 3, which expands short prompts into detailed cinematic descriptions.

## Image-to-Video

Select **video\_ltx2\_3\_i2v**. Upload a reference image in the **Load Image** node (a sample Egyptian queen image is included). Enter a prompt describing how the image should come to life. Click **Run**. The model uses your image as the first frame and generates consistent motion.

<Frame caption="Image-to-Video workflow animating an Egyptian queen portrait">
  <img src="https://mintcdn.com/vastai-80aa3a82/szHd1B3Sawrb9biP/images/ltx-2-3/06-workflow-i2v.jpg?fit=max&auto=format&n=szHd1B3Sawrb9biP&q=85&s=694fc2fbf09278f6b655b281d8efde9d" alt="Image-to-Video workflow in ComfyUI" width="3354" height="1796" data-path="images/ltx-2-3/06-workflow-i2v.jpg" />
</Frame>

## Image + Audio-to-Video

Select **video\_ltx2\_3\_ia2v**. Upload a reference image and an audio file (a sample MP3 is included). Enter a prompt describing the scene. Click **Run**. The model generates video synchronized to the audio — lip movements match dialogue, and scene energy follows the audio's rhythm.

<Frame caption="Image + Audio-to-Video workflow syncing a character to an audio track">
  <img src="https://mintcdn.com/vastai-80aa3a82/szHd1B3Sawrb9biP/images/ltx-2-3/07-workflow-ia2v.jpg?fit=max&auto=format&n=szHd1B3Sawrb9biP&q=85&s=9ae4d1296c2e35c3a5572e77f2e2459b" alt="Image+Audio-to-Video workflow in ComfyUI" width="3356" height="1794" data-path="images/ltx-2-3/07-workflow-ia2v.jpg" />
</Frame>

## First & Last Frame Interpolation

Select **video\_ltx2\_3\_flf2v**. Load two images — a first frame and a last frame (sample car images are included). Enter a prompt describing the transition. Click **Run**. The model generates a smooth video interpolation between your two keyframes.

<Frame caption="First & Last Frame interpolation generating a smooth camera transition between two car shots">
  <img src="https://mintcdn.com/vastai-80aa3a82/szHd1B3Sawrb9biP/images/ltx-2-3/08-workflow-flf2v.jpg?fit=max&auto=format&n=szHd1B3Sawrb9biP&q=85&s=6bac20e99e3f65bd57fd2b3a7353e806" alt="First-Last-Frame interpolation workflow in ComfyUI" width="3350" height="1796" data-path="images/ltx-2-3/08-workflow-flf2v.jpg" />
</Frame>

## Cleanup

When finished, go to the [Vast.ai console](https://cloud.vast.ai/instances/) and click **Delete** on your instance to stop charges.

## Resources

* [LTX-2.3 on HuggingFace](https://huggingface.co/Lightricks/LTX-2.3)
* [LTX-2 GitHub Repository](https://github.com/Lightricks/LTX-2)
* [Vast.ai Model Page](https://vast.ai/model/ltx-2-3)
