Image & Video

LTX-2 +(aud2vid) support in the Blender add-on: Pallaidium

New Multi-Input mode lets creators batch process text, image, and audio to video in one go.

Deep Dive

The free, open-source Blender add-on Pallaidium has received a major update, integrating the powerful LTX-2 AI model for audio-to-video generation directly within the 3D software. Created by developer tintwotin, this release introduces a novel 'Multi-Input' mode. This feature allows users to group a text strip, an image strip, and an audio strip into a single meta strip within Blender's Video Sequence Editor. This grouped input can then be selected for batch processing, enabling the simultaneous generation of multiple video clips from varied inputs in one operation.

Technically, LTX-2 is a large model, but optimizations—credited to Diffusers developer asomoza—make it possible to run on systems with less than 16GB of VRAM, significantly lowering the hardware barrier. Pallaidium positions itself as an end-to-end solution integrated into Blender, aiming to streamline the entire creative pipeline 'from script to screen and back.' The developer demonstrated the capability by creating a game scene for 'GenZ,' using LTX-2's aud2vid feature to generate video content directly from audio within the Blender environment.

This update is significant for indie creators and studios using Blender. It moves advanced AI video synthesis from standalone, often costly cloud services into a familiar, free, and local creative toolkit. The ability to batch process multi-modal inputs (text, image, audio) within a non-linear editor workflow could dramatically speed up pre-visualization, concept art creation, and even final asset generation for animation and game development, all while maintaining creative control and data privacy by running locally.

Key Points
  • Pallaidium add-on now supports LTX-2 for audio-to-video (aud2vid) generation inside Blender.
  • New Multi-Input mode batches text, image, and audio strips for efficient, simultaneous processing.
  • Optimizations allow LTX-2 to run on under 16GB VRAM, making high-end AI video more accessible.

Why It Matters

Brings professional-grade AI video generation into a free, local 3D suite, empowering indie animators and game devs.