• Home
  • Cinema
  • Studio
  • Digital
  • Royalty Free Music
  • web agency
  • blog
  • Home
  • Cinema
  • Studio
  • Digital
  • Royalty Free Music
  • web agency
  • blog

BLOG! BLOG! BLOG! BLOG! BLOG! BLOG! BLOG! BLOG! BLOG! BLOG! BLOG! BLOG!

blog

articoli, news, update

[VID+AUD] 🎭 LTX-2.3 ID-LoRA — Consistent Identity Across Audio & Video

bla bla bla

  • Marzo 30, 2026

Hi folks, this is CCS.

Another add to the LTX-2.3 workflow family: T/I2V with ID-LoRA + Reference Audio — voice-preserved talking-character generation, open source, inside ComfyUI.

What the pipeline does
You load a reference image (or skip it for pure T2V), write a structured prompt with [VISUAL], [SPEECH], and [SOUND] tags, and optionally feed a short 5-second audio clip to anchor the speaker’s voice identity. The model generates a synchronized audio+video clip where the face, lip movement, and vocal timbre stay coherent throughout.

The reference audio trick
The LTXVReferenceAudio node encodes your audio clip into a conditioning signal injected into the shared AV latent stream. It does not replay the sample — it extracts persistent vocal traits (timbre, resonance, speaking color) and reuses them in the newly generated dialogue. ~5 seconds of clean recording is enough.

What you need

For LTX-2.3 models downloads and other info, refer to my previous posts:
[VID] LTX-2.3: The New King of AI Video? 🚀 Full Workflow & Test | Patreon

As for the LoRA ID reference links:

https://huggingface.co/AviadDahan/LTX-2.3-ID-LoRA-CelebVHQ-3K

https://huggingface.co/AviadDahan/LTX-2.3-ID-LoRA-TalkVid-3K

The workflow supports both high and low VRAM paths, tiled decoding, and the VAE Decode → Disk node for very long or memory-constrained renders.

For a clean workflow, as always, click “convert all links” in the autolink module.

Thanks again for the support, guys—let’s keep this journey going together! This month there will be even more incredible updates from the AI image/video generation world—stay tuned!

More soon. — CCS

Share:

Categorie

  • Cinema

  • Digital

  • Musica

  • Web design

Altri post

[VID+AUD]🎧LTX-2.3: From Audio + Image to Long-Form, Lip-Synced full video🎬

Read More »

[VID+AUD] Directing LTX-2.3: From Audio-Guided Lipsync to Full Video Pipeline (Patreon supporters)

Read More »

[VID+AUD] 🎬 Directing with Sound: LTX-2.3 Audio-Guided Performance & Lipsync 👄

Read More »

[VID] LTX-2.3 PROMPTING MASTERCLASS — DIRECTING THE LTX-2.3 MODEL (Patreon supporters)

Read More »

FAIDENBLASS
studio

Hai bisogno di video-editing, post-produzione per il tuo progetto audio-video? Contattaci!
CONTATTACI

FAIDENBLASS
digital

Utilizziamo gli ultimi sistemi di Stable diffusion in locale di generazione di immagini + video. Contattaci per ogni info o richiesta
CONTATTACI

FAIDENBLASS
agency

Costruiamo siti web dinamici e responsivi su wordpress
CONTATTACI

Send Us A Message

PrevPrevious[VID+AUD]🎧LTX-2.3: From Audio + Image to Long-Form, Lip-Synced full video🎬

FAIDENBLASS

Faidenblass: il punto di incontro tra arte analogica e digitale.

links

  • Carmine Cristallo Scalzi
  • Mitologia Elfica
  • faidenblass web agency

pagine sito

  • Home
  • Cinema
  • Studio
  • Digital
  • Royalty Free Music
  • web agency
  • blog
  • Home
  • Cinema
  • Studio
  • Digital
  • Royalty Free Music
  • web agency
  • blog