@HuggingPapers
OmniForcing unlocks real-time joint audio-visual generation Achieves ~25 FPS with 0.7s latency—a 35× speedup over offline diffusion models—by distilling bidirectional LTX-2 into a causal streaming generator with maintained multi-modal fidelity. https://t.co/UGYGMyTQOs