@HuggingPapers
Meta AI's Saber redefines zero-shot reference-to-video generation It generates stunning, identity-preserving videos from text & images. No costly R2V datasets required, trained solely on video-text pairs. Achieves state-of-the-art with masked training. https://t.co/KCO1ZwddpE