@multimodalart
IT'S OUT! 🚀 MoDA: Multi-modal Diffusion Architecture for Talking Head Generation

finally a talking head:
open source 🏋️
fast ⚡
portrait + audio-driven 🧑🎨🎧
with emotion control

(and yes, i built an inference system + Gradio, generate in < 15s on @huggingface Spaces 🤗) https://t.co/VgF8BLrM8s