@HuggingPapers
Tencent just released the Hunyuan Embodied AI model on Hugging Face A 2B parameter vision-language model with Mixture-of-Transformers architecture. It achieves SOTA results on CV-Bench, DA-2K and 10+ embodied understanding benchmarks. https://t.co/1ecUPjqqzu