@ShenyuanGao
π€ How can we enable zero-shot generalization to unseen scenarios for robot world models? Thrilled to share DreamDojo π β an interactive robot world model pretrained on 44K hours of human egocentric videos, the largest and most diverse dataset to date for robot world model learning. Our model not only excels in generalization, but also supports real-time interaction at 10 FPS after distillation. It enables several important applications, including live teleoperation, policy evaluation, and model-based planning at test time. π Project: https://t.co/hJIEiGXnKz π° Paper: https://t.co/oa5xr8Y2GH π€ Code & models & datasets: https://t.co/A8B4ii0Kah #WorldModels #Robotics #EmbodiedAI #RL #AI #NVIDIA Sharing more details in the thread π§΅