@rohanpaul_ai
🤯 56 researchers from 32 universities across US, China, UK built an enormous video reasoning dataset to prove current AI models struggle with basic physical logic. "Very Big Video Reasoning Suite" The problem is that the AI does not genuinely know how solid objects are supposed to behave. So Berkeley, Stanford, CMU, Harvard, Oxford, Columbia, NTU, Johns Hopkins, and 24 other institutions built this 2mn samples which makes it 1000 times larger than all existing collections combined. Video generation systems usually focus on making things look pretty but they completely fail to understand spatial rules and causality. The team created a massive factory of visual tasks that tests how well models handle navigation, object manipulation, and logic. Even the most advanced commercial systems only scored around 54% while human testers easily achieved over 97% accuracy. Training an open model on this specific data improved its reasoning skills but a massive gap still exists.