Your curated collection of saved posts and media
π― https://t.co/BYoaTiZMQa
Gn, π..Β·Λ ΰΌ βΎ βο½‘Λ βοΈ Grok Imagine "extend from frame" 30 second continuous video.π« https://t.co/XHrdpXhnZA
xAI's Grok Imagine just took over the entire DesignArena Video leaderboard - not one, but THREE #1 rankings β #1 Video Arena - Elo 1337, a 33-point gap over #2 β #1 Image to Video Arena - Elo 1298, beating Google Veo 3.1, Kling & Sora β #1 Video Editing Arena - Elo 1291 Itβs wild, xAI was nowhere in the video space a few months ago, and now it's #1 across various benchmarks Grok Imagine's rate of progress is in a league of its own
π§΅New paper: "Lost in Backpropagation: The LM Head is a Gradient Bottleneck" The output layer of LLMs destroys 95-99% of your training signal during backpropagation, and this significantly slows down pretraining π https://t.co/lnbGfesIFA
I'm pretty confident this can be leveraged to graft a modified backwards pass onto the LM head of a pretrained model to improve the validation loss over standard LM head bwd. More to come soon.
Do you want a 3D character interacting with an object/pet/another person, following a desired action? Presenting Hoi3DGen: Generating High-Quality Human-Object-Interactions in 3D. Project: https://t.co/EE87KSjQCX Code: https://t.co/ddpLjciTWC https://t.co/QPTyXw45kk
Do you want a 3D character interacting with an object/pet/another person, following a desired action? Presenting Hoi3DGen: Generating High-Quality Human-Object-Interactions in 3D. Project: https://t.co/EE87KSjQCX Code: https://t.co/ddpLjciTWC https://t.co/QPTyXw45kk
All AI posters at GTC. This is not for human consumption. This video is for AI to watch. Click the grok button and talk to it about what it learned by seeing all the AI posters (highly technical) presented at @NVIDIAGTC tonight. Thanks NVIDIA for the badge and access. https://t.co/mKqIv1f6Dt
Wow. Grok watched this video and made a complete list of everything it saw: https://t.co/fqC1fuwhwX Do you have any idea how cool this is? It read every poster.
@Yulun_Du @ilyasut SGD is a ResNet too (the blocks of it are fwd+bwd), the residual stream is the weights so... π€ We're not taking the Attention is All You Need part literally enough? :D