@DrJimFan
What can half of GPT-1 do? We trained a 42M-parameter transformer called SONIC to control the body of a humanoid robot. It takes a remarkable amount of subconscious processing for us humans to squat, turn, crawl, and sprint. SONIC captures this "System 1" (the fast, reactive whole-body intelligence) in a single model that translates any motion command into stable, natural motor signals. And it's all open-source!!

The key insight: motion tracking is the one true scalable task for whole-body control. Instead of hand-engineering rewards for every new skill, we use dense, frame-by-frame supervision from human mocap data. The data itself encodes the reward function: "configure your limbs in any human-like position while maintaining balance."

We scaled humanoid motion RL to an unprecedented level: 100M+ mocap frames and 500,000+ parallel robots across 128 GPUs. NVIDIA Isaac Lab lets us accelerate physics to a 10,000x faster tick, giving robots many years of virtual experience in only hours of wall-clock time. After 3 days of training, the neural net transfers zero-shot to the real G1 robot with no finetuning: 100% success rate across 50 diverse real-world motion sequences.

One SONIC policy supports all of the following:
- VR whole-body teleoperation
- Human video: just point a webcam to live-stream motions.
- Text prompts: "walk sideways", "dance like a monkey", "kick your left foot", etc.
- Music audio: the robot dances to the beat, adapting to tempo and rhythm.
- VLA foundation models: we plugged in GR00T N1.5 and achieved 95% success on mobile tasks.

We open-source the code and model checkpoints!! Deep dive in thread:
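The "data encodes the reward" idea can be sketched roughly as follows: a DeepMimic-style dense tracking reward where the mocap frame itself is the target, so no per-skill reward engineering is needed. This is a minimal illustration under assumed names and weights, not SONIC's actual reward function.

```python
import numpy as np

def tracking_reward(robot_joint_pos, mocap_joint_pos,
                    robot_root_vel, mocap_root_vel,
                    w_pose=0.65, w_vel=0.35, k_pose=2.0, k_vel=0.1):
    """Dense per-frame reward: exponentiated negative tracking error.

    Each mocap frame supplies the target pose and velocity, so any
    human motion clip becomes a supervision signal for RL.
    (Weights and error scales here are illustrative assumptions.)
    """
    pose_err = np.sum((robot_joint_pos - mocap_joint_pos) ** 2)
    vel_err = np.sum((robot_root_vel - mocap_root_vel) ** 2)
    return w_pose * np.exp(-k_pose * pose_err) + w_vel * np.exp(-k_vel * vel_err)

# Perfect tracking yields the maximum reward of 1.0; any deviation decays it smoothly.
q = np.zeros(29)  # e.g. a 29-DoF humanoid's joint positions
v = np.zeros(3)
print(tracking_reward(q, q, v, v))  # -> 1.0
```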