🐦 Twitter Post Details

Viewing enriched Twitter post

@SakanaAILabs

Use Case 1: Autonomous ML Research Can an AI autonomously improve another AI’s training recipe? We tasked Fugu Ultra with improving a small GPT model using AutoResearch. Over 14 hours on a single H100 GPU, Fugu ran > 100 experiments. It iteratively edited the training code, ran tests, and kept any changes that successfully lowered the validation error rate. Watch the animation. The callouts track every time Fugu Ultra autonomously discovered a new improvement across batch size, model depth, learning rates, and optimizer settings. We pitted Fugu against three frontier models (Gemini 3.1 Pro, Opus 4.8, and GPT 5.5). To keep the focus purely on agentic behavior rather than brand wars, we anonymized them as Models A, B, and C. The Results: • Fugu Ultra (bold red) finished with the best mean performance (0.9774). • Fugu Ultra also achieved the best single run of the entire experiment (0.9748), leading every single baseline. For long horizon, agentic ML research, using Fugu to dynamically orchestrate a pool of strong models significantly outperforms relying on any individual monolithic model.

View on Twitter

📊 Media Metadata

{
  "media": [
    {
      "url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/2069084332879462779/media_0.jpg",
      "media_url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/2069084332879462779/media_0.jpg",
      "type": "photo",
      "filename": "media_0.jpg"
    }
  ],
  "processed_at": "2026-06-22T15:48:39.759510",
  "pipeline_version": "2.0"
}

🔧 Raw API Response

{
  "type": "tweet",
  "id": "2069084332879462779",
  "url": "https://x.com/SakanaAILabs/status/2069084332879462779",
  "twitterUrl": "https://twitter.com/SakanaAILabs/status/2069084332879462779",
  "text": "Use Case 1: Autonomous ML Research\n\nCan an AI autonomously improve another AI’s training recipe?\n\nWe tasked Fugu Ultra with improving a small GPT model using AutoResearch. Over 14 hours on a single H100 GPU, Fugu ran > 100 experiments. It iteratively edited the training code, ran tests, and kept any changes that successfully lowered the validation error rate.\n\nWatch the animation. The callouts track every time Fugu Ultra autonomously discovered a new improvement across batch size, model depth, learning rates, and optimizer settings.\n\nWe pitted Fugu against three frontier models (Gemini 3.1 Pro, Opus 4.8, and GPT 5.5). To keep the focus purely on agentic behavior rather than brand wars, we anonymized them as Models A, B, and C.\n\nThe Results:\n\n• Fugu Ultra (bold red) finished with the best mean performance (0.9774).\n• Fugu Ultra also achieved the best single run of the entire experiment (0.9748), leading every single baseline.\n\nFor long horizon, agentic ML research, using Fugu to dynamically orchestrate a pool of strong models significantly outperforms relying on any individual monolithic model.",
  "source": "Twitter for iPhone",
  "retweetCount": 2,
  "replyCount": 0,
  "likeCount": 4,
  "quoteCount": 0,
  "viewCount": 1136,
  "createdAt": "Mon Jun 22 15:45:21 +0000 2026",
  "lang": "en",
  "bookmarkCount": 4,
  "isReply": true,
  "inReplyToId": "2069079051239874657",
  "conversationId": "2068861630327443966",
  "displayTextRange": [
    0,
    283
  ],
  "inReplyToUserId": null,
  "inReplyToUsername": null,
  "author": {
    "type": "user",
    "userName": "SakanaAILabs",
    "url": "https://x.com/SakanaAILabs",
    "twitterUrl": "https://twitter.com/SakanaAILabs",
    "id": "218811492",
    "name": "Sakana AI",
    "isVerified": false,
    "isBlueVerified": true,
    "verifiedType": "Business",
    "profilePicture": "https://pbs.twimg.com/profile_images/1885939209388929024/dtnrOdGp_normal.jpg",
    "coverPicture": "https://pbs.twimg.com/profile_banners/218811492/1686643464",
    "description": "",
    "location": "Tokyo, Japan",
    "followers": 107325,
    "following": 0,
    "status": "",
    "canDm": false,
    "canMediaTag": true,
    "createdAt": "Tue Nov 23 10:20:07 +0000 2010",
    "entities": {
      "description": {
        "urls": []
      },
      "url": {}
    },
    "fastFollowersCount": 0,
    "favouritesCount": 2,
    "hasCustomTimelines": true,
    "isTranslator": false,
    "mediaCount": 413,
    "statusesCount": 1160,
    "withheldInCountries": [],
    "affiliatesHighlightedLabel": {},
    "possiblySensitive": false,
    "pinnedTweetIds": [
      "2036840833690071450"
    ],
    "profile_bio": {
      "description": "Building Frontier AI in Japan\n\nTry Sakana Chat, Marlin, Fugu 🐡 → https://t.co/1m2lSgnfB2",
      "entities": {
        "description": {
          "urls": [
            {
              "display_url": "sakana.ai",
              "expanded_url": "https://sakana.ai/",
              "indices": [
                65,
                88
              ],
              "url": "https://t.co/1m2lSgnfB2"
            }
          ]
        },
        "url": {
          "urls": [
            {
              "display_url": "sakana.ai/careers",
              "expanded_url": "https://sakana.ai/careers",
              "indices": [
                0,
                23
              ],
              "url": "https://t.co/1q07mb3TzE"
            }
          ]
        }
      }
    },
    "isAutomated": false,
    "automatedBy": null
  },
  "extendedEntities": {
    "media": [
      {
        "display_url": "pic.twitter.com/Gp96FEQ797",
        "expanded_url": "https://twitter.com/SakanaAILabs/status/2069084332879462779/photo/1",
        "ext_master_playlist_only": [],
        "ext_media_availability": {
          "status": "Available"
        },
        "ext_playlists": [],
        "id_str": "2069084284993056768",
        "indices": [
          284,
          307
        ],
        "media_key": "16_2069084284993056768",
        "media_results": {
          "id": "QXBpTWVkaWFSZXN1bHRzOgwAAgoAARy23THZWrAACgACHLbdPP+bQXsAAA==",
          "result": {
            "__typename": "ApiMedia",
            "id": "QXBpTWVkaWE6DAACCgABHLbdMdlasAAKAAIctt08/5tBewAA",
            "media_key": "16_2069084284993056768"
          }
        },
        "media_url_https": "https://pbs.twimg.com/tweet_video_thumb/HLbdMdlasAAVXPZ.jpg",
        "original_info": {
          "focus_rects": [],
          "height": 720,
          "width": 1280
        },
        "sizes": {
          "large": {
            "h": 720,
            "w": 1280
          }
        },
        "type": "animated_gif",
        "url": "https://t.co/Gp96FEQ797",
        "video_info": {
          "aspect_ratio": [
            16,
            9
          ],
          "variants": [
            {
              "bitrate": 0,
              "content_type": "video/mp4",
              "url": "https://video.twimg.com/tweet_video/HLbdMdlasAAVXPZ.mp4"
            }
          ]
        }
      }
    ]
  },
  "card": null,
  "place": {},
  "entities": {
    "hashtags": [],
    "symbols": [],
    "urls": [],
    "user_mentions": []
  },
  "quoted_tweet": null,
  "retweeted_tweet": null,
  "isLimitedReply": false,
  "communityInfo": null,
  "article": null
}