🐦 Twitter Post Details

@dair_ai

Why do RL runs on LLMs blow up even when the recipe looks right?

GEOALIGN, from the Alibaba team behind Qwen, points at the rollouts. A handful of bad batches push the policy in incoherent directions, and most stability tuning just damps the symptom. This work curates rollouts by their geometry, removing the samples that make update directions conflict before they destabilize training.

Why does it matter?

If instability is largely a bad-batch problem, rollout curation is a lower-effort lever than another round of KL or clip tuning. You fix the data going into the update rather than fighting the optimizer.

Paper: https://t.co/tUAYC57MVy

Learn to build effective AI agents in our academy: https://t.co/LRnpZN7L4c
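The post does not say which geometric criterion GEOALIGN uses, so the following is only a toy illustration of the general idea of dropping samples whose update direction conflicts with the rest of the batch. It is not the paper's algorithm. It assumes each rollout has already been reduced to a small update vector (for example a per-sample gradient), and the names `curate_by_direction`, `updates`, and `threshold` are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def mean_vector(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def curate_by_direction(updates, threshold=0.0):
    """Return indices of samples whose update agrees with the batch mean direction.

    Assumption: "conflict" is approximated as cosine similarity with the
    batch-mean update at or below `threshold`. The paper may define it differently.
    """
    if not updates:
        return []
    consensus = mean_vector(updates)
    return [i for i, u in enumerate(updates) if cosine(u, consensus) > threshold]


# Hypothetical per-rollout update vectors; the last one points the opposite way.
updates = [
    [1.0, 0.9],
    [0.8, 1.1],
    [1.2, 1.0],
    [-1.0, -0.8],
]
kept = curate_by_direction(updates)
print(kept)  # [0, 1, 2]
```

In this sketch the filtering happens before the optimizer step, which matches the post's framing of fixing the data going into the update instead of retuning KL or clip settings.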

📊 Media Metadata

{
  "media": [
    {
      "type": "photo",
      "url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/2071291294593564941/media_0.jpg",
      "filename": "media_0.jpg"
    }
  ],
  "processed_at": "2026-06-29T15:02:15.104494",
  "pipeline_version": "2.0"
}

🔧 Raw API Response

{
  "type": "tweet",
  "id": "2071291294593564941",
  "url": "https://x.com/dair_ai/status/2071291294593564941",
  "twitterUrl": "https://twitter.com/dair_ai/status/2071291294593564941",
  "text": "Why do RL runs on LLMs blow up even when the recipe looks right?\n\nGEOALIGN, from the Alibaba team behind Qwen, points at the rollouts. A handful of bad batches push the policy in incoherent directions, and most stability tuning just damps the symptom. This work curates rollouts by their geometry, removing the samples that make update directions conflict before they destabilize training.\n\nWhy does it matter?\n\nIf instability is largely a bad-batch problem, rollout curation is a lower-effort lever than another round of KL or clip tuning. You fix the data going into the update rather than fighting the optimizer.\n\nPaper: https://t.co/tUAYC57MVy\n\nLearn to build effective AI agents in our academy: https://t.co/LRnpZN7L4c",
  "source": "Twitter for iPhone",
  "retweetCount": 12,
  "replyCount": 2,
  "likeCount": 78,
  "quoteCount": 2,
  "viewCount": 8797,
  "createdAt": "Sun Jun 28 17:55:02 +0000 2026",
  "lang": "en",
  "bookmarkCount": 51,
  "isReply": false,
  "inReplyToId": null,
  "conversationId": "2071291294593564941",
  "displayTextRange": [
    0,
    278
  ],
  "inReplyToUserId": null,
  "inReplyToUsername": null,
  "author": {
    "type": "user",
    "userName": "dair_ai",
    "url": "https://x.com/dair_ai",
    "twitterUrl": "https://twitter.com/dair_ai",
    "id": "889050642903293953",
    "name": "DAIR.AI",
    "isVerified": false,
    "isBlueVerified": true,
    "verifiedType": null,
    "profilePicture": "https://pbs.twimg.com/profile_images/1643277398522187778/31dedbLo_normal.jpg",
    "coverPicture": "https://pbs.twimg.com/profile_banners/889050642903293953/1773242460",
    "description": "",
    "location": "",
    "followers": 127062,
    "following": 1,
    "status": "",
    "canDm": true,
    "canMediaTag": true,
    "createdAt": "Sun Jul 23 09:12:45 +0000 2017",
    "entities": {
      "description": {
        "urls": []
      },
      "url": {}
    },
    "fastFollowersCount": 0,
    "favouritesCount": 4678,
    "hasCustomTimelines": true,
    "isTranslator": false,
    "mediaCount": 260,
    "statusesCount": 3380,
    "withheldInCountries": [],
    "affiliatesHighlightedLabel": {},
    "possiblySensitive": false,
    "pinnedTweetIds": [
      "2071260533383135531"
    ],
    "profile_bio": {
      "description": "Democratizing AI research, education, and technologies. Learn about AI Agents for FREE at https://t.co/HHXg8rryu4",
      "entities": {
        "description": {
          "urls": [
            {
              "display_url": "academy.dair.ai/courses/elemen…",
              "expanded_url": "https://academy.dair.ai/courses/elements-of-ai-agents",
              "indices": [
                90,
                113
              ],
              "url": "https://t.co/HHXg8rryu4"
            }
          ]
        },
        "url": {
          "urls": [
            {
              "display_url": "dair.ai",
              "expanded_url": "https://www.dair.ai/",
              "indices": [
                0,
                23
              ],
              "url": "https://t.co/lkqPZtMU5s"
            }
          ]
        }
      }
    },
    "isAutomated": false,
    "automatedBy": null
  },
  "extendedEntities": {
    "media": [
      {
        "display_url": "pic.twitter.com/YekcRpsslL",
        "expanded_url": "https://twitter.com/dair_ai/status/2071291294593564941/photo/1",
        "ext_master_playlist_only": [],
        "ext_media_availability": {
          "status": "Available"
        },
        "ext_playlists": [],
        "features": {
          "large": {
            "faces": []
          },
          "orig": {
            "faces": []
          }
        },
        "id_str": "2071291290785038336",
        "indices": [
          279,
          302
        ],
        "media_key": "3_2071291290785038336",
        "media_results": {
          "id": "QXBpTWVkaWFSZXN1bHRzOgwAAQoAARy+tHRtmkAACgACHL60dVCbwQ0AAA==",
          "result": {
            "__typename": "ApiMedia",
            "id": "QXBpTWVkaWE6DAABCgABHL60dG2aQAAKAAIcvrR1UJvBDQAA",
            "media_key": "3_2071291290785038336"
          }
        },
        "media_url_https": "https://pbs.twimg.com/media/HL60dG2aQAAwa_j.jpg",
        "original_info": {
          "focus_rects": [
            {
              "h": 900,
              "w": 1608,
              "x": 0,
              "y": 0
            },
            {
              "h": 1608,
              "w": 1608,
              "x": 0,
              "y": 0
            },
            {
              "h": 1833,
              "w": 1608,
              "x": 0,
              "y": 0
            },
            {
              "h": 1866,
              "w": 933,
              "x": 46,
              "y": 0
            },
            {
              "h": 1866,
              "w": 1608,
              "x": 0,
              "y": 0
            }
          ],
          "height": 1866,
          "width": 1608
        },
        "sizes": {
          "large": {
            "h": 1866,
            "w": 1608
          }
        },
        "type": "photo",
        "url": "https://t.co/YekcRpsslL"
      }
    ]
  },
  "card": null,
  "place": {},
  "entities": {
    "hashtags": [],
    "symbols": [],
    "urls": [
      {
        "display_url": "arxiv.org/abs/2606.26917",
        "expanded_url": "https://arxiv.org/abs/2606.26917",
        "indices": [
          624,
          647
        ],
        "url": "https://t.co/tUAYC57MVy"
      },
      {
        "display_url": "academy.dair.ai",
        "expanded_url": "https://academy.dair.ai/",
        "indices": [
          700,
          723
        ],
        "url": "https://t.co/LRnpZN7L4c"
      }
    ],
    "user_mentions": []
  },
  "quoted_tweet": null,
  "retweeted_tweet": null,
  "isLimitedReply": false,
  "communityInfo": null,
  "article": null
}