🐦 Twitter Post Details

@omarsar0

NEW Stanford & MIT paper on Model Harnesses.

Changing the harness around a fixed LLM can produce a 6x performance gap on the same benchmark.

What if we automated harness engineering itself?

The work introduces Meta-Harness, an agentic system that searches over harness code by exposing the full history through a filesystem.

The proposer reads source code, execution traces, and scores from all prior candidates, referencing over 20 past attempts per step.

On text classification, it improves over SOTA context management by 7.7 points while using 4x fewer tokens.

On agentic coding, it outperforms all hand-engineered baselines on TerminalBench-2, scoring 37.6% versus Claude Code's 27.5%.

This is a big deal! Here is why:

The harness around a model often matters as much as the model itself.

Meta-Harness shows that giving an optimizer rich access to prior experience, not just compressed scores, unlocks automated engineering that beats human-designed scaffolding.

Paper: https://t.co/hqkZaWbBTl

Learn to build effective AI agents in our academy: https://t.co/1e8RZKs4uX
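
To make the search loop described in the post concrete, here is a minimal sketch of a filesystem-backed history in which each step re-reads the source, trace, and score of every prior candidate before proposing the next one. It is an illustration only, not the paper's implementation: the directory layout (`candidate_NNN/`), the file names (`harness.py`, `trace.log`, `score.json`), and all function names are assumptions. In the paper the proposer is an agentic LLM; here a toy rule stands in for it.

```python
import json
import tempfile
from pathlib import Path
from typing import Callable


def load_history(root: Path) -> list[dict]:
    """Read source, trace, and score for every prior candidate from disk."""
    history = []
    for cand_dir in sorted(root.glob("candidate_*")):
        history.append({
            "name": cand_dir.name,
            "source": (cand_dir / "harness.py").read_text(),
            "trace": (cand_dir / "trace.log").read_text(),
            "score": json.loads((cand_dir / "score.json").read_text())["score"],
        })
    return history


def save_candidate(root: Path, index: int, source: str, trace: str, score: float) -> None:
    """Persist one candidate so later steps can inspect it in full."""
    cand_dir = root / f"candidate_{index:03d}"
    cand_dir.mkdir(parents=True)
    (cand_dir / "harness.py").write_text(source)
    (cand_dir / "trace.log").write_text(trace)
    (cand_dir / "score.json").write_text(json.dumps({"score": score}))


def search(
    root: Path,
    propose: Callable[[list[dict]], str],
    evaluate: Callable[[str], tuple[str, float]],
    steps: int,
) -> dict:
    """Run `steps` propose/evaluate rounds and return the best candidate."""
    for step in range(steps):
        history = load_history(root)      # full history, not just scores
        source = propose(history)         # hypothetical stand-in for the LLM proposer
        trace, score = evaluate(source)   # hypothetical stand-in for a benchmark run
        save_candidate(root, step, source, trace, score)
    return max(load_history(root), key=lambda c: c["score"])


def toy_propose(history: list[dict]) -> str:
    """Toy rule: bump the best candidate's threshold by one."""
    if not history:
        return "THRESHOLD = 0\n"
    best = max(history, key=lambda c: c["score"])
    threshold = int(best["source"].split("=")[1])
    return f"THRESHOLD = {threshold + 1}\n"


def toy_evaluate(source: str) -> tuple[str, float]:
    """Toy benchmark: the score peaks when the threshold equals 3."""
    threshold = int(source.split("=")[1])
    return f"ran with threshold={threshold}", -abs(threshold - 3)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        best = search(Path(tmp), toy_propose, toy_evaluate, steps=6)
        print(best["name"], best["source"].strip(), best["score"])
```

The design point the sketch tries to capture is that the proposer receives whole candidate records (code and traces), not only a list of scores.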

📊 Media Metadata

{
  "media": [
    {
      "type": "photo",
      "url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/2038967842075500870/media_0.jpg",
      "filename": "media_0.jpg"
    },
    {
      "type": "photo",
      "url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/2038967842075500870/media_1.png",
      "filename": "media_1.png"
    }
  ],
  "processed_at": "2026-03-31T13:14:58.383465",
  "pipeline_version": "2.0"
}

🔧 Raw API Response

{
  "type": "tweet",
  "id": "2038967842075500870",
  "url": "https://x.com/omarsar0/status/2038967842075500870",
  "twitterUrl": "https://twitter.com/omarsar0/status/2038967842075500870",
  "text": "NEW Stanford & MIT paper on Model Harnesses.\n\nChanging the harness around a fixed LLM can produce a 6x performance gap on the same benchmark.\n\nWhat if we automated harness engineering itself?\n\nThe work introduces Meta-Harness, an agentic system that searches over harness code by exposing the full history through a filesystem.\n\nThe proposer reads source code, execution traces, and scores from all prior candidates, referencing over 20 past attempts per step.\n\nOn text classification, it improves over SOTA context management by 7.7 points while using 4x fewer tokens.\n\nOn agentic coding, it outperforms all hand-engineered baselines on TerminalBench-2, scoring 37.6% versus Claude Code's 27.5%.\n\nThis is a big deal! Here is why:\n\nThe harness around a model often matters as much as the model itself.\n\nMeta-Harness shows that giving an optimizer rich access to prior experience, not just compressed scores, unlocks automated engineering that beats human-designed scaffolding.\n\nPaper: https://t.co/hqkZaWbBTl\n\nLearn to build effective AI agents in our academy: https://t.co/1e8RZKs4uX",
  "source": "Twitter for iPhone",
  "retweetCount": 2,
  "replyCount": 0,
  "likeCount": 2,
  "quoteCount": 0,
  "viewCount": 103,
  "createdAt": "Tue Mar 31 13:13:10 +0000 2026",
  "lang": "en",
  "bookmarkCount": 4,
  "isReply": false,
  "inReplyToId": null,
  "conversationId": "2038967842075500870",
  "displayTextRange": [
    0,
    283
  ],
  "inReplyToUserId": null,
  "inReplyToUsername": null,
  "author": {
    "type": "user",
    "userName": "omarsar0",
    "url": "https://x.com/omarsar0",
    "twitterUrl": "https://twitter.com/omarsar0",
    "id": "3448284313",
    "name": "elvis",
    "isVerified": false,
    "isBlueVerified": true,
    "verifiedType": null,
    "profilePicture": "https://pbs.twimg.com/profile_images/939313677647282181/vZjFWtAn_normal.jpg",
    "coverPicture": "https://pbs.twimg.com/profile_banners/3448284313/1565974901",
    "description": "",
    "location": "DAIR.AI Academy",
    "followers": 296157,
    "following": 797,
    "status": "",
    "canDm": true,
    "canMediaTag": true,
    "createdAt": "Fri Sep 04 12:59:26 +0000 2015",
    "entities": {
      "description": {
        "urls": []
      },
      "url": {}
    },
    "fastFollowersCount": 0,
    "favouritesCount": 35250,
    "hasCustomTimelines": true,
    "isTranslator": true,
    "mediaCount": 4572,
    "statusesCount": 17548,
    "withheldInCountries": [],
    "affiliatesHighlightedLabel": {},
    "possiblySensitive": false,
    "pinnedTweetIds": [
      "2038656035666247824"
    ],
    "profile_bio": {
      "description": "Building @dair_ai • Prev: Meta AI, Elastic, PhD • New AI learning portal: https://t.co/1e8RZKs4uX",
      "entities": {
        "description": {
          "hashtags": [],
          "symbols": [],
          "urls": [
            {
              "display_url": "academy.dair.ai",
              "expanded_url": "https://academy.dair.ai/",
              "indices": [
                74,
                97
              ],
              "url": "https://t.co/1e8RZKs4uX"
            }
          ],
          "user_mentions": [
            {
              "id_str": "0",
              "indices": [
                9,
                17
              ],
              "name": "",
              "screen_name": "dair_ai"
            }
          ]
        },
        "url": {
          "urls": [
            {
              "display_url": "dair.ai",
              "expanded_url": "https://www.dair.ai/",
              "indices": [
                0,
                23
              ],
              "url": "https://t.co/XQto5ypSIk"
            }
          ]
        }
      }
    },
    "isAutomated": false,
    "automatedBy": null
  },
  "extendedEntities": {
    "media": [
      {
        "display_url": "pic.twitter.com/EHvnFNcn1c",
        "expanded_url": "https://twitter.com/omarsar0/status/2038967842075500870/photo/1",
        "ext_media_availability": {
          "status": "Available"
        },
        "features": {
          "large": {
            "faces": []
          },
          "orig": {
            "faces": []
          }
        },
        "id_str": "2038967838904602624",
        "indices": [
          284,
          307
        ],
        "media_key": "3_2038967838904602624",
        "media_results": {
          "id": "QXBpTWVkaWFSZXN1bHRzOgwAAQoAARxL3nJSWtAACgACHEvecw9a4UYAAA==",
          "result": {
            "__typename": "ApiMedia",
            "id": "QXBpTWVkaWE6DAABCgABHEveclJa0AAKAAIcS95zD1rhRgAA",
            "media_key": "3_2038967838904602624"
          }
        },
        "media_url_https": "https://pbs.twimg.com/media/HEveclJa0AAux36.jpg",
        "original_info": {
          "focus_rects": [
            {
              "h": 950,
              "w": 1696,
              "x": 0,
              "y": 0
            },
            {
              "h": 1624,
              "w": 1624,
              "x": 72,
              "y": 0
            },
            {
              "h": 1624,
              "w": 1425,
              "x": 263,
              "y": 0
            },
            {
              "h": 1624,
              "w": 812,
              "x": 569,
              "y": 0
            },
            {
              "h": 1624,
              "w": 1696,
              "x": 0,
              "y": 0
            }
          ],
          "height": 1624,
          "width": 1696
        },
        "sizes": {
          "large": {
            "h": 1624,
            "w": 1696
          }
        },
        "type": "photo",
        "url": "https://t.co/EHvnFNcn1c"
      }
    ]
  },
  "card": null,
  "place": {},
  "entities": {
    "hashtags": [],
    "symbols": [],
    "urls": [
      {
        "display_url": "arxiv.org/abs/2603.28052",
        "expanded_url": "https://arxiv.org/abs/2603.28052",
        "indices": [
          985,
          1008
        ],
        "url": "https://t.co/hqkZaWbBTl"
      },
      {
        "display_url": "academy.dair.ai",
        "expanded_url": "https://academy.dair.ai/",
        "indices": [
          1061,
          1084
        ],
        "url": "https://t.co/1e8RZKs4uX"
      }
    ],
    "user_mentions": []
  },
  "quoted_tweet": null,
  "retweeted_tweet": null,
  "isLimitedReply": false,
  "article": null
}
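
The `text` field above contains shortened `t.co` links, while `entities.urls` carries the matching `expanded_url` for each. As a small sketch of how a consumer of this response might resolve them, the helper below substitutes each short link with its expanded form. The function name `expand_links` is hypothetical; the field names are taken from the response shown above, and the sample dictionary is an abbreviated copy of it.

```python
def expand_links(tweet: dict) -> str:
    """Return the tweet text with each t.co link replaced by its expanded URL."""
    text = tweet["text"]
    for link in tweet.get("entities", {}).get("urls", []):
        # Plain string substitution avoids depending on the `indices` offsets.
        text = text.replace(link["url"], link["expanded_url"])
    return text


# Abbreviated copy of the response above, for illustration only.
sample = {
    "text": "Paper: https://t.co/hqkZaWbBTl",
    "entities": {
        "urls": [
            {
                "url": "https://t.co/hqkZaWbBTl",
                "expanded_url": "https://arxiv.org/abs/2603.28052",
            }
        ]
    },
}

print(expand_links(sample))
```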