🐦 Twitter Post Details

Viewing enriched Twitter post

@omarsar0

This is a super interesting paper on multi-agents for code patching. Claims SOTA on the SWE-bench Verified leaderboard (79.4%). Why this matters: Automated bug fixing is improving fast. But there's a catch. Patches that pass existing tests often fail on edge cases. The tests weren't designed to stress the fix. The fix wasn't designed to handle unusual inputs. Both are developed in isolation. This creates fragile patches that work in testing but break in production. This new research introduces InfCode, a framework where tests and patches challenge each other through adversarial iteration. The key idea: treat test generation and patch creation as opposing forces. Tests try to break patches. Patches evolve to survive. Both get stronger through conflict. The framework operates in cycles. Generate tests designed to expose patch weaknesses. Refine patches to handle those failures. Generate harder tests. Repeat until the patch is robust. What makes this powerful: patches earn their reliability. They don't just pass tests designed before the fix existed. They survive tests specifically crafted to break them. Evaluated on SWE-Bench Verified, the approach shows measurable gains in patch quality and coverage. Leads to fewer regressions and more robust fixes. Paper: https://t.co/DvAIxIKiPK

View on Twitter

📊 Media Metadata

{
  "media": [
    {
      "url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/1993725704392077630/media_0.jpg?",
      "media_url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/1993725704392077630/media_0.jpg?",
      "type": "photo",
      "filename": "media_0.jpg"
    }
  ],
  "processed_at": "2025-12-04T20:38:16.814956",
  "pipeline_version": "2.0"
}

🔧 Raw API Response

{
  "type": "tweet",
  "id": "1993725704392077630",
  "url": "https://x.com/omarsar0/status/1993725704392077630",
  "twitterUrl": "https://twitter.com/omarsar0/status/1993725704392077630",
  "text": "This is a super interesting paper on multi-agents for code patching.\n\nClaims SOTA on the SWE-bench Verified leaderboard (79.4%).\n\nWhy this matters:\n\nAutomated bug fixing is improving fast. But there's a catch. Patches that pass existing tests often fail on edge cases. The tests weren't designed to stress the fix. The fix wasn't designed to handle unusual inputs. Both are developed in isolation.\n\nThis creates fragile patches that work in testing but break in production.\n\nThis new research introduces InfCode, a framework where tests and patches challenge each other through adversarial iteration.\n\nThe key idea: treat test generation and patch creation as opposing forces. Tests try to break patches. Patches evolve to survive. Both get stronger through conflict.\n\nThe framework operates in cycles. Generate tests designed to expose patch weaknesses. Refine patches to handle those failures. Generate harder tests. Repeat until the patch is robust.\n\nWhat makes this powerful: patches earn their reliability. They don't just pass tests designed before the fix existed. They survive tests specifically crafted to break them.\n\nEvaluated on SWE-Bench Verified, the approach shows measurable gains in patch quality and coverage. Leads to fewer regressions and more robust fixes.\n\nPaper: https://t.co/DvAIxIKiPK",
  "source": "Twitter for iPhone",
  "retweetCount": 38,
  "replyCount": 5,
  "likeCount": 184,
  "quoteCount": 5,
  "viewCount": 18606,
  "createdAt": "Wed Nov 26 16:57:04 +0000 2025",
  "lang": "en",
  "bookmarkCount": 155,
  "isReply": false,
  "inReplyToId": null,
  "conversationId": "1993725704392077630",
  "displayTextRange": [
    0,
    279
  ],
  "inReplyToUserId": null,
  "inReplyToUsername": null,
  "author": {
    "type": "user",
    "userName": "omarsar0",
    "url": "https://x.com/omarsar0",
    "twitterUrl": "https://twitter.com/omarsar0",
    "id": "3448284313",
    "name": "elvis",
    "isVerified": false,
    "isBlueVerified": true,
    "verifiedType": null,
    "profilePicture": "https://pbs.twimg.com/profile_images/939313677647282181/vZjFWtAn_normal.jpg",
    "coverPicture": "https://pbs.twimg.com/profile_banners/3448284313/1565974901",
    "description": "",
    "location": "DAIR.AI Academy",
    "followers": 277867,
    "following": 724,
    "status": "",
    "canDm": true,
    "canMediaTag": true,
    "createdAt": "Fri Sep 04 12:59:26 +0000 2015",
    "entities": {
      "description": {
        "urls": []
      },
      "url": {}
    },
    "fastFollowersCount": 0,
    "favouritesCount": 33719,
    "hasCustomTimelines": true,
    "isTranslator": true,
    "mediaCount": 4356,
    "statusesCount": 16656,
    "withheldInCountries": [],
    "affiliatesHighlightedLabel": {},
    "possiblySensitive": false,
    "pinnedTweetIds": [
      "1996595107924263287"
    ],
    "profile_bio": {
      "description": "Building agents @dair_ai • Ex Meta AI, Elastic, PhD • Sharing research & insights on AI Agents • New cohort: https://t.co/tn8LKG5d20",
      "entities": {
        "description": {
          "urls": [
            {
              "display_url": "dair-ai.thinkific.com/courses/claude…",
              "expanded_url": "https://dair-ai.thinkific.com/courses/claude-code",
              "indices": [
                109,
                132
              ],
              "url": "https://t.co/tn8LKG5d20"
            }
          ],
          "user_mentions": [
            {
              "id_str": "0",
              "indices": [
                16,
                24
              ],
              "name": "",
              "screen_name": "dair_ai"
            }
          ]
        },
        "url": {
          "urls": [
            {
              "display_url": "dair-ai.thinkific.com",
              "expanded_url": "https://dair-ai.thinkific.com/",
              "indices": [
                0,
                23
              ],
              "url": "https://t.co/JBU5beHQNs"
            }
          ]
        }
      }
    },
    "isAutomated": false,
    "automatedBy": null
  },
  "extendedEntities": {
    "media": [
      {
        "display_url": "pic.twitter.com/obFehUzMIe",
        "expanded_url": "https://twitter.com/omarsar0/status/1993725704392077630/photo/1",
        "ext_media_availability": {
          "status": "Available"
        },
        "features": {
          "large": {
            "faces": [
              {
                "h": 65,
                "w": 65,
                "x": 145,
                "y": 645
              },
              {
                "h": 191,
                "w": 191,
                "x": 1705,
                "y": 443
              }
            ]
          },
          "orig": {
            "faces": [
              {
                "h": 86,
                "w": 86,
                "x": 192,
                "y": 851
              },
              {
                "h": 252,
                "w": 252,
                "x": 2247,
                "y": 584
              }
            ]
          }
        },
        "id_str": "1993725700751405056",
        "indices": [
          280,
          303
        ],
        "media_key": "3_1993725700751405056",
        "media_results": {
          "id": "QXBpTWVkaWFSZXN1bHRzOgwAAQoAARurIvYs2rAACgACG6si9wXa8T4AAA==",
          "result": {
            "__typename": "ApiMedia",
            "id": "QXBpTWVkaWE6DAABCgABG6si9izasAAKAAIbqyL3BdrxPgAA",
            "media_key": "3_1993725700751405056"
          }
        },
        "media_url_https": "https://pbs.twimg.com/media/G6si9izasAAEPEx.jpg",
        "original_info": {
          "focus_rects": [
            {
              "h": 1511,
              "w": 2698,
              "x": 0,
              "y": 0
            },
            {
              "h": 1648,
              "w": 1648,
              "x": 0,
              "y": 0
            },
            {
              "h": 1648,
              "w": 1446,
              "x": 0,
              "y": 0
            },
            {
              "h": 1648,
              "w": 824,
              "x": 0,
              "y": 0
            },
            {
              "h": 1648,
              "w": 2698,
              "x": 0,
              "y": 0
            }
          ],
          "height": 1648,
          "width": 2698
        },
        "sizes": {
          "large": {
            "h": 1251,
            "w": 2048
          }
        },
        "type": "photo",
        "url": "https://t.co/obFehUzMIe"
      }
    ]
  },
  "card": null,
  "place": {},
  "entities": {
    "urls": [
      {
        "display_url": "arxiv.org/abs/2511.16004",
        "expanded_url": "https://arxiv.org/abs/2511.16004",
        "indices": [
          1286,
          1309
        ],
        "url": "https://t.co/DvAIxIKiPK"
      }
    ]
  },
  "quoted_tweet": null,
  "retweeted_tweet": null,
  "isLimitedReply": false,
  "article": null
}