🐦 Twitter Post Details

Viewing enriched Twitter post

@omarsar0

One Tool is Enough. When building AI agents, more tools often lead to more failure points. This new research introduces RepoNavigator, an LLM agent equipped with a single execution-aware tool: jump. It retrieves the definition of any symbol in a given file, mirroring actual code execution semantics. They find that a single capable tool outperforms multiple narrow-scope tools executed in sequence. RepoNavigator is trained end-to-end via RL directly from pretrained Qwen models, with no closed-source distillation required. The reward combines localization accuracy (Dice score) with tool-calling success rate. On SWE-bench Verified, the 7B model outperforms 14B baselines. The 14B model surpasses 32B competitors. The 32B model exceeds Claude-3.7-Sonnet. The access scope of "jump" is inherently smaller than the full repository. By recursively resolving symbol references from an entry point, the tool traverses only semantically relevant code paths. This constrained search space yields higher precision without sacrificing recall. Ablations confirm that adding tools like GetClass, GetFunc, and GetStruc actually decreases performance. IoU drops from 24.28% with jump alone to 13.71% with all four tools. Tool design for agents should prioritize capability over quantity. One execution-aware tool, jointly optimized with RL, provides stronger robustness than multi-tool pipelines.

View on Twitter

📊 Media Metadata

{
  "media": [
    {
      "url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/2005999079265034729/media_0.jpg?",
      "media_url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/2005999079265034729/media_0.jpg?",
      "type": "photo",
      "filename": "media_0.jpg"
    }
  ],
  "processed_at": "2025-12-31T02:47:51.569620",
  "pipeline_version": "2.0"
}

🔧 Raw API Response

{
  "type": "tweet",
  "id": "2005999079265034729",
  "url": "https://x.com/omarsar0/status/2005999079265034729",
  "twitterUrl": "https://twitter.com/omarsar0/status/2005999079265034729",
  "text": "One Tool is Enough.\n\nWhen building AI agents, more tools often lead to more failure points.\n\nThis new research introduces RepoNavigator, an LLM agent equipped with a single execution-aware tool: jump. It retrieves the definition of any symbol in a given file, mirroring actual code execution semantics.\n\nThey find that a single capable tool outperforms multiple narrow-scope tools executed in sequence.\n\nRepoNavigator is trained end-to-end via RL directly from pretrained Qwen models, with no closed-source distillation required. The reward combines localization accuracy (Dice score) with tool-calling success rate.\n\nOn SWE-bench Verified, the 7B model outperforms 14B baselines. The 14B model surpasses 32B competitors. The 32B model exceeds Claude-3.7-Sonnet.\n\nThe access scope of \"jump\" is inherently smaller than the full repository. By recursively resolving symbol references from an entry point, the tool traverses only semantically relevant code paths. This constrained search space yields higher precision without sacrificing recall.\n\nAblations confirm that adding tools like GetClass, GetFunc, and GetStruc actually decreases performance. IoU drops from 24.28% with jump alone to 13.71% with all four tools.\n\nTool design for agents should prioritize capability over quantity. One execution-aware tool, jointly optimized with RL, provides stronger robustness than multi-tool pipelines.",
  "source": "Twitter for iPhone",
  "retweetCount": 17,
  "replyCount": 13,
  "likeCount": 172,
  "quoteCount": 1,
  "viewCount": 13094,
  "createdAt": "Tue Dec 30 13:47:04 +0000 2025",
  "lang": "en",
  "bookmarkCount": 149,
  "isReply": false,
  "inReplyToId": null,
  "conversationId": "2005999079265034729",
  "displayTextRange": [
    0,
    301
  ],
  "inReplyToUserId": null,
  "inReplyToUsername": null,
  "author": {
    "type": "user",
    "userName": "omarsar0",
    "url": "https://x.com/omarsar0",
    "twitterUrl": "https://twitter.com/omarsar0",
    "id": "3448284313",
    "name": "elvis",
    "isVerified": false,
    "isBlueVerified": true,
    "verifiedType": null,
    "profilePicture": "https://pbs.twimg.com/profile_images/939313677647282181/vZjFWtAn_normal.jpg",
    "coverPicture": "https://pbs.twimg.com/profile_banners/3448284313/1565974901",
    "description": "",
    "location": "DAIR.AI Academy",
    "followers": 282031,
    "following": 752,
    "status": "",
    "canDm": true,
    "canMediaTag": true,
    "createdAt": "Fri Sep 04 12:59:26 +0000 2015",
    "entities": {
      "description": {
        "urls": []
      },
      "url": {}
    },
    "fastFollowersCount": 0,
    "favouritesCount": 34254,
    "hasCustomTimelines": true,
    "isTranslator": true,
    "mediaCount": 4420,
    "statusesCount": 16895,
    "withheldInCountries": [],
    "affiliatesHighlightedLabel": {},
    "possiblySensitive": false,
    "pinnedTweetIds": [
      "2006004138220605920"
    ],
    "profile_bio": {
      "description": "Building @dair_ai • Prev: Meta AI, Elastic, PhD • New cohort: https://t.co/GZMhf39NRs",
      "entities": {
        "description": {
          "urls": [
            {
              "display_url": "dair-ai.thinkific.com/courses/claude…",
              "expanded_url": "https://dair-ai.thinkific.com/courses/claude-code-for-everyone-2",
              "indices": [
                62,
                85
              ],
              "url": "https://t.co/GZMhf39NRs"
            }
          ],
          "user_mentions": [
            {
              "id_str": "0",
              "indices": [
                9,
                17
              ],
              "name": "",
              "screen_name": "dair_ai"
            }
          ]
        },
        "url": {
          "urls": [
            {
              "display_url": "dair.ai",
              "expanded_url": "https://www.dair.ai/",
              "indices": [
                0,
                23
              ],
              "url": "https://t.co/XQto5ypkSM"
            }
          ]
        }
      }
    },
    "isAutomated": false,
    "automatedBy": null
  },
  "extendedEntities": {
    "media": [
      {
        "display_url": "pic.twitter.com/X9mO00jVRS",
        "expanded_url": "https://twitter.com/omarsar0/status/2005999079265034729/photo/1",
        "ext_media_availability": {
          "status": "Available"
        },
        "features": {
          "large": {},
          "orig": {}
        },
        "id_str": "2005999075825704965",
        "indices": [
          302,
          325
        ],
        "media_key": "3_2005999075825704965",
        "media_results": {
          "id": "QXBpTWVkaWFSZXN1bHRzOgwAAQoAARvWvYf+WmAFCgACG9a9iMtaYekAAA==",
          "result": {
            "__typename": "ApiMedia",
            "id": "QXBpTWVkaWE6DAABCgABG9a9h/5aYAUKAAIb1r2Iy1ph6QAA",
            "media_key": "3_2005999075825704965"
          }
        },
        "media_url_https": "https://pbs.twimg.com/media/G9a9h_5aYAUQoSG.jpg",
        "original_info": {
          "focus_rects": [
            {
              "h": 904,
              "w": 1614,
              "x": 0,
              "y": 0
            },
            {
              "h": 1614,
              "w": 1614,
              "x": 0,
              "y": 0
            },
            {
              "h": 1800,
              "w": 1579,
              "x": 35,
              "y": 0
            },
            {
              "h": 1800,
              "w": 900,
              "x": 714,
              "y": 0
            },
            {
              "h": 1800,
              "w": 1614,
              "x": 0,
              "y": 0
            }
          ],
          "height": 1800,
          "width": 1614
        },
        "sizes": {
          "large": {
            "h": 1800,
            "w": 1614
          }
        },
        "type": "photo",
        "url": "https://t.co/X9mO00jVRS"
      }
    ]
  },
  "card": null,
  "place": {},
  "entities": {},
  "quoted_tweet": null,
  "retweeted_tweet": null,
  "isLimitedReply": false,
  "article": null
}