@dair_ai
Agent skills look great in demos. Hand an agent a curated toolbox, and it shines. But what happens when it has to find the right skill on its own from a large, unfiltered collection?

New research benchmarks LLM skill usage in realistic settings and finds that the performance gains from skills degrade steadily as conditions become more realistic, with pass rates approaching no-skill baselines. The proposed fix is query-specific skill refinement, which recovers much of the lost performance: on Terminal-Bench 2.0, it lifted Claude Opus 4.6's pass rate from 57.7% to 65.5%.

As skill and tool ecosystems grow, agents won't have curated toolboxes handed to them. They'll face noisy, overlapping, and irrelevant options.

Paper: https://t.co/Dm7JxredRI

Learn to build effective AI agents in our academy: https://t.co/LRnpZN7L4c
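A minimal sketch of what query-specific skill refinement could look like in practice: retrieve a handful of candidate skills from the pool, then have the model prune and rewrite them for the specific query before the agent starts acting. The two-stage design, the `llm` callable, and the keyword-overlap retrieval are illustrative assumptions, not the paper's actual method.

```python
# Sketch of query-specific skill refinement (assumptions: a generic `llm`
# completion function and naive keyword-overlap retrieval; not the paper's
# implementation).
from dataclasses import dataclass
from typing import Callable

@dataclass
class Skill:
    name: str
    description: str   # short summary of what the skill does
    instructions: str  # full skill body normally handed to the agent

def retrieve_candidates(query: str, pool: list[Skill], k: int = 5) -> list[Skill]:
    """Cheap first pass: rank skills by word overlap with the query."""
    q = set(query.lower().split())
    scored = sorted(pool, key=lambda s: -len(q & set(s.description.lower().split())))
    return scored[:k]

def refine_skills(query: str, candidates: list[Skill],
                  llm: Callable[[str], str]) -> str:
    """Second pass: ask the model to discard skills irrelevant to THIS query
    and rewrite the rest into a compact, query-specific brief."""
    listing = "\n".join(f"- {s.name}: {s.description}" for s in candidates)
    prompt = (
        f"Task: {query}\n\n"
        f"Candidate skills:\n{listing}\n\n"
        "Discard skills irrelevant to this task. For each remaining skill, "
        "rewrite its instructions as short, task-specific guidance."
    )
    return llm(prompt)

# Usage (hypothetical): the refined brief, not the raw skill pool, goes into
# the agent's context before it acts.
# brief = refine_skills(task, retrieve_candidates(task, all_skills), llm=my_llm)
```

The point of the second stage is that relevance is judged against the concrete query rather than against a static catalog, which is where the paper finds unfiltered skill collections break down.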