🐦 Twitter Post Details

@stevibe

Introducing HermesAgent-20, a new Bench Pack for BenchLocal.

20 scenarios extracted straight from the Hermes Agent source code, run against a REAL Hermes instance. The actual workload you'd put your model through.

Why I built BenchLocal in the first place: most benchmarks are too abstract. We use local LLMs for practical work, and finding the right model for YOUR task efficiently is the single most important thing, especially when you're constrained to what fits on your machine.

BenchLocal is a framework: providers, models, side-by-side comparison, all in one UI.

Bench Packs are the unit of testing: ToolCall-15 and BugFind-15 shipped first, and when I launched the BenchLocal 0.1.0, added StructOutput, ReasonMath, InstructFollow, DataExtract.

Now, HermesAgent-20 is the newest.

Bench Packs install like VS Code extensions. The SDK is open, write your own, share it, grow the ecosystem. Here's the goal: a community-built, practical evaluation layer for the local LLM space.

Early numbers on HermesAgent-20:
> GLM 5.1 — 85
> Gemma4 31B — 83
> Qwen3.5 27B — 79
> MiniMax M2.7 — 76

Upgrade to the latest BenchLocal to install HermesAgent-20 (SDK update required).

📊 Media Metadata

{
  "media": [
    {
      "url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/2045165824294658539/media_0.mp4",
      "media_url": "https://crmoxkoizveukayfjuyo.supabase.co/storage/v1/object/public/media/posts/2045165824294658539/media_0.mp4",
      "type": "video",
      "filename": "media_0.mp4"
    }
  ],
  "processed_at": "2026-04-17T21:51:46.808717",
  "pipeline_version": "2.0"
}

🔧 Raw API Response

{
  "type": "tweet",
  "id": "2045165824294658539",
  "url": "https://x.com/stevibe/status/2045165824294658539",
  "twitterUrl": "https://twitter.com/stevibe/status/2045165824294658539",
  "text": "Introducing HermesAgent-20, a new Bench Pack for BenchLocal.\n\n20 scenarios extracted straight from the Hermes Agent source code, run against a REAL Hermes instance. The actual workload you'd put your model through.\n\nWhy I built BenchLocal in the first place: most benchmarks are too abstract. We use local LLMs for practical work, and finding the right model for YOUR task efficiently is the single most important thing, especially when you're constrained to what fits on your machine.\n\nBenchLocal is a framework: providers, models, side-by-side comparison, all in one UI.\n\nBench Packs are the unit of testing: ToolCall-15 and BugFind-15 shipped first, and when I launched the BenchLocal 0.1.0, added StructOutput, ReasonMath, InstructFollow, DataExtract.\n\nNow, HermesAgent-20 is the newest.\n\nBench Packs install like VS Code extensions. The SDK is open, write your own, share it, grow the ecosystem. Here's the goal: a community-built, practical evaluation layer for the local LLM space.\n\nEarly numbers on HermesAgent-20:\n> GLM 5.1 — 85\n> Gemma4 31B — 83\n> Qwen3.5 27B — 79\n> MiniMax M2.7 — 76\n\nUpgrade to the latest BenchLocal to install HermesAgent-20 (SDK update required).",
  "source": "Twitter for iPhone",
  "retweetCount": 18,
  "replyCount": 16,
  "likeCount": 151,
  "quoteCount": 3,
  "viewCount": 9716,
  "createdAt": "Fri Apr 17 15:41:44 +0000 2026",
  "lang": "en",
  "bookmarkCount": 101,
  "isReply": false,
  "inReplyToId": null,
  "conversationId": "2045165824294658539",
  "displayTextRange": [
    0,
    278
  ],
  "inReplyToUserId": null,
  "inReplyToUsername": null,
  "author": {
    "type": "user",
    "userName": "stevibe",
    "url": "https://x.com/stevibe",
    "twitterUrl": "https://twitter.com/stevibe",
    "id": "56969965",
    "name": "stevibe",
    "isVerified": false,
    "isBlueVerified": true,
    "verifiedType": null,
    "profilePicture": "https://pbs.twimg.com/profile_images/1476819230557614081/dIp-8a5r_normal.jpg",
    "coverPicture": "https://pbs.twimg.com/profile_banners/56969965/1771165329",
    "description": "",
    "location": "",
    "followers": 19579,
    "following": 1533,
    "status": "",
    "canDm": true,
    "canMediaTag": true,
    "createdAt": "Wed Jul 15 09:07:22 +0000 2009",
    "entities": {
      "description": {
        "urls": []
      },
      "url": {}
    },
    "fastFollowersCount": 0,
    "favouritesCount": 10364,
    "hasCustomTimelines": true,
    "isTranslator": false,
    "mediaCount": 285,
    "statusesCount": 3047,
    "withheldInCountries": [],
    "affiliatesHighlightedLabel": {},
    "possiblySensitive": false,
    "pinnedTweetIds": [
      "2036809734611988818"
    ],
    "profile_bio": {
      "description": "LLM. Local AI addict. Building @BenchLocalApp\nBuilds things nobody asked for. Benchmarks things for fun.",
      "entities": {
        "description": {
          "hashtags": [],
          "symbols": [],
          "urls": [],
          "user_mentions": [
            {
              "id_str": "0",
              "indices": [
                31,
                45
              ],
              "name": "",
              "screen_name": "BenchLocalApp"
            }
          ]
        },
        "url": {
          "urls": [
            {
              "display_url": "stevibe.ai",
              "expanded_url": "http://stevibe.ai",
              "indices": [
                0,
                23
              ],
              "url": "https://t.co/5G1EU5g5jG"
            }
          ]
        }
      }
    },
    "isAutomated": false,
    "automatedBy": null
  },
  "extendedEntities": {
    "media": [
      {
        "additional_media_info": {
          "monetizable": true
        },
        "allow_download_status": {
          "allow_download": true
        },
        "display_url": "pic.twitter.com/dE8FrYseJk",
        "expanded_url": "https://twitter.com/stevibe/status/2045165824294658539/video/1",
        "ext_media_availability": {
          "status": "Available"
        },
        "id_str": "2045165548422791168",
        "indices": [
          279,
          302
        ],
        "media_key": "13_2045165548422791168",
        "media_results": {
          "id": "QXBpTWVkaWFSZXN1bHRzOgwABAoAARxh4zsEm4AAAAA=",
          "result": {
            "__typename": "ApiMedia",
            "id": "QXBpTWVkaWE6DAAECgABHGHjOwSbgAAAAA==",
            "media_key": "13_2045165548422791168"
          }
        },
        "media_url_https": "https://pbs.twimg.com/amplify_video_thumb/2045165548422791168/img/WK3Orkg6SaBGEViU.jpg",
        "original_info": {
          "focus_rects": [],
          "height": 2160,
          "width": 3148
        },
        "sizes": {
          "large": {
            "h": 1405,
            "w": 2048
          }
        },
        "type": "video",
        "url": "https://t.co/dE8FrYseJk",
        "video_info": {
          "aspect_ratio": [
            787,
            540
          ],
          "duration_millis": 20000,
          "variants": [
            {
              "content_type": "application/x-mpegURL",
              "url": "https://video.twimg.com/amplify_video/2045165548422791168/pl/gMQvVVLPkM-Qd3hR.m3u8?tag=21"
            },
            {
              "bitrate": 256000,
              "content_type": "video/mp4",
              "url": "https://video.twimg.com/amplify_video/2045165548422791168/vid/avc1/392x270/heXkDrMSKAsT5Ofk.mp4?tag=21"
            },
            {
              "bitrate": 832000,
              "content_type": "video/mp4",
              "url": "https://video.twimg.com/amplify_video/2045165548422791168/vid/avc1/524x360/ZqW7i4FrehFVYxnC.mp4?tag=21"
            },
            {
              "bitrate": 2176000,
              "content_type": "video/mp4",
              "url": "https://video.twimg.com/amplify_video/2045165548422791168/vid/avc1/1048x720/N2JAXFNokcNAA4jw.mp4?tag=21"
            },
            {
              "bitrate": 10368000,
              "content_type": "video/mp4",
              "url": "https://video.twimg.com/amplify_video/2045165548422791168/vid/avc1/1574x1080/9wD41-rH3BbhmzMd.mp4?tag=21"
            },
            {
              "bitrate": 25128000,
              "content_type": "video/mp4",
              "url": "https://video.twimg.com/amplify_video/2045165548422791168/vid/avc1/3148x2160/1oLuPQGxQbBBXfLK.mp4?tag=21"
            }
          ]
        }
      }
    ]
  },
  "card": null,
  "place": {},
  "entities": {
    "hashtags": [],
    "symbols": [],
    "timestamps": [],
    "urls": [],
    "user_mentions": []
  },
  "quoted_tweet": null,
  "retweeted_tweet": null,
  "isLimitedReply": false,
  "communityInfo": null,
  "article": null
}