@siegelz_
New results on CORE-Bench are in! The new Claude Sonnet outperforms all other models with our CORE-Agent: - Claude 3.5 Sonnet: 37.8% Pass@1 - o1-mini: 24.4% Pass@1 - Previous SOTA (gpt-4o): 21.5% Pass@ https://t.co/d5SkV18ccq
Viewing enriched Twitter post
New results on CORE-Bench are in! The new Claude Sonnet outperforms all other models with our CORE-Agent: - Claude 3.5 Sonnet: 37.8% Pass@1 - o1-mini: 24.4% Pass@1 - Previous SOTA (gpt-4o): 21.5% Pass@ https://t.co/d5SkV18ccq
{
"user": {
"created_at": "2023-07-03T01:35:53.000Z",
"default_profile_image": false,
"description": "@PrincetonCS '25",
"fast_followers_count": 0,
"favourites_count": 33,
"followers_count": 64,
"friends_count": 107,
"has_custom_timelines": false,
"is_translator": false,
"listed_count": 3,
"location": "Princeton, NJ",
"media_count": 5,
"name": "Zachary Siegel",
"normal_followers_count": 64,
"possibly_sensitive": false,
"profile_image_url_https": "https://pbs.twimg.com/profile_images/1843450600018546688/U4eS2LeX_normal.jpg",
"screen_name": "siegelz_",
"statuses_count": 22,
"translator_type": "none",
"url": "https://t.co/WBwL9XmTC2",
"verified": false,
"withheld_in_countries": [],
"id_str": "1675679597944356869"
},
"id": "1858551617139912971",
"conversation_id": "1858551617139912971",
"full_text": "New results on CORE-Bench are in! The new Claude Sonnet outperforms all other models with our CORE-Agent:\n- Claude 3.5 Sonnet: 37.8% Pass@1\n- o1-mini: 24.4% Pass@1\n- Previous SOTA (gpt-4o): 21.5% Pass@ https://t.co/d5SkV18ccq",
"reply_count": 2,
"retweet_count": 2,
"favorite_count": 9,
"hashtags": [],
"symbols": [],
"user_mentions": [],
"urls": [],
"media": [
{
"media_url": "https://pbs.twimg.com/media/GcrmgR7bkAALbGC.jpg",
"type": "photo"
}
],
"url": "https://twitter.com/siegelz_/status/1858551617139912971",
"created_at": "2024-11-18T16:43:30.000Z",
"#sort_index": "1858551617139912971",
"view_count": 5211,
"quote_count": 2,
"is_quote_tweet": false,
"is_retweet": false,
"is_pinned": false,
"is_truncated": false,
"startUrl": "https://x.com/siegelz_/status/1858551617139912971"
}