@HelloSurgeAI
"Prognosticative pastry." "A hound circling a tree, nose to bark." Believe it or not, those quotes aren't jokes. They're real outputs from SOTA models! And many leaderboards are rewarding this kind of slop with top rankings.

To fix the broken state of AI evaluation, we're launching *Hemingway-bench*: a new writing leaderboard, designed for nuance and impact. Not two-second vibes and fluff.

Explore the data and the full leaderboard here (congrats to Gemini and Claude on the top positions!):

Leaderboard: https://t.co/iNV6LUB2QE
Deep Dive Blog: https://t.co/1qII9lQwKu