G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔54102174

@ryanbrewer https://t.co/Kkl7TbMLA0

@gerardsans • Fri Jun 05 21:48

AI is reaching crypto-like hate levels.

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔32585200

@business https://t.co/Kkl7TbMLA0

@gerardsans • Fri Jun 05 21:48

AI is reaching crypto-like hate levels.

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔25241531

@Polymarket https://t.co/Kkl7TbMLA0

@gerardsans • Fri Jun 05 21:48

AI is reaching crypto-like hate levels.

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔39165478

@ClaudeDevs https://t.co/Kkl7TbMLA0

@gerardsans • Fri Jun 05 21:48

AI is reaching crypto-like hate levels.

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔19778790

@bridgemindai https://t.co/Kkl7TbMLA0

@gerardsans • Fri Jun 05 21:48

AI is reaching crypto-like hate levels.

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔31315972

@TheEconomist https://t.co/Kkl7TbMLA0

@gerardsans • Fri Jun 05 21:48

AI is reaching crypto-like hate levels.

🖼️ Media

View Details View on X ↗

G

github

@github

📅

Jun 16, 2026

25d ago

🆔94704531

What does a real workday using the GitHub Copilot app look like? 🤔 We sat down with @pierceboggan to see how he uses the Copilot app to focus on the work that matters and carry it from issue to merge. https://t.co/En7wXo1zS9

🖼️ Media

View Details View on X ↗

G

github

@github

📅

Jun 16, 2026

25d ago

🆔20259080

Try out the GitHub Copilot app yourself. ⬇️ https://t.co/SiEBSwHJJg

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔63087172

@kinopee_ai https://t.co/Kkl7TbMLA0

@gerardsans • Fri Jun 05 21:48

AI is reaching crypto-like hate levels.

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔94435540

@arstechnica https://t.co/Kkl7TbMLA0

@gerardsans • Fri Jun 05 21:48

AI is reaching crypto-like hate levels.

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔25192027

@business https://t.co/Kkl7TbMLA0

@gerardsans • Fri Jun 05 21:48

AI is reaching crypto-like hate levels.

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔09875388

@TheEconomist https://t.co/Kkl7TbMLA0

@gerardsans • Fri Jun 05 21:48

AI is reaching crypto-like hate levels.

🖼️ Media

View Details View on X ↗

A

ArtificialAnlys

@ArtificialAnlys

📅

Jun 16, 2026

26d ago

🆔16544998

GDPval-AA v2 is the highest weighted evaluation in the Intelligence Index v4.1. The upgrade re-baselines ELO to human performance at 1000, introduces a rotating panel of frontier-model judges, and raises the turn limit from 100 to 250 for longer-horizon agent trajectories. Claude Fable 5 (with fallback) leads at 1818, followed by Claude Opus 4.8 (1638). GPT-5.5 (xhigh) scores 1531. Claude Fable 5 is not currently available for use

🖼️ Media

View Details View on X ↗

E

emollick

@emollick

📅

Jun 16, 2026

25d ago

🆔07427534

This was not a good benchmark before it was updated and it is not a good benchmark now. Having AIs evaluate the work of other AIs on publicly available questions from a different closed benchmark doesn’t tell you very much. And it is unclear how they establish the human ELO. https://t.co/tmZG8MDkRA

@ArtificialAnlys • Tue Jun 16 01:51

GDPval-AA v2 is the highest weighted evaluation in the Intelligence Index v4.1. The upgrade re-baselines ELO to human performance at 1000, introduces a rotating panel of frontier-model judges, and raises the turn limit from 100 to 250 for longer-horizon agent trajectories. Claude

🖼️ Media

View Details View on X ↗

A

AlexFrommeyer

@AlexFrommeyer

📅

Jun 16, 2026

25d ago

🆔66003630

why wouldn't Elon take just a tiny slice of the capital he just raised, let's say $2B, and buy more #BTC at these depressed levels? @elonmusk https://t.co/Emf4vNDWE2

🖼️ Media

View Details View on X ↗

E

EvanKirstel

@EvanKirstel

📅

Jun 16, 2026

25d ago

🆔89030085

@GaryMarcus https://t.co/1jDUxuv8u6

🖼️ Media

View Details View on X ↗

J

jxnlco

@jxnlco

📅

Jun 16, 2026

25d ago

🆔03227437

did you get your tickets for AI Engineer SF? I'll be giving a talk (and following tibo) as well as a workshop on setting yourself up with success with codex https://t.co/Cus3e75cUM

🖼️ Media

View Details View on X ↗

J

jxnlco

@jxnlco

📅

Jun 16, 2026

25d ago

🆔33188811

come say hi, get your tickets here https://t.co/5hy51RFSGh

🖼️ Media

View Details View on X ↗

K

KevinQHLin

@KevinQHLin

📅

Jun 16, 2026

25d ago

🆔17550309

thanks @_akhaliq for sharing our Data2Story! 🔮Turn the ‘Humanity's Last Exam’ dataset into a generative blog. Explore more agent-generated stories here https://t.co/TIIMWBamyU https://t.co/FU6uAqKdgV

@_akhaliq • Tue Jun 16 21:10

Data Journalist Agent Transforming Data into Verifiable Multimodal Stories https://t.co/11SNrYxNyp

🖼️ Media

View Details View on X ↗

J

jxnlco

@jxnlco

📅

Jun 16, 2026

25d ago

🆔03501294

if you want clarity on what the differences are https://t.co/Iw60pdzZje

@jxnlco • Tue Jun 16 19:45

https://t.co/e9PrQGAqZT

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔51327634

@kinopee_ai Update: a new benchmark shows [Opus x Fable] gap may be better explained by prompting not weights: https://t.co/0v5qNCnVrl

@gerardsans • Tue Jun 16 12:28

🚨 New @ValsAI benchmark confirms Claude Fable little secret How is Opus-through-Fable outperforming vanilla Opus? Simple. The model isn’t the star here. It’s the *agentic loop* doing all the work. Anthropic didn’t tell anyone. Full analysis: https://t.co/6tYxEWw67H

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔84092805

@Alfred_Lin Update: a new benchmark shows [Opus x Fable] gap may be better explained by prompting not weights: https://t.co/0v5qNCnVrl

@gerardsans • Tue Jun 16 12:28

🚨 New @ValsAI benchmark confirms Claude Fable little secret How is Opus-through-Fable outperforming vanilla Opus? Simple. The model isn’t the star here. It’s the *agentic loop* doing all the work. Anthropic didn’t tell anyone. Full analysis: https://t.co/6tYxEWw67H

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔21222590

@Polymarket Update: a new benchmark shows [Opus x Fable] gap may be better explained by prompting not weights: https://t.co/0v5qNCnVrl

@gerardsans • Tue Jun 16 12:28

🚨 New @ValsAI benchmark confirms Claude Fable little secret How is Opus-through-Fable outperforming vanilla Opus? Simple. The model isn’t the star here. It’s the *agentic loop* doing all the work. Anthropic didn’t tell anyone. Full analysis: https://t.co/6tYxEWw67H

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔56058399

@NewYorker Update: a new benchmark shows [Opus x Fable] gap may be better explained by prompting not weights: https://t.co/0v5qNCnVrl

@gerardsans • Tue Jun 16 12:28

🚨 New @ValsAI benchmark confirms Claude Fable little secret How is Opus-through-Fable outperforming vanilla Opus? Simple. The model isn’t the star here. It’s the *agentic loop* doing all the work. Anthropic didn’t tell anyone. Full analysis: https://t.co/6tYxEWw67H

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔26385629

@FT Update: a new benchmark shows [Opus x Fable] gap may be better explained by prompting not weights: https://t.co/0v5qNCnVrl

@gerardsans • Tue Jun 16 12:28

🚨 New @ValsAI benchmark confirms Claude Fable little secret How is Opus-through-Fable outperforming vanilla Opus? Simple. The model isn’t the star here. It’s the *agentic loop* doing all the work. Anthropic didn’t tell anyone. Full analysis: https://t.co/6tYxEWw67H

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔31707896

@yuno_miyako2 Update: a new benchmark shows [Opus x Fable] gap may be better explained by prompting not weights: https://t.co/0v5qNCnVrl

@gerardsans • Tue Jun 16 12:28

🚨 New @ValsAI benchmark confirms Claude Fable little secret How is Opus-through-Fable outperforming vanilla Opus? Simple. The model isn’t the star here. It’s the *agentic loop* doing all the work. Anthropic didn’t tell anyone. Full analysis: https://t.co/6tYxEWw67H

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔72234147

@suna_gaku Update: a new benchmark shows [Opus x Fable] gap may be better explained by prompting not weights: https://t.co/0v5qNCnVrl

@gerardsans • Tue Jun 16 12:28

🚨 New @ValsAI benchmark confirms Claude Fable little secret How is Opus-through-Fable outperforming vanilla Opus? Simple. The model isn’t the star here. It’s the *agentic loop* doing all the work. Anthropic didn’t tell anyone. Full analysis: https://t.co/6tYxEWw67H

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔28044298

@super_bonochin Update: a new benchmark shows [Opus x Fable] gap may be better explained by prompting not weights: https://t.co/0v5qNCnVrl

@gerardsans • Tue Jun 16 12:28

🚨 New @ValsAI benchmark confirms Claude Fable little secret How is Opus-through-Fable outperforming vanilla Opus? Simple. The model isn’t the star here. It’s the *agentic loop* doing all the work. Anthropic didn’t tell anyone. Full analysis: https://t.co/6tYxEWw67H

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔67518033

@nemumusitocha Update: a new benchmark shows [Opus x Fable] gap may be better explained by prompting not weights: https://t.co/0v5qNCnVrl

@gerardsans • Tue Jun 16 12:28

🚨 New @ValsAI benchmark confirms Claude Fable little secret How is Opus-through-Fable outperforming vanilla Opus? Simple. The model isn’t the star here. It’s the *agentic loop* doing all the work. Anthropic didn’t tell anyone. Full analysis: https://t.co/6tYxEWw67H

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔68037246

@azukiazusa9 Update: a new benchmark shows [Opus x Fable] gap may be better explained by prompting not weights: https://t.co/0v5qNCnVrl

@gerardsans • Tue Jun 16 12:28

🚨 New @ValsAI benchmark confirms Claude Fable little secret How is Opus-through-Fable outperforming vanilla Opus? Simple. The model isn’t the star here. It’s the *agentic loop* doing all the work. Anthropic didn’t tell anyone. Full analysis: https://t.co/6tYxEWw67H

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔43923899

@allegrajacchia Update: a new benchmark shows [Opus x Fable] gap may be better explained by prompting not weights: https://t.co/0v5qNCnVrl

@gerardsans • Tue Jun 16 12:28

🚨 New @ValsAI benchmark confirms Claude Fable little secret How is Opus-through-Fable outperforming vanilla Opus? Simple. The model isn’t the star here. It’s the *agentic loop* doing all the work. Anthropic didn’t tell anyone. Full analysis: https://t.co/6tYxEWw67H

🖼️ Media

View Details View on X ↗

G

gerardsans

@gerardsans

📅

Jun 16, 2026

25d ago

🆔31516036

@Reuters Update: a new benchmark shows [Opus x Fable] gap may be better explained by prompting not weights: https://t.co/0v5qNCnVrl

@gerardsans • Tue Jun 16 12:28

🚨 New @ValsAI benchmark confirms Claude Fable little secret How is Opus-through-Fable outperforming vanilla Opus? Simple. The model isn’t the star here. It’s the *agentic loop* doing all the work. Anthropic didn’t tell anyone. Full analysis: https://t.co/6tYxEWw67H

🖼️ Media

View Details View on X ↗