Your curated collection of saved posts and media
Interesting trend: models have been getting a lot more aligned over the course of 2025. The fraction of misaligned behavior found by automated auditing has been going down not just at Anthropic but for GDM and OpenAI as well. https://t.co/8DYm9SP7wF
Check out this open source implementation by @kaifronsdal (who supplied the data for this plot), @sleepinyourhat, and many others https://t.co/HDmaJ480Bp
@jankulveit @kaifronsdal @sleepinyourhat yeah this does not include every category. It's the "concerning" axes. This should be the prompt for the judge model: https://t.co/Ow5CqG4KEH
1 right 2 wrong https://t.co/TZ7KhnHQIN

Research-level mathematics draws on advanced techniques from vast literature, with papers often spanning dozens of pages. While foundation models possess a large knowledge base from pretraining, their understanding of advanced subjects remains superficial due to data scarcity, and they are also prone to hallucinations. As such, in the first paper, "Towards Autonomous Mathematics Research", we built #Aletheia (ancient Greek word for "Truth"), a math research agent, that can iteratively generate, verify, and revise solutions end-to-end in natural language. Link to the paper: https://t.co/8mqzYEbhjZ (to be on arXiv soon!) There are 3 main sources that power Aletheia ...

cool new model https://t.co/Va84RqVehy

Thrilled to share: #Aletheia, our math research agent, just solved 6/10 notoriously hard FirstProof problems autonomously, the best result in the inaugural challenge! To me, this is even bigger than our historic IMO-gold achievement last year; these problems challenge even top mathematicians. We share our results transparently, see paper and full thoughts in the thread. π
We ran our internal system Aletheia (Deep Think) on FirstProofβs research problems during the week they were released. Aletheia returned solutions to problems 2, 5, 7, 8, 9, and 10. We think thereβs a pretty good chance they are correct, based on expert analysis. https://t.co/7lC8RmDVx1

https://t.co/BihEy9UYik
"Were you nervous having Sam Altman on the podcast?" "No, he's my brother" π https://t.co/0zyu68ViJv
A statement from Anthropic CEO, Dario Amodei, on our discussions with the Department of War. https://t.co/rM77LJejuk
Thank you for your attention to this matter. cc: @AnthropicAI @DarioAmodei https://t.co/FLCByLHF73

A statement on the comments from Secretary of War Pete Hegseth. https://t.co/Gg7Zb09IMR
"We will challenge any supply chain risk designation in court" - Anthropic They are saying Department of War cannot restrict customers' use of Claude outside of Dep of War contract work. https://t.co/3FDsXmmcZi
huh, why hasn't Wikipedia been updated in more than 2 years on HuggingFace? https://t.co/DxbO8RpM6G
Marco Rubio finding out he has to run Anthropic now too. https://t.co/Ffc5jsvzLi
I know we donβt do facts anymore, but hereβs the "dangerous and collapsing" EU that Elon Musk and MAGA influencers keep warning you about. https://t.co/RZrT5p9p9D
This has been the plan all along. - Foment violence and chaos in the streets of MN - Implement the Insurrection Act - Declare Martial Law - Suspend elections https://t.co/Ky3Jf3awqc
As a three-time combat veteran, I get pretty damn hot when a five-time draft dodger like @realDonaldTrump pounds his chest and bangs the war drums. America is over it. No more sending our sons & daughters to fight for oil. https://t.co/sgyXKme3h3
Whereβs the outrage? https://t.co/Hd0Br38yOF
Whereβs the outrage? https://t.co/Hd0Br38yOF
Le siΓ¨ge social dβAMI Labs, la sociΓ©tΓ© de β¦Yann Le Cunβ©, qui a longtemps Γ©tΓ© chief scientist de Meta (Facebook, Whatsapp et Instagram). sera Γ Paris. Le chercheur β¦β¦@ylecunβ©, qui a dΓ©cidΓ©ment de top lecturesπ, reste professeur Γ β¦β¦β¦β¦@nyuniversityβ© https://t.co/g7cAHoF7zw
We told you the Venezuela invasion was just corruption. It took one whole week to get the proof. Trump took Venezuela's oil at gunpoint, and gave it to one of his biggest campaign donors. 1/ But when you learn the details, it's even worse. A shortπ§΅on this corruption story. https://t.co/ZExM5S89VK
Nostalgia has tricked people into thinking the 1990s and early 2000s were a time of cutesy and comfortable digital disruption. https://t.co/yPMO0Ab3Ga
