Your curated collection of saved posts and media

Showing 32 posts Β· last 14 days Β· by score
S
SpencerHakimian
@SpencerHakimian
πŸ“…
Sep 12, 2025
225d ago
πŸ†”55826907

Daily reminder that gun deaths are completely a choice. Nobody dies from guns in the rest of the functioning world. It’s like dying from a fever. An ancient plague that we have solved as a species and moved on from. Except if you live in the United States. https://t.co/ZdKsMdwJM7

Media 1
πŸ–ΌοΈ Media
S
ShashwatGoel7
@ShashwatGoel7
πŸ“…
Sep 12, 2025
225d ago
πŸ†”68637972

Paper fresh of the press: The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs. Are small models the future of agentic AI? Is scaling LLM compute not worth the cost due to diminishing returns? Are autoregressive LLMs doomed, and thinking an illusion? The bear cases for LLM scaling are all connected to a single capability: Long Horizon Execution. However, thats exactly why you should be bullish on scaling model size, and test-time compute! > First, remember the METR plot? It might be explained by @ylecun 's model of compounding errors > the horizon length of a model grows super-exponentially (@DaveShapi) in single step accuracy. > Upshot 1: Don't be fooled by slowing progress on typical short-task benchmarks > that is enough for exponential growth in horizon length. But we go beyond @ylecun's model, testing LLMs empirically... > Just execution is also hard for LLMs, even when you provide them the needed plan and knowledge. > We should not misinterpret execution failures as an inability to "reason". > Even when a small model has 100% single-step accuracy, larger models can execute far more turns above a success rate threshold. > Noticed how your agent performs worse as the task gets longer? Its not just long-context limitations.. > We observe: The Self-Conditioning Effect! > When models see errors they made earlier in their history, they become more likely to make errors in future turns. > Increasing model size worsens this problem - a rare case of inverse scaling! So what about thinking...? > Thinking is not an illusion. It is the engine for execution! > Where even DeepSeek v3, Kimi K2 fail to execute even 5 turns latently when asked to execute without CoT... > With CoT, they can do 10x more. So what about the frontier? > GPT-5 Thinking is far ahead of all other models we tested. It can execute 1000+ step tasks in one go. > At second with 432 steps is Claude 4 Sonnet... and then Grok-4 at 384 > Gemini 2.5 Pro and DeepSeek R1 lag far behind, at just 120. > Is that why GPT-5 was codenamed Horizon? πŸ€” > Open-source has a long ;) way to go! > Let's grow it together! We release all code and data. We did a longggg deep dive, and present you the best takeaways with awesome plots below πŸ‘‡

Media 1
πŸ–ΌοΈ Media
I
IAmPoliticsGirl
@IAmPoliticsGirl
πŸ“…
Sep 12, 2025
225d ago
πŸ†”33174501

I have something to say about the Charlie Kirk murder. https://t.co/N3P7crTKBH

πŸ–ΌοΈ Media
πŸ”ylecun retweeted
I
PoliticsGirl
@IAmPoliticsGirl
πŸ“…
Sep 12, 2025
225d ago
πŸ†”33174501

I have something to say about the Charlie Kirk murder. https://t.co/N3P7crTKBH

❀️26,283
likes
πŸ”8,832
retweets
πŸ–ΌοΈ Media
R
rao2z
@rao2z
πŸ“…
Sep 13, 2025
224d ago
πŸ†”39768982

In the year since LRMs ("reasoning models") hit the scene, we have been trying to understand, analyze and demystify them.. Here are our efforts to date--conveniently all in one place (First..) Evaluation of LRMs on Planning πŸ“œhttps://t.co/uu3aZzVSX4 (9/24)& πŸ“œhttps://t.co/dRg7qa3uoz (TMLR) 🧡 https://t.co/CvHuWhlKNj Semantics of Intermediate Tokens (CoT's) Study on Mazes: πŸ“œhttps://t.co/4LGWfiCZ5e 🧡 https://t.co/y3BthniqSG Study on CoTemp Q&A: πŸ“œ https://t.co/Cnlb96mqKd 🧡 https://t.co/CaeVu0ex46 Analysis of RL on LRMs πŸ“œhttps://t.co/021pXx842x 🧡https://t.co/XbqAyJIyB4 Interpretability of Intermediate tokens πŸ“œhttps://t.co/e2J5pQLhGj 🧡https://t.co/74FSZvQ7c2 Intermediate tokens and problem complexity πŸ“œhttps://t.co/C5y772QIue 🧡 https://t.co/UKgCwgHKeQ Perspective on LRMs πŸ“œhttps://t.co/Skv2WIKyZY (also at https://t.co/d2fIIX82NT) (Position against anthropomorphization of Intermediate Tokens) πŸ“œhttps://t.co/4f5eg5vRnA 🧡https://t.co/f6E3c2j4dm Relevant recent talks https://t.co/6lyhPLYVcY TBC..

@rao2z β€’ 2025-08-06T16:36

[cost] The improved performance of LRM o1 however comes at considerably higher time/compute/cost compared to both LLMs (o1 costs 42$ compared to 1.8$ for GPT4) & normal planners (e.g. FD) that get 100% within a tiny fraction of time on local desktop (see the tables πŸ‘† & πŸ‘‡). As we

Media 1Media 2
πŸ–ΌοΈ Media
S
shashj
@shashj
πŸ“…
Sep 13, 2025
224d ago
πŸ†”62175868

A data-driven look at political violence in America. β€œA separate tally by the Anti-Defamation League, an advocacy group, shows that 76% of extremist-related murders over the past decade were committed by those on the right. Such tallies, however, depend on how extremism is defined and how ideology is assigned.” https://t.co/mAfQsGdeEc

Media 1
πŸ–ΌοΈ Media
R
ralifromparis
@ralifromparis
πŸ“…
Sep 13, 2025
224d ago
πŸ†”84417933

πŸ‡ΊπŸ‡Έ Cette stat est dingue. Jamais CNews n’en parlera. Mais en gros, l’énorme majoritΓ© du terrorisme US est d’extrΓͺme droite … https://t.co/PRouXFOeqS

Media 1
πŸ–ΌοΈ Media
S
SpencerHakimian
@SpencerHakimian
πŸ“…
Sep 14, 2025
223d ago
πŸ†”29690743

It’s not the video games. https://t.co/0JF0VUVI7R

Media 1
πŸ–ΌοΈ Media
πŸ”ylecun retweeted
S
Spencer Hakimian
@SpencerHakimian
πŸ“…
Sep 14, 2025
223d ago
πŸ†”29690743

It’s not the video games. https://t.co/0JF0VUVI7R

Media 1
❀️25,609
likes
πŸ”4,245
retweets
πŸ–ΌοΈ Media
S
steeve
@steeve
πŸ“…
Sep 13, 2025
224d ago
πŸ†”29400752

We upgraded our workbenches machines to ThreadRippers because we needed more PCI lanes to the NIC. If you come to see my talk @aiDotEngineer on Sep 24th, you might get why https://t.co/Bcb67UcaIU

Media 1
πŸ–ΌοΈ Media
R
rohanpaul_ai
@rohanpaul_ai
πŸ“…
Sep 14, 2025
223d ago
πŸ†”01408764

New @AIatMeta builds a vision language world model that turns videos into text plans and reasons to pick better actions. 27% higher Elo for system-2 planning over system-1. The gap it tackles, agents must predict how actions change the world rather than only label frames. VLWM, the Vision Language World Model, represents the hidden state in plain language, predicting a goal and interleaved actions with their state changes. Training targets come from a Tree of Captions that compresses each video, then an LLM refines them into goals and state updates. The model jointly learns a policy to propose the next action and a dynamics model to predict the next state. In fast mode it completes the plan text left to right, which is quick but can lock in early mistakes. In reflective mode it searches candidate plans, rolls out futures, and picks the lowest cost path. The critic that supplies this cost is trained without labels by ranking valid progress below distractors or shuffled steps. Across planning benchmarks and human head to head comparisons, reflective search produces cleaner, more reliable plans. ---- Paper – arxiv. org/abs/2509.02722 Paper Title: "Planning with Reasoning using Vision Language World Model"

Media 1
πŸ–ΌοΈ Media
Z
ZhugeEX
@ZhugeEX
πŸ“…
Sep 14, 2025
223d ago
πŸ†”62851555

I don't understand why we keep having this conversation when the data is clear. https://t.co/YrTOHCCzQZ

@Slasher β€’ Sun Sep 14 01:37

Fox News hosts laying the blame on gamers. FBI agents on CNN saying violent video games caused this. a Kennedy wants the government to research links to mass shooters and first person shooters. it is 1998

Media 1
πŸ–ΌοΈ Media
πŸ”ylecun retweeted
Z
Daniel Ahmad
@ZhugeEX
πŸ“…
Sep 14, 2025
223d ago
πŸ†”62851555

I don't understand why we keep having this conversation when the data is clear. https://t.co/YrTOHCCzQZ

Media 1
❀️112,681
likes
πŸ”13,699
retweets
πŸ–ΌοΈ Media
D
DisavowTrump20
@DisavowTrump20
πŸ“…
Sep 15, 2025
222d ago
πŸ†”74576238

This is journalist Karen Attiah, the only Black opinion columnist at the Washington Post. She was just fired for sharing a statement made by Charlie Kirk insulting Black women. RETWEET if you stand with @KarenAttiah! https://t.co/VJdQtakaOI

Media 1
πŸ–ΌοΈ Media
J
JacksonAtkinsX
@JacksonAtkinsX
πŸ“…
Sep 14, 2025
223d ago
πŸ†”78350342

Meta just made training AI agents 25x faster. This is a breakthrough for robotics and complex planning. Meta's FAIR open sourced a new method called Scalable Option Learning. It trains a specialized agent at the scale previously seen only with LLMs. Here's how it works: The reason this type of AI (Agents trained with Hierarchical Reinforcement Learning) has been slow to train is a parallelization bottleneck. Imagine an AI team with a planner and many specialist workers (the sub-tasks). Older methods struggled because they had to process each planner's decision one-by-one before training the workers. SOL solves this with a new system design: A Single, Unified Brain: Instead of separate models, it uses a single actor-critic network to house the planner (controller policy) and all the workers (option policies). A Digital "Switch": It tells this unified brain which role to play at any given moment using a one-hot vector, a flag that says, "for this input, act as the 'navigation' worker." This allows thousands of different decisions for different policies to be batched and sent to the GPU at once. A Smart "Filter" for Learning: After the actions are taken, it uses a technique called tensorized masking. Think of this as a smart filter that ensures the right performance feedback (the rewards and advantages) goes to the correct worker policy. This is what breaks the one-at-a-time update problem. This architecture allows the entire hierarchical system to learn in parallel batches and removes the bottlenecks that held the field back. Why this matters: This new training method changes the viability of building agents that can reason and execute long-horizon tasks. - Business Leaders: This architecture is a key to developing sophisticated autonomous systems. A 25x faster training cycle accelerates R&D in robotics, logistics, and multi-stage process automation, making complex, strategic AI commercially achievable. - Practitioners: The authors plan to open-source SOL. You can implement agents that learn long-horizon skills without the performance penalty of older HRL methods, creating a path to more structured and potentially more robust models. - Researchers: This paper presents a validated solution to the HRL scaling problem (Section 3.2). The system for enabling high-throughput, asynchronous updates for a hierarchical agent is a major contribution that opens the door for large-scale experiments in temporal abstraction and credit assignment.

Media 1
πŸ–ΌοΈ Media
A
alxndrdavies
@alxndrdavies
πŸ“…
Sep 12, 2025
225d ago
πŸ†”66001801

Excited to share details on two of our longest running and most effective safeguard collaborations, one with Anthropic and one with OpenAI. We've identifiedβ€”and they've patchedβ€”a large number of vulnerabilities and together strengthened their safeguards. 🧡 1/6 https://t.co/GD39MAHjXW

Media 1Media 2
πŸ–ΌοΈ Media
C
CollinRugg
@CollinRugg
πŸ“…
Sep 10, 2025
227d ago
πŸ†”22987478

Charlie Kirk has passed away at the age of 31. A husband, a father of two, and a man of God. He completely reshaped our country and had so much ahead of him. Gut-wrenching. Rest in peace, Charlie. https://t.co/IKAiHAKN5c

πŸ–ΌοΈ Media
T
TulsiGabbard
@TulsiGabbard
πŸ“…
Sep 10, 2025
227d ago
πŸ†”87729590

Please pray for my friend Charlie Kirk. My heart is with him, his wife and children during this critical time πŸ™πŸ½ https://t.co/c6KYZPUyso

Media 1
πŸ–ΌοΈ Media
D
danwootton
@danwootton
πŸ“…
Sep 10, 2025
227d ago
πŸ†”19640185

CHARLIE KIRK SHOT DEAD AT 31. A GREAT AMERICAN, IMPORTANT ALLY OF BRITAIN, FUTURE PRESIDENT, HUSBAND AND FATHER. Patriots are not your enemy. This man was fighting to save the UK and the West. God bless America. God help us all. We cannot go on like this. πŸ™πŸ‡ΊπŸ‡Έ

Media 1
πŸ–ΌοΈ Media
W
WhiteHouse
@WhiteHouse
πŸ“…
Sep 10, 2025
227d ago
πŸ†”59470994

"The Great, and even Legendary, Charlie Kirk, is dead. No one understood or had the Heart of the Youth in the United States of America better than Charlie. He was loved and admired by ALL, especially me, and now, he is no longer with us. Melania and my Sympathies go out to his beautiful wife Erika, and family. Charlie, we love you!" - President Donald J. Trump

Media 1
πŸ–ΌοΈ Media
R
RobertKennedyJr
@RobertKennedyJr
πŸ“…
Sep 10, 2025
227d ago
πŸ†”19939701

Once again, a bullet has silenced the most eloquent truth teller of an era. My dear friend Charlie Kirk was our country's relentless and courageous crusader for free speech. We pray for Erika and the children. Charlie is already in paradise with the angels. We ask his prayers for our country.

Media 1
πŸ–ΌοΈ Media
T
TRobinsonNewEra
@TRobinsonNewEra
πŸ“…
Sep 10, 2025
227d ago
πŸ†”63351963

Charlie Kirk gave everyone a voice, gave everyone a chance to speak, open debates, all ideas, all races, religions, welcome. He was one of the good guys. And they still shot him. Truly evil. https://t.co/nSBX3wZFfR

πŸ–ΌοΈ Media
C
constellationr
@constellationr
πŸ“…
Sep 11, 2025
226d ago
πŸ†”36838383

🚨 Big news! The 2025-2026 #AI150 list is here β€” celebrating the most visionary AI leaders shaping the future. Meet the changemakers πŸ‘‰ https://t.co/wt1picGqpl Join us at #AIForum25 on Sept 30! πŸ‘‰ https://t.co/SdaX3dOAFA @rwang0 @MMinevich https://t.co/Y3jXJnC12W

Media 1Media 2
πŸ–ΌοΈ Media
S
SamanthaTaghoy
@SamanthaTaghoy
πŸ“…
Sep 10, 2025
227d ago
πŸ†”58768153

I lost my father in tragic circumstances when I was around the same age as Charlie Kirk’s daughter. And I can say with absolute certainty that there is no pain like losing your dad. To know that he will never get to see you grow up. To know you will never be able to hug him again, or sit in his lap, or be held in his arms. That loss follows you everywhere. It shows up at graduations, at birthdays, on Christmas mornings. It lingers in the empty chair at family dinners. It cuts deepest during those ordinary, quiet moments when you wish you could just pick up the phone and hear his voice. The world was robbed of Charlie Kirk. But, most tragically, his family was robbed of him. From one fatherless daughter to another, my heart aches for his little girl. I would not wish this kind of pain on anyone. May she find comfort and strength in the knowledge that he is safe in God’s Kingdom, still loving and protecting them from above.

Media 1
πŸ–ΌοΈ Media
M
MarioNawfal
@MarioNawfal
πŸ“…
Sep 12, 2025
225d ago
πŸ†”78203665

πŸ‡ͺπŸ‡Ί DEADLY DRUG-RESISTANT FUNGUS SPREADING FAST IN EUROPEAN HOSPITALS C. auris cases surged 67% in 2023, hitting a record 1,346 - up from zero a decade ago. The fungus clings to hospital surfaces, resists treatment, and can kill up to 60% of infected patients. It’s now so widespread in countries like Greece, Italy, and Spain that outbreaks can’t even be traced. The ECDC says early detection and isolation still work, but only 17 countries even track it properly. On top of all that, no one’s writing checks to develop the drugs we actually need. Source: Bloomberg

Media 1
πŸ–ΌοΈ Media
E
e_durneika
@e_durneika
πŸ“…
Sep 13, 2025
224d ago
πŸ†”70594068

My latest in @RedState on Beijing’s Sept. 3 Victory Day Parade and the actual message it sent about the regime and its partnerships. Link in comments. πŸ‘‡ https://t.co/UmvJhK7YFt

Media 1
πŸ–ΌοΈ Media
O
observer
@observer
πŸ“…
Sep 17, 2025
220d ago
πŸ†”73649098

This year’s A.I. Power Index chronicles 100 leaders navigating trillion-dollar infrastructure investments, geopolitical competition and the tension between moving fast and moving responsibly. Explore the people and companies driving A.I. influenceβ€”for better or worse: https://t.co/uqclUNLXgi

Media 1
πŸ–ΌοΈ Media
M
MMinevich
@MMinevich
πŸ“…
Sep 18, 2025
220d ago
πŸ†”19508765

Observer’s 2025 A.I. Power Index is out πŸš€ Proud to be named among the leaders defining the future of AI. This is just the beginning. @observer πŸ”— https://t.co/r3D53aIquz #ObserverPowerIndex #AI https://t.co/339TQxA4gB

Media 1Media 2
+3 more
πŸ–ΌοΈ Media
B
briannekimmel
@briannekimmel
πŸ“…
Nov 02, 2018
2731d ago
πŸ†”92238337

Working on a research piece on the history of vocational schools in America. Why? By 2020, there will be 1.4M more software dev jobs than applicants who can fill them. https://t.co/lTBcYZAKaL

Media 1
πŸ–ΌοΈ Media
B
briannekimmel
@briannekimmel
πŸ“…
Sep 07, 2025
230d ago
πŸ†”28850575

My birthday is next week, so the group chat has been non stop β€œyou’re such a Virgo” followed by a bunch of insults, backhanded compliments, and stupid things I completely forgot about. So, I casually dropped this and left the chat… https://t.co/m7wCVczRoD

Media 1
πŸ–ΌοΈ Media
B
briannekimmel
@briannekimmel
πŸ“…
Sep 07, 2025
230d ago
πŸ†”48091999

The US job market is being propped up primarily by ongoing employment gains in the health care industry. https://t.co/Tot4An6FnL

Media 1
πŸ–ΌοΈ Media
B
briannekimmel
@briannekimmel
πŸ“…
Sep 09, 2025
229d ago
πŸ†”43584315

β€œIf a picture is worth a thousand words, then a prototype is worth a thousand meetings.” Loved this conversation on how designers, product managers, and Zillow’s co-founder himself use @Replit to ship hundreds & hundreds of prototypes. @amasad @LloydFrink @theallinpod https://t.co/IDmwecEk9m

Media 1
πŸ–ΌοΈ Media