Your curated collection of saved posts and media
I asked Cursor to add Vim support to the Ladybird browser. It automatically set up the environment to run the browser, made the code changes, and sent me a recorded demo. Not just for web apps! https://t.co/qDxnOr6CHU
My new favorite tmux dev layout features @opencode (with Kimi K2.5 running on @FireworksAI_HQ) on top and Claude Code on the bottom. I start almost all agent tasks with Kimi (so fast!), then ask Claude if I need a second opinion/more advanced stuff. Great combo! https://t.co/cUxfPgHFlW
Claude renovated my GitHub homepage for me by automatically setting up a CRON that pulls in my latest blog posts, and found images and other details to make things a bit nicer :) https://t.co/42GrcQ4w05
@amankhan Yeah. Just for clarity what I'm talking about is that this is when I'm interacting with claude it makes a mini interface https://t.co/dIg883ww2t
@garrytan We are absolutely back! https://t.co/ruVRC4CGxD
1/5 Happy CNYπ Still bothered by RL off-policy instability in LLM? Introducing a new wayπ‘Adaptive Layerwise Perturbation (ALP)π‘, a simple but robust fix that outperforms GRPO/MIS/Bypass, achieves better stability (KL, entropy) and exploration! π Blog: https://t.co/0def1Nb7uI https://t.co/9epsd4xJNp

https://t.co/YNGWOvywHf for those donβt remember it
pewdiepie just trained his own llm, and it beats gpt-4o on coding benchmarks. an apocalyptic, civilization-ending catastrophe of laughably, cosmically disproportionate magnitude for the entire ml research job category https://t.co/loDJZPnwN5
Wait, what?! PewDiePie using @axolotl_ai for his project! π₯ https://t.co/vnXeDfMzcc
For those following the DoW AI drama, I highly recommend reading this post explaining how @OpenAI approached the negotiations with the DoW.

https://t.co/3luoY7bgyM

Meta presents VecGlypher Unified Vector Glyph Generation with Language Models paper: https://t.co/anAFlgLMMV https://t.co/Nh3OpUBwa9

π₯Tongyi Lab releases Mobile-Agent-v3.5οΌ20+SOTA GUI benchmarks: (1) GUI automation, 56.5OSWorld, 71.6AndroidWorld, and48.4WebArena; (2) Grounding, 80.3ScreenSpotPro; (3) tool-calling , 47.6OSWorld-MCP @_akhaliq #LLM #Agent #GUI https://t.co/xCbyL0JZLl
Gradio's new HTML component is crazy! 3D Camera Control designed as a game-pad style toggleπ€― Try it for free on @huggingface π https://t.co/xoGdbrej3F
PewDiePie using Hugging Face π₯ https://t.co/9D7PUA0sun
seeing Hugging Face and The Stack/BigCode on PewDiePie video wasn't in my 2026 bingo card https://t.co/AIIOGzwOZT
seeing Hugging Face and The Stack/BigCode on PewDiePie video wasn't in my 2026 bingo card https://t.co/AIIOGzwOZT
The Trinity of Consistency as a Defining Principle for General World Models paper: https://t.co/21cbl3hAdu https://t.co/9YmzIPmsBJ

From Statics to Dynamics Physics-Aware Image Editing with Latent Transition Priors paper: https://t.co/Duflv5VmKj https://t.co/da73QuSURs

Introducing Code Review Bench v0: https://t.co/iAZDURyqol The first independent code review benchmark. 200,000+ PRs. Unbiased. Fully OSS. Updated daily. Tool performance highlights π§΅π Featuring: @augmentcode @baz_scm @claudeai @coderabbitai @cursor @GeminiApp @github @graphite @greptile @kilocode @OpenAIDevs @propelcode @QodoAI
Introducing Code Review Bench v0: https://t.co/iAZDURyqol The first independent code review benchmark. 200,000+ PRs. Unbiased. Fully OSS. Updated daily. Tool performance highlights π§΅π Featuring: @augmentcode @baz_scm @claudeai @coderabbitai @cursor @GeminiApp @github @graphite @greptile @kilocode @OpenAIDevs @propelcode @QodoAI
βοΈΒ Hello? AI Selves now have phone numbers! Put them in your imessage or SMS to be there when youβre not, settle arguments in your group chats, and make talking to yourself more normal. More ideas ππ§΅ Plus, weβre letting more people in off of our waitlist! QRT to get your own early access code.
Imagination Helps Visual Reasoning, But Not Yet in Latent Space Causal mediation analysis reveals latent visual reasoning in MLLMs fails: latent tokens ignore inputs and barely affect answers. CapImagine, a text-based alternative, teaches explicit imagination and significantly outperforms latent baselines.
Top AI Papers of The Week (Feb 24 - Mar 2) - A Very Big Video Reasoning Suite: 200 tasks, 1M+ video clips for video reasoning research - Does Your Reasoning Model Implicitly Know When to Stop Thinking? Introducing SAGE paradigm - AgentFly: Fine-tuning LLM agents without fine-tuning LLMs - Microsoft rStar2-Agent: 80.6% on AIME24 with just 14B parameters - From Blind Spots to Gains: Diagnostic-driven iterative training for LMMs - VibeVoice: Synthesizing 90-minute multi-speaker conversational speech - Alibaba MobilityBench: Benchmarking real-world route-planning agents - NVIDIA's data engineering strategies for scaling LLM terminal capabilities - VESPO: Variational sequence-level soft policy optimization for stable RL training - Beyond Pass@1: Self-play with variational problem synthesis sustains RLVR Find them below: