Your curated collection of saved posts and media

Showing 10 posts · last 14 days · by score
gerardsans
@gerardsans
📅
Jun 21, 2026
13d ago
🆔02002516
⭐0.44

Original paper: https://t.co/oka6G5cnMB Refutations already in the literature: Gong et al. (2026) showed Patchscopes are unreliable: injected states overridden by model priors (faithfulness drops sharply). The “layer 6 belief” is just a partial vector sum the rest of the pass overwrites. https://t.co/Z2Dxtvgmnt The architecture is unchanged. The interpretive frame drifted. Time to review the math. Apply null hypotheses thoroughly. Leave anthropomorphic narratives behind for good.

emollick
@emollick
📅
Jun 29, 2026
5d ago
🆔62127927

I took the new AA-Briefcase scores from @ArtificialAnlys (basically having the AI do multi-week consulting gigs with a lot of complexity) and graphed the frontier curve for open and closed models: 1) Surprise, rapid gains! 2) The open weights gap is clear https://t.co/a1QGQC2hey https://t.co/bqJHA0WU0j

hardmaru
@hardmaru
📅
Jun 23, 2026
11d ago
🆔93144318

Sakana Fugu Technical Report https://t.co/6e6WuA8FVB Release Notes: https://t.co/7xWGpOicFN https://t.co/g2yaZvex35

🔁 ai_fast_track retweeted
Qwen
@Alibaba_Qwen
📅
Jun 24, 2026
10d ago
🆔42719867
⭐0.34

📣📣 Meet Qwen-AgentWorld, a native language world model that simulates 7 agent environments (MCP, Search, Terminal, SWE, Web, OS, Android) within a single model. Environment modeling is the training objective from day one, not a post-hoc adaptation.
🤔 LLMs are trained to be better agents: better at acting in environments. But nobody has trained them to model the environments themselves.
🗺️ Our roadmap: investigate how language world modeling can push the boundaries of general agent capabilities, along two routes:
1️⃣ Build a foundation model for environment simulation, outperforming Claude Opus 4.8 and GPT-5.4 on AgentWorldBench
2️⃣ Investigate how world modeling enhances agent training:
🔬 Controllable Sim RL (agentic RL with LWM as environments) surpasses training in real environments
🧠 Learning to predict environments (LWM warm-up) makes agents stronger: remarkably, even without any agent-specific training, this predictive knowledge transfers to agentic tasks with zero fine-tuning
📑 Paper: https://t.co/Jx2l5RKq71
📖 Blog: https://t.co/7tVcKyhsx2
💻 GitHub: https://t.co/B5Lvb1UZCn
🤗 HuggingFace: https://t.co/Kw3QBL1TM5
🧩 ModelScope: https://t.co/YBnGYgMWWI

❀️4,705
likes
πŸ”783
retweets
🔁 ylecun retweeted
Randall Balestriero
@randall_balestr
📅
Jul 02, 2026
2d ago
🆔40998064
⭐0.38

Oops, SIGReg did it again! Large scale (CC12M->Datacomp-L) vision-language JEPA pretraining beats CLIP and SigLIP objectives! Thanks to SIGReg, our LeVLJEPA has no collapse, no EMA, no stop-gradient, no negatives, no problem! Checkpoints/demo are live: https://t.co/wz6S6tYB6p

❀️163
likes
πŸ”25
retweets
SarvamForDevs
@SarvamForDevs
📅
Jul 02, 2026
2d ago
🆔53989608

AI Infra Day | SGLang × Sarvam with Hugging Face India's AI infra community is coming together. A day of deep-dives and technical discussions with researchers and engineers building the future of AI infrastructure. Bangalore | 11th July | 12:00–4:00PM Register Now: https://t.co/StSC0dxac9

ben_burtenshaw
@ben_burtenshaw
📅
Jul 02, 2026
2d ago
🆔75706032

the wildest part of this intelligence per watt paper (71.3% of chat queries could be local) is that the model is only a gpt-oss 20b. which is about a year old! compared to the current batch of small moe models (gemma 4, liquid LFM, Qwen-3.6, etc.) this is nothing. https://t.co/d4Oem5d35t

🔁 ylecun retweeted
Randall Balestriero
@randall_balestr
📅
Jul 01, 2026
3d ago
🆔00590573
⭐0.38

The Sensorimotor World Model (https://t.co/K5iWbk7Izs): a deep dive into the role of inverse dynamics modeling as an anti-collapse regularization for JEPAs. IDM is weaker than SIGReg as it doesn't have to fill the space--it only captures what is affected by the agent's actions 🧵 https://t.co/kdnVGbhkht

❀️171
likes
πŸ”27
retweets
🔁 Tim_Dettmers retweeted
Tianqi Chen
@tqchenml
📅
Jun 23, 2026
11d ago
🆔02734099
⭐0.34

We taught a brand-new mini-series this year at @SCSatCMU on Modern GPU Programming for ML Systems, as part of the ML Systems course, touching on fun questions like what data layout swizzling is, how to use 3D TMA, and state-of-the-art Blackwell programming. We released a curated online book based on the materials: https://t.co/5ZJg2lySNO check it out

❀️674
likes
πŸ”101
retweets
Meituan_LongCat
@Meituan_LongCat
📅
Jun 30, 2026
4d ago
🆔05308721

Introducing LongCat-2.0 🐱
1.6T parameters · MoE with ~48B active · 1M context
The full model behind Owl Alpha on @OpenRouter, now available.
Built for agentic coding from the ground up:
◆ LongCat Sparse Attention (LSA): scales efficiently for 1M-context tokens
◆ Zero-Compute Experts: dynamic activation 33B–56B per token, zero wasted compute
◆ MOPD: three specialized expert groups (Agent / Reasoning / Interaction), gate-routed per task
How it stacks up:
→ Terminal-Bench 2.1: 70.8
→ SWE-bench Pro: 59.5 (GPT-5.5: 58.6)
→ SWE-bench Multilingual: 77.3
→ FORTE: 73.2 · RWSearch: 78.8 · BrowseComp: 79.9
📖 Tech Blog: https://t.co/4KrjyKiDBn
Try it across different scenarios 🧵👇
