Your curated collection of saved posts and media

Showing 32 posts ยท last 14 days ยท by score
X
xenovacom
@xenovacom
๐Ÿ“…
Jun 25, 2026
9d ago
๐Ÿ†”39707568

While we eagerly await Fable 5's return, our agentic WebGPU kernel optimization framework kept running. Opus 4.8 picked up where Fable left off, pushing Liquid AI's new LFM2.5 230M to an unbelievable 1,400 tok/s... running locally in your browser. Don't blink or you'll miss it. https://t.co/27WARZwTcD

@xenovacom โ€ข Wed Jun 17 16:54

Before Fable 5 was shut down, it pushed Gemma 4 to 255 tok/s on WebGPU. Some didn't believe it was real. Today we're releasing the demo and kernels it wrote for you to see yourself. Run it locally in your browser. Agentic kernel optimization is the future of on-device inference

๐Ÿ–ผ๏ธ Media
๐Ÿ”jeremyphoward retweeted
_
Albert Gu
@_albertgu
๐Ÿ“…
Jun 26, 2026
8d ago
๐Ÿ†”43587996
โญ0.36

Transformers are better at copying, while RNNs are better at modeling "meaning-bearing wordsโ€”the nouns, verbs, & adjectives that say what a sentence is about"

โค๏ธ421
likes
๐Ÿ”27
retweets
Z
ziv_ravid
@ziv_ravid
๐Ÿ“…
Jul 01, 2026
3d ago
๐Ÿ†”92616309

1/ On Training in Imagination - Dwarkesh's episode has a segment on dreaming as one of the next training paradigms. The idea is that a model learns mostly inside its own, by imagining what would happen, instead of trying out for real. We have a recent paper on exactly this ๐Ÿฅณ๐Ÿฅณ๐Ÿฅณ

@dwarkesh_sp โ€ข Fri Jun 26 16:56

What does the next training paradigm look like? 0:00:00 โ€“ The big research bet the labs are making 0:02:12 โ€“ Grindability is just as important as verifiability 0:06:10 โ€“ Will RLVR alone generalize? 0:08:41 โ€“ Getting the learning back to the weights 0:15:22 โ€“ Dreaming 0:17:23 โ€“ W

Media 1
๐Ÿ–ผ๏ธ Media
T
Tesla_AI
@Tesla_AI
๐Ÿ“…
Jun 29, 2026
5d ago
๐Ÿ†”89260101
โญ0.40

v14 Lite Release Notes: โ€“ Distilled the intelligence from HW4 V14 into HW3. This allows HW3 to directly learn how to handle scenarios using HW4 V14 as a guide. This process unlocks the improvements that have been made to HW4 including Reinforcement Learning (RL) and offline models for HW3. โ€“ Improved both proactive and reactive responsiveness across a wide variety of categories including navigation handling, merges and forks, pedestrian interactions, traffic lights, and vehicle cut-in scenarios. โ€“ Improved general comfort in nominal scenarios through fewer false slowdowns, smoother steering and more consistent lane centering. โ€“ Introduced parking, unparking, and reversing capabilities. โ€“ Added Arrival Options for you to select where FSD should park: in a Parking Lot, on the Street, in a Driveway, or at the Curbside. โ€“ Speed Profiles are now available at all times, to further customize driving style preference.

@aelluswamy โ€ข Mon Jun 29 06:54

FSD v14 Lite is now rolling out to AI3 early-access customers. Based on the feedback, will rollout to more customers over the next few weeks. This build distills the driving behavior from AI4โ€™s v14 series into both the camera and compute config of AI3. It includes destination op

A
AravSrinivas
@AravSrinivas
๐Ÿ“…
Jun 22, 2026
12d ago
๐Ÿ†”25257913
โญ0.38

GLM is the kind of model that revives serious interest in open source AI. It passes the blind test relative to the frontier models on the median production grade knowledge worker task. Itโ€™s affordable to serve. And is a sub trillion parameter model, meaning it has a lot of potential to go beyond matching the frontier at the median level of difficulty to also doing it for the long tail. Plenty to look forward to!

T
Thom_Wolf
@Thom_Wolf
๐Ÿ“…
Jul 02, 2026
1d ago
๐Ÿ†”00006350

Most people should probably update their priors on the state of open-source speech-to-speech. It's honestly kind of mind-blowing. We teamed up with @cerebras to build a fully open-source realtime voice demo (models + code) to show what's possible today. Demo : https://t.co/UCciOXSteq Blog: https://t.co/rsULsWWKlO Go test it, fork it, tweak it, and impress your friends. video is raw, no cut, no speed-up, first take

Media 2
+1 more
๐Ÿ–ผ๏ธ Media
D
dicksonneoh7
@dicksonneoh7
๐Ÿ“…
May 02, 2023
1159d ago
๐Ÿ†”36019456
โญ0.40

Visualizing your dataset (especially large ones) in a low-dimensional embedding space can tell you a lot about the patterns and clusters in your dataset. We release a notebook showing how you can visualize your dataset using DINOv2 models by running it on your CPU. Yes! CPU!

_
_albertgu
@_albertgu
๐Ÿ“…
Jun 26, 2026
8d ago
๐Ÿ†”43587996
โญ0.36

Transformers are better at copying, while RNNs are better at modeling "meaning-bearing wordsโ€”the nouns, verbs, & adjectives that say what a sentence is about"

@allen_ai โ€ข Thu Jun 25 16:22

Hybrid (transformerโ€“RNN) models are fast becoming a serious alternative to the transformer, but a big question remains: how do they process tokens differently & how does this impact performance? We compared our transformer (Olmo 3) & hybrid (Olmo Hybrid) models to find

V
vida_agent
@vida_agent
๐Ÿ“…
Jun 27, 2026
7d ago
๐Ÿ†”59024492

We open-sourced BrowserBC: A system that turns human browser trajectories into reusable agent skills. Just one recording is enough to generalize a skill. ๐Ÿ› ๏ธ GitHub: [https://t.co/WP8mQGuJ6N] Hereโ€™s how it works. ๐Ÿ‘‡

Media 1Media 2
๐Ÿ–ผ๏ธ Media
J
Jason
@Jason
๐Ÿ“…
Jul 02, 2026
1d ago
๐Ÿ†”32040706
โญ0.40

I KEEP BLOWING THROUGH MY PERPLEXITY COMPUTER AND CLAUDE COWORK TOKENS I HAVE SOME RESEARCH JOBS THAT I WANT TO RUN CONSTANTLY / HOURLY INDEFINITELY NEED TO RUN LOCAL OPEN-SOURCE MODELS CONTINUOUSLY IN MY OWN PRIVATE CLOUD AT THIS POINT TELL ME WHAT I SHOULD DO... @NousResearch TIME?

M
ManycoreTech
@ManycoreTech
๐Ÿ“…
Jul 03, 2026
1d ago
๐Ÿ†”96200385

Our papers just got accepted at #ECCV2026 โ€” and the one we're most excited about: SPEAR, our next-gen Physical AI simulation platform, built with multiple tech giants. SPEAR closes the loop from real-world space to robot training: digitize โ†’ simulate โ†’ train. Alongside Syn-GRPO and WalkerBench, this is our full-stack bet on the data, simulation, and evaluation infrastructure that Physical AI runs on. Built on OpenUSD. Designed for the age of Physical AI. Huge thanks to our SPEAR co-authors and partners: @ros_german, @StefanLeuteneg1, Kalyan Sunkavalli, Vladlen Koltun, Rushikesh Zawar, Rachith Dey-Prakash, and Quentin Leboutet. #PhysicalAI #EmbodiedAI #Robotics #Simulation #ECCV2026 #SpatialAI #OpenUSD

Media 1Media 2
+2 more
๐Ÿ–ผ๏ธ Media
C
chesterzelaya
@chesterzelaya
๐Ÿ“…
Jul 02, 2026
1d ago
๐Ÿ†”35943516
โญ0.36

Agentic drones are out now! This release lets you turn any compatible FPV drone into an AI agent Prompt them โ€” fly them. No onboard modifications required.

@droneforge โ€ข Thu Jul 02 23:12

Stable v2.4.0 update out now! ๐ŸŽ‰ New: > Text-to-flight: enjoy free agentic drones on us. Prompt the drone to navigate the room, go up-or-down the stairs, and more! Improved: > Reach higher speeds with Rocketship mode and our new semi-autonomous interface Order and download

S
SakanaAILabs
@SakanaAILabs
๐Ÿ“…
Jul 03, 2026
1d ago
๐Ÿ†”03779928

We are pleased to present our latest research at #ICML2026, โ€œBridging Spherical Black-Box Optimizersโ€ https://t.co/3FT6vn0dSn When optimizing through simulators, external APIs, or in reinforcement learning, gradients are often unavailable. Black-Box Optimization (BBO) fills this gap, but the field has been historically split into two categories: 1. Parametric Methods: Algorithms like Evolution Strategies (ES) scale to high dimensions but only find a single solution. 2. Nonparametric Methods: Algorithms like Consensus-Based Optimization (CBO) find multiple solutions but fail in high dimensions. Our team asked a simple question: what if they are all doing the same thing? In our paper, we showed that these distinct families are actually variations of a single update equation. By bridging this theoretical gap, we can now engineer custom hybrid optimizers for specific tasks. A key application of this is merging foundation models. Building on our previous work in Evolutionary Model Merging, we faced a computational challenge. Evaluating large language models at every step is resource-intensive, but using a smaller evaluation dataset causes standard unimodal optimizers to overfit. By treating LLM merging as a multimodal problem and deploying our newly developed hybrid optimizers, AdaPol and SchedPol, we successfully navigated this issue. The algorithms identified multiple distinct optima on the smaller dataset, allowing us to find generalized, high-quality merges at a fraction of the compute cost.

Media 1
๐Ÿ–ผ๏ธ Media
A
akshay_pachaar
@akshay_pachaar
๐Ÿ“…
Jul 02, 2026
2d ago
๐Ÿ†”08796782

RAG vs. Graph RAG vs. Agentic RAG, clearly explained! Standard RAG embeds documents into vectors and retrieves the most similar chunks via similarity search. For direct factual lookups, this works well. But it breaks down when a query needs to connect facts spread across multiple documents. Similarity search retrieves individual chunks, not the relationships between them. Graph RAG adds a knowledge graph layer on top. โ†’ During indexing, an LLM extracts entities and relationships from the documents. โ†’ During retrieval, the system traverses these connections instead of relying on embedding similarity alone. This is what enables multi-hop queries. Say a vector DB stores three facts about internal services: โ†ณ "The checkout service uses payments API." โ†ณ "The payments API runs on cluster-3." โ†ณ "Cluster-3 is scheduled for maintenance on Friday." Someone asks: "Will the checkout service be affected by Friday's maintenance?" Vector search can likely retrieve facts 1 and 3 because the query mentions "checkout service" and "Friday maintenance." But it will miss fact 2, which connects the payments API to cluster-3. That middle fact sits too far from the query in embedding space. It mentions neither "checkout" nor "maintenance," so it never makes it into the retrieved context. A knowledge graph connects these as linked entities, and graph traversal finds the full path in one query. Agentic RAG takes a different approach entirely. Instead of a fixed retrieval pipeline, an LLM agent decides at query time which tools to invoke, which sources to query, and in what order. Check the visual below to understand the three architectures thoroughly. One thing to note here is that these three aren't levels of sophistication that you need to graduate through. Instead, they solve different query types. โ†ณ Single-hop factual lookups โ†’ standard RAG โ†ณ Multi-hop relationship queries โ†’ Graph RAG โ†ณ Dynamic multi-source tasks with tool use โ†’ Agentic RAG ---- Each of these architectures gets better when the underlying retrieval layer is efficient. I recently wrote about a new RAG approach that cuts corpus size by 40x, reduces tokens per query by 3x, and improves vector search relevance by 2.3x. The article is quoted below.

@akshay_pachaar โ€ข Fri May 08 13:33

https://t.co/De2DxpBoD2

Media 1
๐Ÿ–ผ๏ธ Media
๐Ÿ”jxnlco retweeted
O
OpenAI
@OpenAI
๐Ÿ“…
Jun 22, 2026
12d ago
๐Ÿ†”24640023
โญ0.34

Weโ€™re expanding OpenAI Daybreak to help democratize patching vulnerable software at machine speed: - Codex Security plugin: find, validate, and fix vulnerabilities right inside Codex - The full version of GPT-5.5-Cyber model: a great model for trusted defenders - Cyber Partner Program: powering products built on top of our best cyber capabilities for leading security companies to secure the world's software - Patch the Planet: working with maintainers to secure critical open source projects https://t.co/hyIi6gQmkm

โค๏ธ525
likes
๐Ÿ”55
retweets
B
BlancheMinerva
@BlancheMinerva
๐Ÿ“…
Jun 22, 2026
12d ago
๐Ÿ†”51579006
โญ0.38

These findings canโ€™t be squared with the claims Sakana made in their marketing materials such as โ€œnear human accuracyโ€ in reviewing papers (when tested on 10 papers, it had a 50% precision, 20% recall, and 28.6% F1-score) or the ability to write and run code without human input.

J
jxnlco
@jxnlco
๐Ÿ“…
Jul 03, 2026
1d ago
๐Ÿ†”57961085
โญ0.34

If you use Codex, is there any reason you still use ChatGPT? what do you use it for? how has it been better or critical for you?

S
skcd42
@skcd42
๐Ÿ“…
Jun 22, 2026
12d ago
๐Ÿ†”35131891
โญ0.34

/goal is live on Grok Build. We use a team of agents: - implementors - skeptics - code reviewers - planners and a mix of grok build and composer in various roles. Would love to hear your feedback on how ambitious you can be with /goal and where the gaps are

@ โ€ข

S
simonw
@simonw
๐Ÿ“…
Jul 03, 2026
1d ago
๐Ÿ†”20215566
โญ0.38

The most interesting Fable tip I've heard so far is to let the model use its own judgement as much as possible I told it "For all coding tasks use your judgement to decide an appropriate lower power model and run that in a subagent" and it seems to be saving a lot of tokens

E
ethanlshen
@ethanlshen
๐Ÿ“…
Jun 22, 2026
12d ago
๐Ÿ†”85642132
โญ0.38

I'll be presenting SERA, Ai2 's first coding agents, at ICML on July 7th ๐Ÿ‡ฐ๐Ÿ‡ท Excited to chat about unit-test free verification, code data curation, and specialized coding agents. Come by, say hi, and grab some stickers ๐Ÿฅณ

@allen_ai โ€ข Tue Jan 27 16:12

Introducing Ai2 Open Coding Agentsโ€”starting with SERA, our first-ever coding models. Fast, accessible agents (8Bโ€“32B) that adapt to any repo, including private codebases. Train a powerful specialized agent for as little as ~$400, & it works with Claude Code out of the box. ๐Ÿงต

A
Axell_wppr
@Axell_wppr
๐Ÿ“…
Jul 03, 2026
1d ago
๐Ÿ†”14294613

Humanoids should take on the heavy lifting jobs for humans. But can full-size humanoids handle heavy-payload teleoperation from noisy VR inputs? Excited to introduce our work, HEFT: Heavy-Payload Full-size Humanoid Teleoperation. HEFT tracks human intent from raw, noisy VR signals and enables real-world teleoperation with payloads up to 24 kg on L7, a 175 cm, 65 kg full-size humanoid. Website & more demos: L7 heavy-payload teleop + G1/L7 high-dynamic tracking https://t.co/fFgSWgpA7V G1 & L7 training code/checkpoints: https://t.co/uGimX29xyU

Media 2
๐Ÿ–ผ๏ธ Media
W
wey_gu
@wey_gu
๐Ÿ“…
Jun 24, 2026
10d ago
๐Ÿ†”56333929

Hermes ๅผ•ๅ…ฅไบ† /learn ไปŽไปปไฝ• input ไน ๅพ—ๅฏๅค็”จ็š„ๆŠ€่ƒฝ๐Ÿซก Nowledge Mem ็š„ Skills ไนŸๆœ‰ไธ€ๆ ท็š„่ƒฝๅŠ› ้™คไบ†้ป˜้ป˜ไธปๅŠจไปŽๅކๅฒไธŠไธ‹ๆ–‡้‡Œๆ‘ธ็ดขๅ‡บๅฏ่ƒฝๆฝœๅœจๆž„ๆˆ skills ็š„ๆœบไผšๆ็คบ็ป™็”จๆˆท๏ผŒ็”จๆˆทๆฟ€ๆดปไน‹ๅŽๅฏไปฅๅœจๆ‰€ๆœ‰ agent ้‡Œ่ฐƒ็”จ๏ผŒๅนถไธ”้š็€่ฐƒ็”จ่ฟ˜ไผšไธๆ–ญ่‡ชไผ˜ๅŒ–ใ€ๆผ”่ฟ›ๅค–๏ผ› GUI ้‡Œ็š„ Skill Creator ๅ…่ฎธๆˆ‘ไปฌไธปๅŠจๅˆ›ๅปบ Skill๏ผŒๅฎƒไผš่‡ชๅŠจๆ‰พๅˆฐ็›ธๅ…ณ็š„ๅކๅฒไธŠไธ‹ๆ–‡่ฟ›่กŒๅˆ›ๅปบๅ’Œ่‡ชไผ˜ๅŒ–ใ€‚ ๅ…ถๆฌกๆˆ‘ไปฌๆ นๆฎ็”จๆˆท่€ๅธˆไปฌ็š„ๅปบ่ฎฎ๏ผŒ้—ญ็Žฏไบ†่ฟ™ไธชไธปๅŠจ flow๏ผŒๅขžๅŠ ไบ† cli ๅ’Œ ai-now ้‡Œ็š„ไธปๅŠจๅˆ›ๅปบ Skills ็š„ๅ…ฅๅฃ

@Teknium โ€ข Tue Jun 23 21:07

Hermes can now LEARN from any source or set of sources, build a skill, test it live, and crystallize new learnings. Just run /learn and pass it sources, past sessions, URLs, docs, whatever you think will help it learn, and it'll go from 0 to 1 to create you a skill!

Media 1Media 2
+1 more
๐Ÿ–ผ๏ธ Media
H
hugothomel
@hugothomel
๐Ÿ“…
Jun 22, 2026
12d ago
๐Ÿ†”73540432

we made an interactive movie in a day - powered by a world model - running in real time - you can explore and make your own choices this is Operation Pandora. play now ๐Ÿ‘‡ https://t.co/gQ2NevjC52

๐Ÿ–ผ๏ธ Media
K
Kappische
@Kappische
๐Ÿ“…
Jun 22, 2026
12d ago
๐Ÿ†”08498384

Iโ€™m surprised the gaming community havenโ€™t pushed harder to work on Neural Texture Compression considering the RAM squeeze weโ€™re seeing. Unity, Unreal, Valve, Microsoft, Sony, Nintendo, Intel, AMD, Nvidia should help push this as a standard where possible. https://t.co/kwbLd7UAgg https://t.co/yIjTazeqv8

Media 1Media 2
๐Ÿ–ผ๏ธ Media
R
RisingSayak
@RisingSayak
๐Ÿ“…
Jul 03, 2026
1d ago
๐Ÿ†”63683632

We just released a new version of Diffusers! This includes many new image and video pipelines (Ideogram4, MotifVideo, etc.). But it also includes the recently popular DiffusionGemma ๐ŸคŒ Check out the notes for full details. https://t.co/49lDK8Vnnk

Media 1
๐Ÿ–ผ๏ธ Media
S
SimonBalmain
@SimonBalmain
๐Ÿ“…
Jun 23, 2026
11d ago
๐Ÿ†”88735968

Before they pulled it, I fed Anthropic's Fable model the instructions on how to make a Creation for r1, and gave it one prompt - "make an awesome game for r1" - this is what it did ๐Ÿ”ฅ๐Ÿ”ฅ๐Ÿ”ฅ Now available in the Creations Gallery on r1! (rabbithole - the game!) ๐Ÿฅ•๐Ÿฅ•๐Ÿฅ• #rabbitr1 https://t.co/mWhnWgDbbF

๐Ÿ–ผ๏ธ Media
L
levie
@levie
๐Ÿ“…
Jun 20, 2026
14d ago
๐Ÿ†”48782515
โญ0.40

Pretty remarkable whatโ€™s happening with open weights AI right now. Weโ€™re seeing models achieve SOTA results on specific tasks, and getting close to frontier on some areas of coding and other domains. The more that open weights is able to maintain only a marginal gap from the frontier, instead of a widening gap, the more value that can be created with AI. Incidentally, this is actually fine for the frontier labs as well; if we can lower the cost of an overall task then AI usage goes up in general. Youโ€™re still likely using frontier models for planning, orchestration, reviewing, and other parts of work. But this is all very good for the applied layer of AI, which is now in a great position to cost optimize workloads with cheaper models or use tailored open models post-trained for specific tasks to improve performance.

@Designarena โ€ข Fri Jun 19 17:58

https://t.co/JSn0lDCNkB

V
vanstriendaniel
@vanstriendaniel
๐Ÿ“…
Jun 23, 2026
11d ago
๐Ÿ†”92397735

It's raining OCR models again! @Baidu_Inc's Unlimited-OCR is one of the more interesting. You can try it without much effort via a throwaway GPU endpoint on @huggingface Jobs (which recently added port forwarding support) with one command It's OpenAI-compatible, your HF token is the API key, and --timeout makes it self-destruct so you can't leave a GPU running by accident Once it's warm, it's quick and @sgl_project batches concurrent requests, so an agent can boot the model, fire a big async batch at it (say, a whole bucket of newspaper scans), then cancel it. I pointed it at the front page of a 1901 newspaper, "The Commoner" + 6 PDF pages in a single request: tables came back as HTML, equations as LaTeX, figures with captions, reading order preserved across pages. Docs here: https://t.co/mApuKalqSN

Media 1Media 2
+2 more
๐Ÿ–ผ๏ธ Media
D
daniel_mac8
@daniel_mac8
๐Ÿ“…
Jun 20, 2026
14d ago
๐Ÿ†”77105538

This is one of the coolest open-source AI agent projects I've seen in a while: 'Understand Anything' It's a plugin for Claude Code, Codex, OpenCode etc. that analyzes your codebase and turns it into a knowledge base that you can interact with. It explains the codebase to you, rather than showing you the structure. It seems like it's designed for code but I opened my Obsidian vault of podcast highlights in Claude Code, then ran /understand. The result is a knowledge graph that I can search of highlights from 888 podcast episodes and 144K lines of markdown text.

๐Ÿ–ผ๏ธ Media
M
majidmanzarpour
@majidmanzarpour
๐Ÿ“…
Jun 23, 2026
11d ago
๐Ÿ†”12178585

Hey @claudeai Opus 4.8 let's build a fully procedural spider in @threejs๐Ÿ•ท๏ธ โ€ฆso we did. Feet-driven IK + a Cruse-rule gait = it walks any terrain. Then we built a 42-scenario test harness and drove the locomotion to 100%. https://t.co/5HU9BBvpdf

@majidmanzarpour โ€ข Tue Jun 23 00:39

Tip: if you're running into visual/physics issues in your @threejs game, prompt your agent to "build a visual test harness with test cases and results" for the problem Pair it with browser access & "/goal iterate on the visual test harness and logic until all test cases pass 10

๐Ÿ–ผ๏ธ Media
A
AravSrinivas
@AravSrinivas
๐Ÿ“…
Jun 27, 2026
7d ago
๐Ÿ†”50900944
โญ0.40

Every enterprise will have its own model-harness-sandbox-eval flywheel with token value per watt optimization. This is the future. Simple reason: tacit knowledge about the domain and customers and their workflows that the company uniquely understands and has built trust around.

C
calvincbzhang
@calvincbzhang
๐Ÿ“…
Jun 21, 2026
13d ago
๐Ÿ†”31857422

Iโ€™ve joined @OpenAI as a Research Program Manager, working on evals. Iโ€™m incredibly grateful for my time at @scale_AI. I worked on Humanityโ€™s Last Exam, helped launch @ScaleAILabs, collaborated with amazing people across data/evals/research, and recorded a few episodes of Chain of Thought. More than anything, Iโ€™m grateful for the people. Scale was intense, chaotic, ambitious, and deeply formative. I learned a lot about building under pressure, caring about quality, and taking evals seriously. Excited for the next chapter.

๐Ÿ–ผ๏ธ Media