Your curated collection of saved posts and media

Showing 32 posts · last 7 days · newest first
πŸ”jxnlco retweeted
S
Simon Willison
@simonw
πŸ“…
Mar 17, 2026
1h ago
πŸ†”68704943
⭐0.34

... and a follow-up chapter about Subagents, now a feature of Codex and Claude Code and Gemini CLI and Mistral Vibe and OpenCode and VS Code and Cursor https://t.co/suGmK4g3Hp

❤️ 17 likes · 🔁 3 retweets
πŸ”ivanleomk retweeted
A
agrim singh
@agrimsingh
πŸ“…
Mar 17, 2026
37m ago
πŸ†”08839930
⭐0.38

i dj'd a set this weekend and planned half of it with an app i built on @GoogleDeepMind's new multimodal embeddings model. it understands what music actually sounds like - not bpm, not genre tags. raw audio in, vibes out. built with @cursor_ai + @openai gpt 5.4, gemini embeddings and @convex to keep everything running smoothly. @ericzakariasson @DynamicWebPaige @waynesutton @gabrielchua here's what it does (demo below) 🧵

❤️ 6 likes · 🔁 3 retweets
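The app above ranks tracks by how close their audio embeddings sit in a shared space. The post doesn't show the code, so here is a minimal sketch of that retrieval step using random stand-in vectors in place of real audio embeddings (the model's actual API is not shown in the post):

```python
import numpy as np

def rank_by_vibe(query_vec, track_vecs):
    """Rank tracks by cosine similarity of their embeddings to a query."""
    q = query_vec / np.linalg.norm(query_vec)
    T = track_vecs / np.linalg.norm(track_vecs, axis=1, keepdims=True)
    return np.argsort(-(T @ q))  # indices, most similar first

rng = np.random.default_rng(42)
tracks = rng.standard_normal((5, 16))   # stand-in embeddings, one per track
order = rank_by_vibe(tracks[2], tracks)
assert order[0] == 2                    # a track is most similar to itself
```

With real multimodal embeddings, the same cosine-similarity ranking gives "raw audio in, vibes out": nearest neighbors sound alike regardless of genre tags.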
πŸ”_akhaliq retweeted
N
Niels Rogge
@NielsRogge
πŸ“…
Mar 17, 2026
56m ago
πŸ†”69938152

Tried the viral poster-skill with Claude Code on the trending Moonshot paper :) Not too bad! https://t.co/I8lb0aUrbT

Media 1
❀️10
likes
πŸ”3
retweets
πŸ–ΌοΈ Media
N
NielsRogge
@NielsRogge
πŸ“…
Mar 17, 2026
56m ago
πŸ†”69938152

Tried the viral poster-skill with Claude Code on the trending Moonshot paper :) Not too bad! https://t.co/I8lb0aUrbT

@Kimi_Moonshot β€’ Mon Mar 16 03:03

Introducing π‘¨π’•π’•π’†π’π’•π’Šπ’π’ π‘Ήπ’†π’”π’Šπ’…π’–π’‚π’π’”: Rethinking depth-wise aggregation. Residual connections have long relied on fixed, uniform accumulation. Inspired by the duality of time and depth, we introduce Attention Residuals, replacing standard depth-wise recurrence with learned, input-dep

Media 1
πŸ–ΌοΈ Media
πŸ”Modular retweeted
F
Forward Future
@ForwardFuture
πŸ“…
Mar 13, 2026
3d ago
πŸ†”14582366
⭐0.32

"Everyone should be a GPU programmer." @clattner_llvm's goal with @Modular: "What Modular is doing is opening up the box. We're fixing the language problem and the platform problem. The goal is to let more developers learn modern compute. And to give developers real choice in the hardware they use. Those two things unlock the ecosystem."

❤️ 6 likes · 🔁 1 retweet
HamelHusain (@HamelHusain) · 📅 Mar 17, 2026 · 1h ago · 🆔 89874302 · ⭐ 0.36

@matijagrcic @jxnlco I don't doubt that the internal benchmarks are done rigorously and with intellectual honesty. The sad part is that many coding agents report their own internal benchmarks, on which they are SOTA. So it's hard for the outside observer to put any stock in these, despite best intentions.

rasbt (@rasbt) · 📅 Mar 17, 2026 · 2h ago · 🆔 71094774 · ⭐ 0.32

@SalajSonar1086 Already done :). The respective tutorial articles are linked via the "View in Article" links there

πŸ”huggingface retweeted
G
Gabriele Berton
@gabriberton
πŸ“…
Mar 16, 2026
15h ago
πŸ†”45334177
⭐0.34

VisMatch is on PyPI! VisMatch is a wrapper for image matching models like LightGlue, RoMa-v2, MASt3R, LoFTR, and 50+ more! It's literally as simple as:

pip install vismatch
vismatch-match --inputs img0 img1 --matcher choose_any

to run image matching on any 2 images. [1/4] https://t.co/dIr2YapWak

❤️ 183 likes · 🔁 28 retweets
πŸ”Scobleizer retweeted
R
Ray Fernando
@RayFernando1337
πŸ“…
Mar 16, 2026
16h ago
πŸ†”10226271
⭐0.34

Nvidia GTC 2026 OpenClaw Setup on DGX Spark IRL https://t.co/zQwwfCF9XP

❤️ 28 likes · 🔁 3 retweets
πŸ”Scobleizer retweeted
L
lucas
@lucas_flatwhite
πŸ“…
Mar 17, 2026
10h ago
πŸ†”99053607
⭐0.36

πŸ› οΈ Claude Code "opusplan" 말 κ·ΈλŒ€λ‘œ ν•˜μ΄λΈŒλ¦¬λ“œ λͺ¨λΈ.. κ³΅μ‹μž„! Claude Codeμ—λŠ” opusplan λͺ¨λΈμ„ 선택할 수 μžˆμ–΄μš”. > /model opusplan ν•˜μ΄λΈŒλ¦¬λ“œ λͺ¨λΈ alias인데, μž‘μ—… 단계에 따라 μžλ™μœΌλ‘œ λͺ¨λΈμ„ μ „ν™˜ν•΄μš”. λ³΅μž‘ν•œ 좔둠을 μœ„ν•œ ν”Œλžœ λͺ¨λ“œμ—μ„œλŠ” Opusλ₯Ό μ‹€ν–‰ λ‹¨κ³„μ—μ„œλŠ” Sonnet으둜 μžλ™ μ „ν™˜λ©λ‹ˆλ‹€! Opus둜 κ³„νšν•˜κ³  κ΅¬ν˜„κΉŒμ§€ ν•˜λŠ” 것도 λ¬Όλ‘  κ°€λŠ₯ν•΄μš”. ν•˜μ§€λ§Œ 이미 νƒ„νƒ„ν•œ κ³„νšμ΄ μžˆλ‹€λ©΄, 싀행은 SonnetμœΌλ‘œλ„ μΆ©λΆ„ν•˜κ³  더 μ €λ ΄ν•  수 μžˆμ–΄μš”. 각 μž‘μ—…μ— λ§žλŠ” λͺ¨λΈμ„ μ“°λŠ” 것 = 효율 πŸš€ ν”Œλž˜λ‹κ³Ό 싀행은 μš”κ΅¬λ˜λŠ” 인지 λΆ€ν•˜κ°€ λ‹¬λΌμš”. Opus의 κΉŠμ€ μΆ”λ‘  λŠ₯λ ₯은 κ³„νš 수립 λ‹¨κ³„μ—μ„œ κ°€μž₯ λΉ›λ‚˜κ³ , 일단 νƒ„νƒ„ν•œ κ³„νšμ΄ μ„Έμ›Œμ§„ μ΄ν›„μ˜ 싀행은 Sonnet으둜 μΆ©λΆ„νžˆ 컀버될 수 μžˆμ–΄μš”. μ–Έμ œ μ“°λ©΄ μ’‹λƒκ΅¬μš”? - λ³΅μž‘ν•œ κΈ°λŠ₯ 섀계같이 μ•„ν‚€ν…μ²˜ 결정이 μ€‘μš”ν•œ μž‘μ—… - λ¦¬νŒ©ν† λ§ κ³„νšκ°™μ€ 영ν–₯ λ²”μœ„ 뢄석이 ν•„μš”ν•œ 경우 - Opus ν’€νŒŒμ›Œ μ‚¬μš© λŒ€λΉ„ λΉ„μš© 절감이 ν•„μš”ν•  λ•Œ 이거 이제 많이 ν™œμš©ν•˜μ‹€ λ“―!!!

❤️ 22 likes · 🔁 5 retweets
drmapavone (@drmapavone) · 📅 Mar 17, 2026 · 10h ago · 🆔 58775875

Jensen today announced Alpamayo 1.5 at #NVIDIAGTC! #Alpamayo 1.5 is a major update to Alpamayo 1, @nvidia's open 10B-parameter chain-of-thought reasoning VLA model, first introduced at #CES. Built on the #Cosmos-Reason2 VLM backbone and post-trained with RL, it adds support for navigation guidance, flexible multi-camera setups, configurable camera parameters, and user question answering. The result is an interactive, steerable reasoning engine for the AV community. We're also releasing post-training scripts to help researchers and developers adapt the model.

Additionally, we've significantly expanded the Alpamayo open platform across data and simulation, including releasing highly requested reasoning labels for the PhysicalAI Autonomous Vehicles dataset (https://t.co/fD9eUcndya), as well as our chain-of-causation auto-labeling pipeline.

🔎 Learn more about Alpamayo 1.5 and the latest extensions to the Alpamayo open platform: https://t.co/P0nuqkwBab (please note that most of the links will become active in the next few days.)

Happy building, and stay tuned for more in the coming months! @NVIDIADRIVE @NVIDIAAI

🖼️ Media (2, +1 more)
gdb (@gdb) · 📅 Mar 17, 2026 · 10h ago · 🆔 37895367 · ⭐ 0.32

Subagents are now supported in Codex. They're very fun and make it possible to get large amounts of work done *quickly*:

@OpenAIDevs • Mon Mar 16 20:09

Subagents are now available in Codex. You can accelerate your workflow by spinning up specialized agents to: β€’ Keep your main context window clean β€’ Tackle different parts of a task in parallel β€’ Steer individual agents as work unfolds https://t.co/QJC2ZYtYcA
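The fan-out pattern described above can be sketched with ordinary worker functions. This is a hypothetical analogy, not the Codex implementation: each "subagent" here is just a function handed one scoped task, run in parallel while the coordinator keeps its own state small:

```python
from concurrent.futures import ThreadPoolExecutor

def subagent(task: str) -> str:
    # Stand-in for delegating one scoped task to a separate agent context.
    return f"done: {task}"

tasks = ["write tests", "update docs", "refactor module"]
with ThreadPoolExecutor(max_workers=3) as pool:
    # map preserves task order even though workers run concurrently
    results = list(pool.map(subagent, tasks))

print(results)  # → ['done: write tests', 'done: update docs', 'done: refactor module']
```

The benefits listed in the announcement map onto this shape: independent contexts per worker, parallel progress, and per-task steering.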

πŸ”omarsar0 retweeted
O
elvis
@omarsar0
πŸ“…
Mar 16, 2026
1d ago
πŸ†”09077648
⭐0.38

Banger report from the Kimi team: Attention Residuals

Residual connections made deep Transformers trainable. But they also force uncontrolled hidden-state growth with depth. This work proposes a cleaner alternative.

It introduces Attention Residuals, which replace fixed residual accumulation with softmax attention over previous layer outputs. Instead of blindly summing everything, each layer selectively retrieves the earlier representations it actually needs. To keep this practical at scale, they add a blockwise version that compresses layers into block summaries, recovering most of the gains with minimal systems overhead.

Why does it matter? Residual paths have barely changed across modern LLMs, even though they govern how information moves through depth. This paper shows that making the mixing content-dependent improves scaling laws, matches a baseline trained with 1.25x more compute, and boosts GPQA-Diamond by +7.5 and HumanEval by +3.1, while keeping inference overhead under 2%.

Paper: https://t.co/04IG6FDiVr
Learn to build effective AI agents in our academy: https://t.co/1e8RZKs4uX

❤️ 130 likes · 🔁 17 retweets
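The core idea, softmax attention over previous layer outputs instead of a plain residual sum, can be illustrated in a toy single-token sketch. The paper's exact parameterization (and its blockwise variant) is not given in the post, so the projections here are illustrative assumptions:

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def attention_residual(layer_outputs, Wq, Wk):
    """Depth-wise attention over previous layer outputs for one token.

    Replaces the plain residual sum h_0 + ... + h_L with a learned,
    input-dependent weighted sum, as the post describes.
    """
    H = np.stack(layer_outputs)                # (L+1, d)
    q = H[-1] @ Wq                             # query from the newest layer
    K = H @ Wk                                 # one key per layer
    w = softmax(K @ q / np.sqrt(K.shape[-1]))  # attention weights over depth
    return w @ H                               # selective mix of earlier layers

rng = np.random.default_rng(0)
d, dk = 8, 4
outs = [rng.standard_normal(d) for _ in range(6)]
Wq, Wk = rng.standard_normal((d, dk)), rng.standard_normal((d, dk))
mixed = attention_residual(outs, Wq, Wk)
assert mixed.shape == (d,)
```

Because the weights are a softmax, the combined state stays a convex mix of layer outputs rather than growing unboundedly with depth, which is the failure mode the post attributes to standard residuals.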
πŸ”ivanleomk retweeted
Y
Yoeven
@yoeven
πŸ“…
Mar 17, 2026
13h ago
πŸ†”65291100

The moment he realised that https://t.co/vWmBsnR1nt isn't fully built on transformers and we can run on a single GPU with high accuracy and lower cost https://t.co/ZJYuL62UB8

Media 1
❤️ 4 likes · 🔁 1 retweet
🖼️ Media
πŸ”_akhaliq retweeted
H
Haocheng Xi
@HaochengXiUCB
πŸ“…
Mar 17, 2026
14h ago
πŸ†”24284251
⭐0.34

Thanks for sharing our newest work @_akhaliq ! Classic algorithms like K-Means deserve to be revisited in the era of massive datasets and GPUs. Flash-KMeans rethinks the algorithm from a systems perspective to make exact K-Means fast and memory-efficient on modern hardware.

❤️ 9 likes · 🔁 1 retweet
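The post doesn't spell out Flash-KMeans's method, but the standard systems trick behind memory-efficient exact K-Means is worth seeing: expand the squared distance so the hot loop is one matmul instead of an (n, k, d) difference tensor. A minimal sketch of that assignment step (not the paper's actual code):

```python
import numpy as np

def assign_exact(X, C):
    """Exact nearest-centroid assignment without an (n, k, d) tensor.

    Expands ||x - c||^2 = ||x||^2 - 2 x.c + ||c||^2 so the hot loop is a
    single matmul, the kind of memory-friendly reformulation that makes
    exact K-Means fast on GPUs.
    """
    x2 = (X * X).sum(axis=1, keepdims=True)   # (n, 1)
    c2 = (C * C).sum(axis=1)                  # (k,)
    d2 = x2 - 2.0 * (X @ C.T) + c2            # (n, k) squared distances
    return d2.argmin(axis=1)

rng = np.random.default_rng(1)
X = rng.standard_normal((200, 3))
C = rng.standard_normal((4, 3))
labels = assign_exact(X, C)
# Agrees with the naive memory-hungry computation:
naive = ((X[:, None, :] - C[None, :, :]) ** 2).sum(-1).argmin(1)
assert (labels == naive).all()
```

The result is still exact K-Means (no approximation in the assignment), which matches the paper's framing of rethinking the algorithm from a systems perspective rather than relaxing it.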
HamelHusain (@HamelHusain) · 📅 Mar 17, 2026 · 14h ago · 🆔 89510900 · ⭐ 0.32

Claude Code CLI > Codex CLI
Codex Desktop > Claude Code Desktop
It's a jagged UX frontier

code (@code) · 📅 Mar 17, 2026 · 14h ago · 🆔 94910880

🌐 Agentic Browser Tools (Experimental) in @code! Agents can now open pages, read content, click elements, and verify changes directly in the integrated browser while building your web app. Enable ⚙️ workbench.browser.enableChatTools to try it out. Learn more: https://t.co/kNwugFcbIA

Media 2
πŸ–ΌοΈ Media
nvidianewsroom (@nvidianewsroom) · 📅 Mar 16, 2026 · 14h ago · 🆔 99149451 · ⭐ 0.42

#NVIDIAGTC news: NVIDIA Dynamo 1.0 enters production as the broadly adopted inference operating system for AI factories. Dynamo 1.0 boosts Blackwell inference performance by up to 7x. The industry is scaling on NVIDIA. ⬇️ https://t.co/Iaq2H2SmhR

PyTorch (@PyTorch) · 📅 Mar 16, 2026 · 15h ago · 🆔 18333110

#ExecuTorch addresses fragmented native deployment for #AI agents as a #PyTorch native platform. It enables voice models across CPU, GPU, and NPU on Android, iOS, Linux, macOS & Windows. 🔗 https://t.co/NeQQyUniL4 https://t.co/O3itnoQFoG

Media 1
πŸ–ΌοΈ Media
πŸ”jxnlco retweeted
E
edwin
@edwinarbus
πŸ“…
Mar 16, 2026
19h ago
πŸ†”50334333
⭐0.34

Matt Maher tested frontier models in Cursor v. other harnesses. Cursor boosted model performance by 11% on average:
Gemini: 52% → 57%
GPT-5.4: 82% → 88%
Opus: 77% → 93%
His benchmark measures how well models implement a 100-feature PRD. @cursor_ai consistently outperformed. https://t.co/hrjCmWMNKN

❤️ 176 likes · 🔁 17 retweets
ZGojcic (@ZGojcic) · 📅 Mar 16, 2026 · 16h ago · 🆔 28929828 · ⭐ 0.40

A new generation in AV simulation is here! We are announcing AlpaDreams, a real-time, interactive generative world model for AV simulation! Just a year ago it took minutes to generate a few seconds of video; today it is real time and interactive! https://t.co/FbhKu3PMqe

jasteinerman (@jasteinerman) · 📅 Mar 16, 2026 · 17h ago · 🆔 75976987 · ⭐ 0.32

Love this submission from our world models hackathon this weekend - a generative FPS!

@AnshulDhawan001 • Mon Mar 16 21:08

Spent the weekend hacking at the Worlds in Action hackathon at @fdotinc by @SensAIHackademy. It was so much fun playing with the world models by @theworldlabs . I believe generative games are the future where characters, rules and even parts of the world can be generated and ad

ArtificialAnlys (@ArtificialAnlys) · 📅 Mar 16, 2026 · 18h ago · 🆔 52868861

NVIDIA has released Nemotron 3 VoiceChat! A ~12B parameter speech-to-speech model that leads our open weights Conversational Dynamics vs. Speech Reasoning pareto frontier.

Understanding speech-to-speech model performance is multidimensional. Two key and distinct dimensions are raw intelligence and conversational dynamics: how well a model handles the natural rhythms of human conversation, such as turn-taking and interruptions. Amongst full duplex open weights models, NVIDIA's new Nemotron 3 VoiceChat (V1) leads in balancing these dimensions, setting itself apart from other models on the Conversational Dynamics vs. Speech Reasoning pareto frontier.

Key benchmarking results:
➤ Conversational Dynamics (Full Duplex Bench): Nemotron 3 VoiceChat (V1) scores 77.8%, second among open weights speech-to-speech models behind NVIDIA's own PersonaPlex (91.0%) and ahead of FLM-Audio (62.0%), Moshi (61.0%), and Freeze-Omni (58.7%).
➤ Speech Reasoning (Big Bench Audio): Nemotron 3 VoiceChat (V1) scores 29.2%, second among open weights speech-to-speech models behind Freeze-Omni (33.9%) and well ahead of PersonaPlex (12.6%), FLM-Audio (5.3%), and Moshi (1.7%).
➤ Pareto leader: While Freeze-Omni leads on speech reasoning and PersonaPlex leads on conversational dynamics, Nemotron 3 VoiceChat (V1) is the only open weights model in the top 3 on both, making it the clear leader on the pareto frontier between these two critical dimensions.
➤ Larger than other open weights models but still small compared to LLMs: Nemotron 3 VoiceChat (V1) has 12B parameters, one of the larger open weights speech-to-speech models (NVIDIA's PersonaPlex is ~7B), yet still relatively small compared to leading LLMs.
➤ Context vs. proprietary models: While this release materially advances open weights performance, open weights speech-to-speech models still significantly underperform leading proprietary offerings. For comparison, proprietary models score substantially higher on our Big Bench Audio benchmark: Step-Audio R1.1 at 96%, Grok Voice Agent at 92%, Gemini 2.5 Flash (Thinking) at 92%, and Nova 2.0 Sonic at 87%. The gap between open weights and proprietary remains large in this modality.

As the capability and adoption of speech-to-speech models increases, we expect to expand our set of benchmarks to include elements such as tool-calling and multi-turn instruction following. See more details below ⬇️

Media 1
πŸ–ΌοΈ Media
πŸ”jeremyphoward retweeted
R
raia hadsell
@RaiaHadsell
πŸ“…
Mar 16, 2026
21h ago
πŸ†”56989392
⭐0.36

It's been about 20 years since I first started working on embeddings with Yann LeCun (siamese networks!), and I've been fascinated ever since. Gemini Embeddings 2 approaches the platonic ideal: native embedding of text, image, video, audio, and docs to a single space.

❤️ 277 likes · 🔁 24 retweets
HuggingPapers (@HuggingPapers) · 📅 Mar 16, 2026 · 18h ago · 🆔 83694046

OmniForcing unlocks real-time joint audio-visual generation. It achieves ~25 FPS with 0.7s latency (a 35× speedup over offline diffusion models) by distilling bidirectional LTX-2 into a causal streaming generator with maintained multi-modal fidelity. https://t.co/UGYGMyTQOs

Media 1
πŸ–ΌοΈ Media