Your curated collection of saved posts and media

Showing 24 posts · last 7 days · quality filtered
🔁 jeremyphoward retweeted
Acer
@AcerFur
📅 Mar 05, 2026
7d ago
🆔 13955357

Also, come on OpenAI. If you want an automated AI researcher, this needs to start going up, not down. https://t.co/0ZQ4UhdNyu

Media 1
❤️ 522 likes
🔁 27 retweets
GergelyOrosz
@GergelyOrosz
📅 Mar 05, 2026
8d ago
🆔 54361715

Google kicks out legit paying Antigravity customers for high usage [to solve their problem of not enough capacity]; does not tell them; does not offer refunds or any way to refund the service. This comic by @lmanul is so spot on with regards to Google (and Amazon!) https://t.co/x0LToYbOHX

Media 1
tanishqkumar07
@tanishqkumar07
📅 Mar 04, 2026
8d ago
🆔 96631872

I've been working on a new LLM inference algorithm. It's called Speculative Speculative Decoding (SSD) and it's up to 2x faster than the strongest inference engines in the world. Collab w/ @tri_dao @avnermay. Details in thread.
wonmin_byeon
@wonmin_byeon
📅 Mar 04, 2026
8d ago
🆔 46418709

🚀 New paper: Mamba–Transformer hybrid VLMs can go fast without forgetting. We introduce stateful token reduction for long-video VLMs.
✅ Only 25% of visual tokens
🚀 3.8–4.2× faster prefilling (TTFT)
🎯 Near-baseline accuracy (can exceed baseline with light finetuning)
https://t.co/CJaCktyWCt

Media 1
tedzadouri
@tedzadouri
📅 Mar 05, 2026
7d ago
🆔 06841236

Asymmetric hardware scaling is here. Blackwell tensor cores are now so fast, exp2 and shared memory are the wall. FlashAttention-4 changes the algorithm & pipeline so that softmax & SMEM bandwidth no longer dictate speed. Attn reaches ~1600 TFLOPs, pretty much at matmul speed! Joint work w/ Markus Hoehnerbach, Jay Shah (@ultraproduct), Timmy Liu, Vijay Thakkar (@__tensorcore__), Tri Dao (@tri_dao) 1/

Media 1
tri_dao
@tri_dao
📅 Mar 05, 2026
7d ago
🆔 58646344

Claude / Codex also have an easier time writing some components of FA4 thanks to the fast compile time. I got Claude to debug a deadlock when we first implemented 2CTA fwd. It ran autonomously overnight for 6 hours, figured out part of the fix, but then went down a rabbit hole convincing itself that the compiler is broken (so very human 😂). After 6 hours, from Claude's partial fix, I was able to fix the hang in 10 mins. More details here: https://t.co/ipGhC9FzET I'm hoping FA5 will be written completely by AI

Media 1
🔁 tri_dao retweeted
Mayank Mishra
@MayankMish98
📅 Mar 05, 2026
7d ago
🆔 79317378

FA4 now available in lm-engine: https://t.co/n47TEinAfG 13.4% end-to-end speedup for Llama 8B training on 4x GB200s (1 node) 🚀🚀🚀 1005.55 TFLOPs for SDPA vs 1140.73 for FA4 (BF16 precision) @tedzadouri @ultraproduct @__tensorcore__ @tri_dao cooked Thanks to @bharatrunwal2 for running the experiment!

Media 1
❤️ 51 likes
🔁 9 retweets
🔁 tri_dao retweeted
Stas Bekman
@StasBekman
📅 Mar 05, 2026
7d ago
🆔 75487320

The FA4 integration into @huggingface Transformers is here: https://t.co/48XPxmKbMv You will need to apply my proposed changes at the end for it to work if the owner hasn't done it already by the time you try it out.

Media 1
❤️ 22 likes
🔁 3 retweets
togethercompute
@togethercompute
📅 Mar 05, 2026
7d ago
🆔 35702061

Together Research has produced FlashAttention, ATLAS, ThunderKittens and more. This week at AI Native Conf: seven more releases, all coming to production soon. Thread → #ainativeconf #ainativecloud https://t.co/XXIXMRRiLe

Media 1
Kelaivy
@Kelaivy
📅 Mar 05, 2026
8d ago
🆔 23021787

@FPLGOAT7 I got lucky, sold Dango, sold Haaland. Tarkowski did it for me. https://t.co/BUmAWBP0W7

Media 1
Kelaivy
@Kelaivy
📅 Mar 05, 2026
8d ago
🆔 19474729

@yehiael22 @FPL_Harry Same here https://t.co/2gZdylxMf1

Media 1
🔁 NaderLikeLadder retweeted
Addy Osmani
@addyosmani
📅 Mar 05, 2026
8d ago
🆔 67805081

Introducing the Google Workspace CLI: https://t.co/8yWtbxiVPp - built for humans and agents. Google Drive, Gmail, Calendar, and every Workspace API. 40+ agent skills included.

Media 1
❤️ 14,229 likes
🔁 1,490 retweets
NVIDIAAIDev
@NVIDIAAIDev
📅 Mar 04, 2026
8d ago
🆔 50842580

⚠️ WARNING: THIS PRODUCT MAY CONTAIN SHELLFISH 🦞 https://t.co/zJ6n2auo6B
cerebral_valley
@cerebral_valley
📅 Mar 02, 2026
10d ago
🆔 07880476

Do you want to demo your project at the Meta booth during GTC? 😎
Join @Meta and @nvidia, in partnership with CV, for a full-day hackathon at @SHACK15sf, writing high-performance GPU kernels with Helion, PyTorch's new kernel authoring DSL that delivers higher performance in fewer lines of code with autotuning.
📅 March 14th - Right before NVIDIA GTC. The perfect warm-up.
🏆 Prizes & perks:
> Nvidia GPUs and Nvidia DGX Spark
> Demo your project at the Meta booth during GTC
> GTC conference passes
> Ray-Ban Meta glasses
> Mentoring from Meta AI researchers & NVIDIA engineers
📍 Fully in-person | Teams of up to 4 | Rolling review, limited spots
Register below 👇
openinfradev
@openinfradev
📅 Mar 03, 2026
10d ago
🆔 55165932

How can we securely contain #AI? In this live discussion, experts will explore why traditional container isolation falls short for agent-based systems & what changes when agents have persistent memory, filesystem access, GPUs, or external execution authority https://t.co/qi4Mw97DPo

Media 1