Your curated collection of saved posts and media

Showing 32 posts Β· last 14 days Β· by score
S
Daniel Svonava
@svonava
πŸ“…
Tue Jun 17
πŸ†”44576641

might use this on my profile as overheard in @HamelHusain 's course https://t.co/jVcZmbPJnJ

Media 1
❀️3
likes
πŸ”2
retweets
πŸ–ΌοΈ Media
A
Aran Komatsuzaki
@arankomatsuzaki
πŸ“…
Tue Jun 17
πŸ†”87663697

Serving Large Language Models on Huawei CloudMatrix384 - Integrates 384 Ascend 910C NPUs, interconnected via an ultra-high-bandwidth, low-latency UB network, optimized for large-scale MoE and distributed KV cache access - DeepSeek-R1 on CloudMatrix-Infer hits 2k tokens/s decode… https://t.co/zZuEAdu7Gn

Media 1
❀️122
likes
πŸ”18
retweets
πŸ–ΌοΈ Media
T
Zach Mueller
@TheZachMueller
πŸ“…
Wed
πŸ†”89105378

Stop wasting time guessing why your AI fails. The most valuable skill I learned recently: error analysis https://t.co/e5XWJB1L1a Hamel & Shreya teach you how to diagnose what's going wrong with your pipeline, and build evals you can trust at scale. Error analysis is just the… https://t.co/D2ptDf7fEF

Media 1
❀️432
likes
πŸ”34
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Wed
πŸ†”11480928

No signs of an end to rapid gains in AI ability at ever-decreasing costs yet I did my best to update my chart to take into account the price drop in o3 & the new models released by Google. GPT-4 was 2.25 years ago,so its worth noting the trend when considering the future of AI. https://t.co/mF4CZ5eqQp

Media 1
❀️826
likes
πŸ”122
retweets
πŸ–ΌοΈ Media
I
Tanishq Mathew Abraham, Ph.D.
@iScienceLuvr
πŸ“…
Thu Jun 19
πŸ†”11453635

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective "We introduce GURU, a curated RL reasoning corpus of 92K verifiable examples spanning six reasoning domainsβ€”Math, Code, Science, Logic, Simulation, and Tabularβ€”each built through domain-specific… https://t.co/Toc6SAkTXA

Media 1
❀️221
likes
πŸ”45
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Wed
πŸ†”22629478

From Bytes to Ideas Avoids using predefined vocabs and memory-heavy embedding tables. Instead, it uses Autoregressive U-Nets to embed information directly from raw bytes. This is huge! Enables infinite vocab size and more. More in my notes below: https://t.co/AGonec9SzY

Media 1
❀️193
likes
πŸ”40
retweets
πŸ–ΌοΈ Media
D
Delip Rao e/Οƒ
@deliprao
πŸ“…
Thu Jun 19
πŸ†”97662284

Agents training in RL gym https://t.co/ZvP1SgSrAA

❀️1,798
likes
πŸ”130
retweets
πŸ–ΌοΈ Media
A
Aran Komatsuzaki
@arankomatsuzaki
πŸ“…
Wed
πŸ†”67017621

From Bytes to Ideas: Language Modeling with Autoregressive U-Nets Presents an autoregressive U-Net that processes raw bytes and learns hierarchical token representation Matches strong BPE baselines, with deeper hierarchies demonstrating promising scaling trends https://t.co/9ShElZ6GsS

Media 1
❀️363
likes
πŸ”57
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Tue Jun 17
πŸ†”09324336

Nice recap of the progress in LLMs. If you want to be a top AI dev, dive deep into these topics. There are tons of opportunities and problems to solve. My notes below: https://t.co/KKHEDXCMsx

Media 1
❀️219
likes
πŸ”36
retweets
πŸ–ΌοΈ Media
I
Ivan Leo
@ivanleomk
πŸ“…
Wed
πŸ†”01572768

sonnet u poor thing https://t.co/m8Ls1PKJMj

Media 1
❀️3
likes
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Wed
πŸ†”88140266

veo 3: "three toy ships, one made of iron, the other of wood, and one out of loosely packed sugar, are dropped into a pool of water" AI video tools really do seem to be able to simulate physics well (but not perfectly) without having an underlying physics engine. A world model? https://t.co/Qen5S4VyoN

❀️539
likes
πŸ”35
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Tue Jun 17
πŸ†”13973196

Providing β€œcognitive tools” to GPT-4.1 increases performance on AIME2024 from 26.7% to 43.3%. Damn! That's very close to the performance of o1-preview. Reasoning as a tool goes hard! Here are my notes: https://t.co/a6o2Cd5swC

Media 1
❀️477
likes
πŸ”62
retweets
πŸ–ΌοΈ Media
N
Neil deGrasse Tyson
@neiltyson
πŸ“…
Mon
πŸ†”87007294

Every machine in a Hospital that diagnoses your body without cutting you open is based on a principle of Physics, discovered by a Physicist who had no interest in Medicine. If you think the world doesn’t need Basic Science, or that somehow Science has failed you, think again. https://t.co/fvcdwnpKza

Media 1
❀️11,906
likes
πŸ”2,290
retweets
πŸ–ΌοΈ Media
I
Tanishq Mathew Abraham, Ph.D.
@iScienceLuvr
πŸ“…
Wed
πŸ†”80105326

LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs "we propose LongLLaDA, a training-free method that integrates LLaDA with the NTK-based RoPE extrapolation. Our results validate that established extrapolation scaling laws remain effective for extending the… https://t.co/DDDl1L4dDc

Media 1
❀️215
likes
πŸ”39
retweets
πŸ–ΌοΈ Media
N
Nabla
@nabla_ai
πŸ“…
Tue Jun 17
πŸ†”49840700

We’re building the most advanced agentic AI assistant for clinicians. Today, we’re announcing a $70M Series C to accelerate our mission: restoring the human connection at the heart of healthcare through AI that delivers real clinical and financial impact. Thank you to… https://t.co/VVrOgpzwYu

Media 1
❀️65
likes
πŸ”9
retweets
πŸ–ΌοΈ Media
K
Andrej Karpathy
@karpathy
πŸ“…
Mon
πŸ†”69951434

An attempt to explain (current) ChatGPT versions. I still run into many, many people who don't know that: - o3 is the obvious best thing for important/hard things. It is a reasoning model that is much stronger than 4o and if you are using ChatGPT professionally and not using o3… https://t.co/1bQz0frqIc

Media 1
❀️13,653
likes
πŸ”1,668
retweets
πŸ–ΌοΈ Media
I
Tanishq Mathew Abraham, Ph.D.
@iScienceLuvr
πŸ“…
Tue Jun 17
πŸ†”58908343

Serving Large Language Models on Huawei CloudMatrix384 "this paper introduces Huawei CloudMatrix, a next-generation AI datacenter architecture that embodies Huawei’s vision for reshaping the foundation of AI infrastructure." "To fully leverage CloudMatrix384, we propose… https://t.co/8QpwKlZSX1

Media 1
❀️66
likes
πŸ”13
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Tue Jun 17
πŸ†”23842006

Great end-to-end platform for teams building with LLMs and using prompts in production. Adaline is a single platform where teams can iterate, evaluate, deploy, and monitor AI prompts. They are also giving $1M in API credits to the first 100 team workspaces this week. πŸ”„ Use… https://t.co/kyIUoJwegq

❀️103
likes
πŸ”18
retweets
πŸ–ΌοΈ Media
S
Lisan al Gaib
@scaling01
πŸ“…
Tue Jun 17
πŸ†”36585326

lmarena has a competitor Yupp is basically lmarena, but with more granular feedback and a credit system. Each message costs you some credits, but when you give high-quality feedback you get credits back to use on your favorite models. This is their multi-turn (5+ messages) VIBE… https://t.co/1R0zwHqast

Media 1Media 2
❀️278
likes
πŸ”18
retweets
πŸ–ΌοΈ Media
K
Konstantin Mishchenko
@konstmish
πŸ“…
Sun
πŸ†”80022913

I'm surprised this post didn't get more attention. Most papers arguing we are far from AGI just show LLMs failing on tasks too big for their context window. Sergey identifies something more fundamental: while LLMs learned to mimic intelligence from internet data, they never had… https://t.co/SGhOh15OIT

Media 1
❀️774
likes
πŸ”88
retweets
πŸ–ΌοΈ Media
S
Shreya Shankar
@sh_reya
πŸ“…
Mon
πŸ†”06014109

new blogpost on writing in the ~glorious~ age of LLMs https://t.co/EmownEiZ9S

Media 1
❀️1,403
likes
πŸ”169
retweets
πŸ–ΌοΈ Media
H
Hamel Husain
@HamelHusain
πŸ“…
Mon
πŸ†”25097636

"You can either actually try different ways to make your search results good, or you can just have seven meetings with VectorDB vendors who tell you their way is the best." - @BEBischof https://t.co/dR23WB2cAl https://t.co/hvMtWcvX5d

❀️34
likes
πŸ”5
retweets
πŸ–ΌοΈ Media
I
Ivan Leo
@ivanleomk
πŸ“…
Mon
πŸ†”49000131

benchmarking kura on lmsys and claude says the clustering is v v good https://t.co/QG2dmFth5y

Media 1
πŸ–ΌοΈ Media
L
LlamaIndex πŸ¦™
@llama_index
πŸ“…
Mon
πŸ†”32491717

Build a multi-agent system with MCP! @microsoft's new AI Travel Agents demo shows how to coordinate multiple AI agents using the Model Context Protocol, LlamaIndex.TS, and @Azure AI Foundry for complex travel planning scenarios. 🎯 Six specialized AI agents work together - from… https://t.co/cNyVAcnf6K

Media 1
❀️278
likes
πŸ”53
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Mon
πŸ†”94745559

McKinsey's new report on AI agents shows the same mindset I see in many firms: a focus on making small, obsolete models do basic work (look at their suggested models!) rather than realizing that smarter models can do higher-end work (and those models are getting cheaper & better) https://t.co/GujvgeACxJ

Media 1
❀️2,146
likes
πŸ”230
retweets
πŸ–ΌοΈ Media
V
vishal
@vishal_learner
πŸ“…
Mon
πŸ†”11496518

Posted a video on how models failing boring tasks, the fastai top-down approach & the GE/AnswerAI "long leash/narrow fence" framework has led me to sign up for the AI Evals course by @HamelHusain & @sh_reya to help me find my moat as I continue to break into the ML industry πŸ‘‡ https://t.co/Vx1BDGHL47

Media 1
❀️7
likes
πŸ”3
retweets
πŸ–ΌοΈ Media
J
Jerry Liu
@jerryjliu0
πŸ“…
Mon
πŸ†”68627047

Our recent β€œpreset-modes” on LlamaParse lets you parse complex figures/charts πŸ“ˆπŸ“Š within research reports into formatted Mermaid diagrams/Markdown tables πŸ“. Some of these charts are extremely complex - a chart can show various lines and curves for different techniques, with… https://t.co/VvU3j6ZVAd

Media 1
❀️19
likes
πŸ”2
retweets
πŸ–ΌοΈ Media
T
Teknium (e/Ξ»)
@Teknium1
πŸ“…
Mon
πŸ†”06600165

Is over https://t.co/Y3k587Aztt

Media 1
❀️220
likes
πŸ”2
retweets
πŸ–ΌοΈ Media
T
Tetraslam
@Tetraslam
πŸ“…
Mon
πŸ†”24170756

arc agi 3 is in development @fchollet https://t.co/jylOLKO0QC

Media 1
❀️85
likes
πŸ”15
retweets
πŸ–ΌοΈ Media
I
John Carmack
@ID_AA_Carmack
πŸ“…
Mon
πŸ†”51705438

I never saw an official video, so here is my little bit from the induction of Quake into the Strong Museum’s World Video Game Hall of Fame last month. https://t.co/gts4sSnocK

❀️8,301
likes
πŸ”1,468
retweets
πŸ–ΌοΈ Media
M
Mark Saroufim
@marksaroufim
πŸ“…
Mon
πŸ†”81647607

Still feels surreal to have been on stage with one of the greatest CEO's of our time Dr Lisa Su where she explicitly called out GPU MODE and the work we did to enable the world's first $100K competitive kernel competition I never in a thousand years would have imagined that a… https://t.co/Pv2GTGWFHS

Media 1
❀️214
likes
πŸ”21
retweets
πŸ–ΌοΈ Media
D
Daniel Dhawan
@daniel_dhawan
πŸ“…
Mon
πŸ†”77199850

Rork grew 0 -> $1M ARR in ~3 months with 4 people 🀯 We grew to $1M faster than 99% of SaaS companies, starting from 2 people, no money, no company and a v0.1. Our product wasn't perfect but people still loved it Make something you want yourself https://t.co/PxBBMCOuTn

Media 1
❀️545
likes
πŸ”30
retweets
πŸ–ΌοΈ Media