Your curated collection of saved posts and media

Showing 32 posts · last 14 days · by score

Andrew Ng
@AndrewYNg
📅 Wed
🆔 92929149

New Course: Reinforcement Fine-Tuning LLMs with GRPO! Learn to use reinforcement learning to improve your LLM performance in this short course, built in collaboration with @Predibase, and taught by @TravisAddair, its Co-Founder and CTO, and @grg_arnav, its Senior Engineer and… https://t.co/j5AXn3swAD

❤️ 1,263 likes
🔁 183 retweets

LlamaIndex 🦙
@llama_index
📅 Wed
🆔 11217366

How do you manage a monorepo of 650+ community packages? Learn how we did it, including build our own open-source build management tool, LlamaDev! In this post, we'll cover how we migrated away from Poetry and Pants to uv and LlamaDev for faster, simpler development: ➡️ 20%… https://t.co/dTHUcQl9Pb

❤️ 99 likes
🔁 10 retweets

adi
@adidoit
📅 Wed
🆔 05554006

AI evals remind me of Bungay's three gaps - knowledge, alignment and effects in 'The Art of Action' AI Systems are like complex organizations IMO and techniques from systems/complexity theory help! h/t for the image on the left from @HamelHusain and @sh_reya AI Evals course https://t.co/JPblJcDGlR

❤️ 3 likes
🔁 2 retweets

Rishi Jha
@rishi_d_jha
📅 Wed
🆔 68910340

I'm stoked to share our new paper: "Harnessing the Universal Geometry of Embeddings" with @jxmnop, Collin Zhang, and @shmatikov. We present the first method to translate text embeddings across different spaces without any paired data or encoders. Here's why we're excited: 🧵👇🏾 https://t.co/FtQ7sYpWnV

❤️ 1,786 likes
🔁 270 retweets

Sayak Paul
@RisingSayak
📅 Wed
🆔 70540843

Nothing special just sharing a "bag of tricks" on the OG DiT architecture, inspired by the work on ModernBERT. About 2x less params, 43% better throughput, with longer training, FID improves. Felt nice, won't delete later. https://t.co/62jixLmPzH

❤️ 106 likes
🔁 13 retweets

Teknium (e/λ)
@Teknium1
📅 Thu May 22
🆔 17874305

Psyche's 40B run is only just beginning and already getting great signal! https://t.co/M4xTxa337l

❤️ 229 likes
🔁 10 retweets

Ethan Mollick
@emollick
📅 Thu May 22
🆔 46786206

Updated paper by physicians at Harvard, Stanford, and other academic medical centers testing o1-preview for medical reasoning & diagnosis tasks: "In all experiments, both vignettes and emergency room second opinions, the LLM displayed superhuman diagnostic and reasoning abilities." https://t.co/J3i549kMDK

❤️ 1,226 likes
🔁 216 retweets

Ivan Leo
@ivanleomk
📅 Thu May 22
🆔 37748708

Instructor will support responses and MCP servers in the next release https://t.co/Qyjeh695Wi

❤️ 5 likes

Aran Komatsuzaki
@arankomatsuzaki
📅 Thu May 22
🆔 98145972

Tencent presents Hunyuan-TurboS - Hybrid Transformer-Mamba MoE (56B active params) trained on 16T tokens - Dynamically switching between rapid responses and deep "thinking" modes - Overall top 7 on LMSYS Chatbot Arena https://t.co/cUkJznZesL

❤️ 115 likes
🔁 20 retweets

Xin Eric Wang
@xwang_lk
📅 Thu May 22
🆔 92301170

Humans think fluidly, navigating abstract concepts effortlessly, free from rigid linguistic boundaries. But current reasoning models remain constrained by discrete tokens, limiting their full… https://t.co/yt1KfjrNEO

❤️ 932 likes
🔁 139 retweets

Jake Boggs
@JakeABoggs
📅 Thu May 22
🆔 22018714

This past weekend I won 2nd place & $5000 at the @NousResearch RL Hackathon with my project VR-CLImax I implemented VR-CLI (Verified Rewards via Completion Likelihood Improvement) inside an Atropos environment to teach an LLM humor understanding using jokes scraped from Reddit… https://t.co/fSoUcX0QA2

❤️ 134 likes
🔁 9 retweets

Teknium (e/λ)
@Teknium1
📅 Thu May 22
🆔 57122807

Okay I couldn't sleep so doing a write up - I had Jules by Google implement the SWE RL environment into Atropos, Nous' open source RL Environments framework, the environment was described by Meta's paper: https://t.co/t11YYB19B4 The paper outlines similarity as the reward… https://t.co/xJMX2XbuGa

❤️ 95 likes
🔁 4 retweets

Ivan Leo
@ivanleomk
📅 Thu May 22
🆔 51376991

While this isn't perfect, I have tried to make my commands a lot more detailed. Using something like @WisprFlow helps a lot here, even if not everything is transcribed 100% correctly https://t.co/rVFdz2vRxl

❤️ 4 likes

Tanishq Mathew Abraham, Ph.D.
@iScienceLuvr
📅 Thu May 22
🆔 70710765

MMaDA: Multimodal Large Diffusion Language Models "We introduce MMaDA, a novel class of multimodal diffusion foundation models designed to achieve superior performance across diverse domains such as textual reasoning, multimodal understanding, and text-to-image generation"… https://t.co/59Imdks3zM

❤️ 114 likes
🔁 20 retweets

Charles 🎉 Frye
@charles_irl
📅 Wed
🆔 68691724

The modal-examples repo contains >100 Python files that show how to use Modal for everything from AI to ETL to WebRTC. And the samples run, with two nines of reliability. But the repo is maintained by just one devrel, me, part-time. Here's how we do it (not AI): https://t.co/ZdRHEyFGCS

❤️ 131 likes
🔁 13 retweets

Ethan Mollick
@emollick
📅 Wed
🆔 82484418

Veo 3: "a big broadway musical about garlic bread, with elaborate costumes and a sondheim-like vibe" https://t.co/IOypw2tZwQ

❤️ 831 likes
🔁 47 retweets

elvis
@omarsar0
📅 Wed
🆔 75222186

Efficiency in LLMs Pay attention, devs. This is one of the most comprehensive benchmarks to date on improving the efficiency of LLMs. You don't see reports like this every day. Here are my notes: https://t.co/4sOAcs0vDQ

❤️ 755 likes
🔁 124 retweets

jason liu
@jxnlco
📅 Tue May 20
🆔 48727950

my guy is on it https://t.co/zOzCdkr6TW

❤️ 19 likes

LlamaIndex 🦙
@llama_index
📅 Tue May 20
🆔 30438443

Last chance to sign up 👇 On May 29th, join @jerryjliu0 for an exclusive hands-on workshop in NY, on building agent workflows for financial analysis, due diligence, and more! Sign up here before it's too late: https://t.co/geDdBoe9aL https://t.co/kucxFKSUTo

❤️ 14 likes
🔁 2 retweets

Greg Ceccarelli
@gregce10
📅 Tue May 20
🆔 06615070

@HamelHusain and @sh_reya are spitting truth https://t.co/366LtrPL6C

❤️ 16 likes
🔁 2 retweets

Logan Kilpatrick
@OfficialLoganK
📅 Tue May 20
🆔 49768277

Gemini 2.5 Pro with "Deep Think", our models just keep getting more SOTA, more to share soon : ) https://t.co/o2xkz678VZ

❤️ 2,929 likes
🔁 211 retweets

jason liu
@jxnlco
📅 Thu May 08
🆔 16281394

cursor writes my pr bodies. so clean https://t.co/gMmtrSbKHM

❤️ 12 likes

Tanishq Mathew Abraham, Ph.D.
@iScienceLuvr
📅 Sat
🆔 98275625

AI timeline starting from 2022, so much has happened since then... https://t.co/9YrOKmh091

❤️ 110 likes
🔁 10 retweets

LlamaIndex 🦙
@llama_index
📅 Tue Apr 29
🆔 91475054

PapersChat is an agentic AI application that allows you to chat with your papers and gather also information from papers on @arxiv and on PubMed. Powered by @llama_index, @qdrant_engine and @mistralai! ➡️ Indexes all your papers ➡️ Provides a nifty web UI to query them ➡️… https://t.co/lYwXh27F9x

❤️ 207 likes
🔁 45 retweets

Ethan Mollick
@emollick
📅 Tue Apr 29
🆔 14359802

Kind of surprised that o3 is pretty good at poetry parodies compared to Claude 3.7. The new version is surprisingly literal compared to Claude 3.5, "The Destruction of Sennacherib, but for garlic bread" https://t.co/x82ile8QdQ

❤️ 82 likes
🔁 3 retweets

gm8xx8
@gm8xx8
📅 Tue Apr 29
🆔 97397927

Atropos: a fully sovereign RL framework for frontier LLM training another strong open-source release from NOUS with full examples, environments, and trainer scripts https://t.co/xYcKkGdgZo

❤️ 53 likes
🔁 11 retweets

BlinkDL
@BlinkDL_AI
📅 Tue Apr 29
🆔 30461513

RWKV7-G1 "GooseOne" 🪿 1.5B release: pure RNN (attention-free) reasoning model, comparable with Qwen3 1.7B and fully multilingual. Chat demo & download on https://t.co/fZ7rmVKsKj Larger G1 training in progress. https://t.co/pGi060E0RY

❤️ 175 likes
🔁 33 retweets

John B. Holbein
@JohnHolbein1
📅 Tue Apr 29
🆔 29368781

"Among articles stating that data was available upon request, only 17% shared data upon request." https://t.co/YCuC5vONtO

❤️ 2,229 likes
🔁 324 retweets

Daniel Svonava
@svonava
📅 Tue Apr 29
🆔 86349331

Our Mixture of Experts embeddings enable e-com / travel / marketplace companies to build their own version of this: https://t.co/VQAmdMAoYP

❤️ 7 likes
🔁 1 retweets

Teknium (e/λ)
@Teknium1
📅 Tue Apr 29
🆔 26454548

Today at Nous we released our RL Environments Gym - Atropos. With it we've been able to train impressive models like our tool calling specialist that saw a 5x improvement on the @berkeley_ai function calling benchmark and several other models that we've released as artifacts on… https://t.co/Ereuqv5rE9

❤️ 372 likes
🔁 39 retweets

elvis
@omarsar0
📅 Wed
🆔 61818478

Just when I thought I'd seen everything about CoT. Chain of Recursive Thought doesn't sound like a novel idea, but it is a nice trick to make LLMs think harder. It works like a meta-prompt with a recursive component. https://t.co/qIJx4EVR8f

❤️ 211 likes
🔁 38 retweets

Ethan Mollick
@emollick
📅 Tue Apr 29
🆔 49329813

Deep Research with Gemini 2.5 has become very good. It spontaneously generates tables, scenarios, and compiles evidence. Haven't spotted errors in spot checks. https://t.co/4JjVulx8v0

❤️ 1,073 likes
🔁 87 retweets