Your curated collection of saved posts and media

Recent Top

Showing 32 posts · last 14 days · by score

🖼️ Media

A

Andrew Ng

@AndrewYNg

📅

Wed

🆔92929149

New Course: Reinforcement Fine-Tuning LLMs with GRPO! Learn to use reinforcement learning to improve your LLM performance in this short course, built in collaboration with @Predibase, and taught by @TravisAddair, its Co-Founder and CTO, and @grg_arnav, its Senior Engineer and… https://t.co/j5AXn3swAD

❤️1,263

likes

🔁183

retweets

🖼️ Media

View Details View on X ↗

L

LlamaIndex 🦙

@llama_index

📅

Wed

🆔11217366

How do you manage a monorepo of 650+ community packages? Learn how we did it, including build our own open-source build management tool, LlamaDev! In this post, we'll cover how we migrated away from Poetry and Pants to uv and LlamaDev for faster, simpler development: ➡️ 20%… https://t.co/dTHUcQl9Pb

❤️99

likes

🔁10

retweets

🖼️ Media

View Details View on X ↗

A

adi

@adidoit

📅

Wed

🆔05554006

AI evals remind me of Bungay's three gaps - knowledge, alignment and effects in 'The Art of Action' AI Systems are like complex organizations IMO and techniques from systems/complexity theory help! h/t for the image on the left from @HamelHusain and @sh_reya AI Evals course https://t.co/JPblJcDGlR

❤️3

likes

🔁2

retweets

🖼️ Media

View Details View on X ↗

R

Rishi Jha

@rishi_d_jha

📅

Wed

🆔68910340

I’m stoked to share our new paper: “Harnessing the Universal Geometry of Embeddings” with @jxmnop, Collin Zhang, and @shmatikov. We present the first method to translate text embeddings across different spaces without any paired data or encoders. Here's why we're excited: 🧵👇🏾 https://t.co/FtQ7sYpWnV

❤️1,786

likes

🔁270

retweets

🖼️ Media

View Details View on X ↗

R

Sayak Paul

@RisingSayak

📅

Wed

🆔70540843

Nothing special just sharing a "bag of tricks" on the OG DiT architecture, inspired by the work on ModernBERT. About 2x less params, 43% better throughput, with longer training, FID improves. Felt nice, won't delete later. https://t.co/62jixLmPzH

❤️106

likes

🔁13

retweets

🖼️ Media

View Details View on X ↗

T

Teknium (e/λ)

@Teknium1

📅

Thu May 22

🆔17874305

Psyche's 40B run is only just beginning and already getting great signal! https://t.co/M4xTxa337l

❤️229

likes

🔁10

retweets

🖼️ Media

View Details View on X ↗

E

Ethan Mollick

@emollick

📅

Thu May 22

🆔46786206

Updated paper by physicians at Harvard, Stanford, and other academic medical centers testing o1-preview for medical reasoning & diagnosis tasks: “In all experiments—both vignettes and emergency room second opinions—the LLM displayed superhuman diagnostic and reasoning abilities.” https://t.co/J3i549kMDK

+1 more

❤️1,226

likes

🔁216

retweets

🖼️ Media

View Details View on X ↗

I

Ivan Leo

@ivanleomk

📅

Thu May 22

🆔37748708

Instructor will support responses and MCP servers in the next release https://t.co/Qyjeh695Wi

❤️5

likes

🖼️ Media

View Details View on X ↗

A

Aran Komatsuzaki

@arankomatsuzaki

📅

Thu May 22

🆔98145972

Tencent presents Hunyuan-TurboS - Hybrid Transformer-Mamba MoE (56B active params) trained on 16T tokens - Dynamically switching between rapid responses and deep ”thinking” modes - Overall top 7 on LMSYS Chatbot Arena https://t.co/cUkJznZesL

❤️115

likes

🔁20

retweets

🖼️ Media

View Details View on X ↗

X

Xin Eric Wang

@xwang_lk

📅

Thu May 22

🆔92301170

𝘏𝘶𝘮𝘢𝘯𝘴 𝘵𝘩𝘪𝘯𝘬 𝘧𝘭𝘶𝘪𝘥𝘭𝘺—𝘯𝘢𝘷𝘪𝘨𝘢𝘵𝘪𝘯𝘨 𝘢𝘣𝘴𝘵𝘳𝘢𝘤𝘵 𝘤𝘰𝘯𝘤𝘦𝘱𝘵𝘴 𝘦𝘧𝘧𝘰𝘳𝘵𝘭𝘦𝘴𝘴𝘭𝘺, 𝘧𝘳𝘦𝘦 𝘧𝘳𝘰𝘮 𝘳𝘪𝘨𝘪𝘥 𝘭𝘪𝘯𝘨𝘶𝘪𝘴𝘵𝘪𝘤 𝘣𝘰𝘶𝘯𝘥𝘢𝘳𝘪𝘦𝘴. But current reasoning models remain constrained by discrete tokens, limiting their full… https://t.co/yt1KfjrNEO

❤️932

likes

🔁139

retweets

🖼️ Media

View Details View on X ↗

J

Jake Boggs

@JakeABoggs

📅

Thu May 22

🆔22018714

This past weekend I won 2nd place & $5000 at the @NousResearch RL Hackathon with my project VR-CLImax I implemented VR-CLI (Verified Rewards via Completion Likelihood Improvement) inside an Atropos environment to teach an LLM humor understanding using jokes scraped from Reddit… https://t.co/fSoUcX0QA2

❤️134

likes

🔁9

retweets

🖼️ Media

View Details View on X ↗

T

Teknium (e/λ)

@Teknium1

📅

Thu May 22

🆔57122807

Okay I couldn't sleep so doing a write up - I had Jules by Google implement the SWE RL environment into Atropos, Nous' open source RL Environments framework, the environment was described by Meta's paper: https://t.co/t11YYB19B4 The paper outlines similarity as the reward… https://t.co/xJMX2XbuGa

❤️95

likes

🔁4

retweets

🖼️ Media

View Details View on X ↗

I

Ivan Leo

@ivanleomk

📅

Thu May 22

🆔51376991

While this isn't perfect, I have tried to make my commands a lot more detailed. Using something like @WisprFlow helps a lot here, even if not everything is transcribed 100% correctly https://t.co/rVFdz2vRxl

❤️4

likes

🖼️ Media

View Details View on X ↗

I

Tanishq Mathew Abraham, Ph.D.

@iScienceLuvr

📅

Thu May 22

🆔70710765

MMaDA: Multimodal Large Diffusion Language Models "We introduce MMaDA, a novel class of multimodal diffusion foundation models designed to achieve superior performance across diverse domains such as textual reasoning, multimodal understanding, and text-to-image generation"… https://t.co/59Imdks3zM

❤️114

likes

🔁20

retweets

🖼️ Media

View Details View on X ↗

C

Charles 🎉 Frye

@charles_irl

📅

Wed

🆔68691724

The modal-examples repo contains >100 Python files that show how to use Modal for everything from AI to ETL to WebRTC. And the samples run, with two nines of reliability. But the repo is maintained by just one devrel, me, part-time. Here's how we do it (not AI): https://t.co/ZdRHEyFGCS

❤️131

likes

🔁13

retweets

🖼️ Media

View Details View on X ↗

E

Ethan Mollick

@emollick

📅

Wed

🆔82484418

Veo 3: "a big broadway musical about garlic bread, with elaborate costumes and a sondheim-like vibe" https://t.co/IOypw2tZwQ

❤️831

likes

🔁47

retweets

🖼️ Media

View Details View on X ↗

O

elvis

@omarsar0

📅

Wed

🆔75222186

Efficiency in LLMs Pay attention, devs. This is one of the most comprehensive benchmarks to date on improving the efficiency of LLMs. You don't see reports like this every day. Here are my notes: https://t.co/4sOAcs0vDQ

❤️755

likes

🔁124

retweets

🖼️ Media

View Details View on X ↗

J

jason liu

@jxnlco

📅

Tue May 20

🆔48727950

my guy is on it https://t.co/zOzCdkr6TW

❤️19

likes

🖼️ Media

View Details View on X ↗

L

LlamaIndex 🦙

@llama_index

📅

Tue May 20

🆔30438443

Last chance to sign up 👇 On May 29th, join @jerryjliu0 for an exclusive hands-on workshop in NY, on building agent workflows for financial analysis, due diligence, and more! Sign up here before it's too late: https://t.co/geDdBoe9aL https://t.co/kucxFKSUTo

❤️14

likes

🔁2

retweets

🖼️ Media

View Details View on X ↗

G

Greg Ceccarelli

@gregce10

📅

Tue May 20

🆔06615070

@HamelHusain and @sh_reya are spitting truth https://t.co/366LtrPL6C

❤️16

likes

🔁2

retweets

🖼️ Media

View Details View on X ↗

O

Logan Kilpatrick

@OfficialLoganK

📅

Tue May 20

🆔49768277

Gemini 2.5 Pro with "Deep Think", our models just keep getting more SOTA, more to share soon : ) https://t.co/o2xkz678VZ

❤️2,929

likes

🔁211

retweets

🖼️ Media

View Details View on X ↗

J

jason liu

@jxnlco

📅

Thu May 08

🆔16281394

cursor writes my pr bodies. so clean https://t.co/gMmtrSbKHM

❤️12

likes

🖼️ Media

View Details View on X ↗

I

Tanishq Mathew Abraham, Ph.D.

@iScienceLuvr

📅

Sat

🆔98275625

AI timeline starting from 2022, so much has happened since then... https://t.co/9YrOKmh091

❤️110

likes

🔁10

retweets

🖼️ Media

View Details View on X ↗

L

LlamaIndex 🦙

@llama_index

📅

Tue Apr 29

🆔91475054

PapersChat is an agentic AI application that allows you to chat with your papers and gather also information from papers on @arxiv and on PubMed. Powered by @llama_index, @qdrant_engine and @mistralai! ➡️ Indexes all your papers ➡️ Provides a nifty web UI to query them ➡️… https://t.co/lYwXh27F9x

❤️207

likes

🔁45

retweets

🖼️ Media

View Details View on X ↗

E

Ethan Mollick

@emollick

📅

Tue Apr 29

🆔14359802

Kind of surprised that o3 is pretty good at poetry parodies compared to Claude 3.7. The new version is surprisingly literal compared to Claude 3.5, "The Destruction of Sennacherib, but for garlic bread" https://t.co/x82ile8QdQ

+1 more

❤️82

likes

🔁3

retweets

🖼️ Media

View Details View on X ↗

G

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8

📅

Tue Apr 29

🆔97397927

Atropos: a fully sovereign RL framework for frontier LLM training another strong open-source release from 𝐍𝐎𝐔𝐒 with full examples, environments, and trainer scripts https://t.co/xYcKkGdgZo

❤️53

likes

🔁11

retweets

🖼️ Media

View Details View on X ↗

B

BlinkDL

@BlinkDL_AI

📅

Tue Apr 29

🆔30461513

RWKV7-G1 "GooseOne" 🪿 1.5B release: pure RNN (attention-free) reasoning model, comparable with Qwen3 1.7B and fully multilingual. Chat demo & download on https://t.co/fZ7rmVKsKj Larger G1 training in progress. https://t.co/pGi060E0RY

❤️175

likes

🔁33

retweets

🖼️ Media

View Details View on X ↗

J

John B. Holbein

@JohnHolbein1

📅

Tue Apr 29

🆔29368781

“Among articles stating that data was available upon request, only 17% shared data upon request.” https://t.co/YCuC5vONtO

❤️2,229

likes

🔁324

retweets

🖼️ Media

View Details View on X ↗

S

Daniel Svonava

@svonava

📅

Tue Apr 29

🆔86349331

Our Mixture of Experts embeddings enable e-com / travel / marketplace companies to build their own version of this: https://t.co/VQAmdMAoYP

❤️7

likes

🔁1

retweets

🖼️ Media

View Details View on X ↗

T

Teknium (e/λ)

@Teknium1

📅

Tue Apr 29

🆔26454548

Today at Nous we released our RL Environments Gym - Atropos. With it we've been able to train impressive models like our tool calling specialist that saw a 5x improvement on the @berkeley_ai function calling benchmark and several other models that we've released as artifacts on… https://t.co/Ereuqv5rE9

❤️372

likes

🔁39

retweets

🖼️ Media

View Details View on X ↗

O

elvis

@omarsar0

📅

Wed

🆔61818478

Just when I thought I'd seen everything about CoT. Chain of Recursive Thought doesn't sound like a novel idea, but it is a nice trick to make LLMs think harder. It works like a meta-prompt with a recursive component. https://t.co/qIJx4EVR8f

❤️211

likes

🔁38

retweets

🖼️ Media

View Details View on X ↗

E

Ethan Mollick

@emollick

📅

Tue Apr 29

🆔49329813

Deep Research with Gemini 2.5 has become very good. It spontaneously generates tables, scenarios, and compiles evidence. Haven’t spotted errors in spot checks. https://t.co/4JjVulx8v0

+2 more

❤️1,073

likes

🔁87

retweets

🖼️ Media

View Details View on X ↗

← PreviousPage 579 of 656Next →