Your curated collection of saved posts and media

Showing 32 posts · last 14 days · by score

elvis · @omarsar0 · 📅 Sun · 🆔 46421333

265 pages of everything you need to know about building AI agents. 5 things that stood out to me about this report: https://t.co/7LKSGvCHlj

❤️ 2,429 likes · 🔁 428 retweets · 🖼️ Media

Lior⚡ · @LiorOnAI · 📅 Sun · 🆔 86781567

Yann LeCun: "We're never going to get to human level AI by just training on text" https://t.co/ucDtggCwLO

❤️ 697 likes · 🔁 89 retweets · 🖼️ Media

Chris Albon · @chrisalbon · 📅 Sun · 🆔 27102814

Wow. Researchers secretly used an AI on Reddit’s r/changemyview/ as part of an unauthorized experiment https://t.co/QKpZNI9Q3Z

❤️ 224 likes · 🔁 23 retweets · 🖼️ Media

Eugene Yan · @eugeneyan · 📅 Sun · 🆔 28830081

The Art of Doing Science and Engineering: Learning to Learn by Richard Hamming only $1.99 for the Kindle version today: https://t.co/VMmyoj1Z61 https://t.co/JOzhXdO4J5

❤️ 52 likes · 🔁 5 retweets · 🖼️ Media

Lucas Beyer (bl16) · @giffmana · 📅 Sat · 🆔 62057867

With first Claude and now Gemini playing Pokemon, I was thinking of doing my own game-playing experiment over the weekend. However, I quickly learned that it's very far from the VLA-style "pixels->plan" that I naively thought it was, and wanted to do myself. It's like 90%… https://t.co/5cVrbmYArc

❤️ 1,062 likes · 🔁 90 retweets · 🖼️ Media

j⧉nus · @repligate · 📅 Sat · 🆔 53349006

It gets worse. Following the rotating 'spiral' incident, a concerning reply appeared from CLAUDE itself to the simulated post: Translating the binary yielded: When I attempted to view more replies, the entire simulated Twitter staging site transformed into a very different… https://t.co/DpE5JcSBOZ

❤️ 210 likes · 🔁 25 retweets · 🖼️ Media

Tanishq Mathew Abraham, Ph.D. · @iScienceLuvr · 📅 Sun · 🆔 99388394

GPT-4o these days https://t.co/5p4FJXpJQ1

❤️ 76 likes · 🔁 6 retweets · 🖼️ Media

frye · @___frye · 📅 Sun · 🆔 95417159

this seems pretty bad actually https://t.co/JGbmmyblqh

❤️ 29,044 likes · 🔁 863 retweets · 🖼️ Media

Zhao Tianyu · @ZhaoTing1024 · 📅 Mon · 🆔 80497416

Official announcement: Qwen 3 this week. Reasoning and non-reasoning in one. https://t.co/4xPTnfRvON

❤️ 793 likes · 🔁 88 retweets · 🖼️ Media

Sebastian Ruder @ ACL · @seb_ruder · 📅 Mon · 🆔 41239173

The Sparse Frontier Efficient sparse attention methods are key to scale LLMs to long contexts. We conduct the largest-scale empirical analysis that answers: 1. 🤏🔍 Are small dense models or large sparse models better? 2. ♾️ What is the maximum permissible sparsity per task? 3.… https://t.co/prWfrljmzQ

❤️ 187 likes · 🔁 30 retweets · 🖼️ Media

Teknium (e/λ) · @Teknium1 · 📅 Mon · 🆔 48312386

The new RLHF gave chatgpt a new job https://t.co/HJDsKKXuDg

❤️ 186 likes · 🔁 2 retweets · 🖼️ Media

Ethan Mollick · @emollick · 📅 Mon · 🆔 24411544

For most people in most circumstances, developing the habit of asking o3 or Gemini 2.5 about anything that confuses or intrigues you is often good. Specific curiosity (seeking out answers for things we don't know and following up on learnings) is a good trait & the models do well https://t.co/yqSJ8FimZ8

❤️ 694 likes · 🔁 54 retweets · 🖼️ Media

Zack Witten · @zswitten · 📅 Mon · 🆔 84843426

Here's a (memory-free) convo with GPT 4o to make this more concrete https://t.co/7Vmq4JI3rp

❤️ 3,837 likes · 🔁 185 retweets · 🖼️ Media

Tanishq Mathew Abraham, Ph.D. · @iScienceLuvr · 📅 Mon · 🆔 12645186

HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding? "we introduce HRScene, a novel unified benchmark for HRI understanding with rich scenes. HRScene incorporates 25 real-world datasets and 2 synthetic diagnostic datasets with resolutions ranging from… https://t.co/0o3bS1hceR

❤️ 35 likes · 🔁 11 retweets · 🖼️ Media

Ethan Mollick · @emollick · 📅 Mon · 🆔 61193530

"o3, You are a consultant hired by the Dark Lord, analyze the org chart of Mordor. How would you improve it for today's changing Middle Earth" o3 does some actual humor: “One Org to rule them all, One Org to find them, One Org to bring them all, And in the darkness, align them.” https://t.co/JxyaPGySeA

❤️ 896 likes · 🔁 77 retweets · 🖼️ Media

Chip Huyen · @chipro · 📅 Sat · 🆔 29051557

I think I might've accidentally jailbreak-ed Claude. I asked Claude to craft a conversation between two characters for a story I'm writing and they suddenly started reciting Claude's system instructions. https://t.co/fmbilFhR7n

❤️ 768 likes · 🔁 64 retweets · 🖼️ Media

Ethan Mollick · @emollick · 📅 Sat · 🆔 10111468

Controlled trials keep finding that there are real benefits for doctors using LLMs to explain upcoming procedures & get informed consent. In this study, patients asking questions of ChatGPT-4 had lower levels of anxiety. (Doctors vetted the answers, which were all "excellent") https://t.co/Vo5ROQJNt4

❤️ 424 likes · 🔁 54 retweets · 🖼️ Media

Shreya Shankar · @sh_reya · 📅 Sat · 🆔 71776572

experimenting with meme driven marketing 🤣 join our course so you won’t be the person on the left! https://t.co/IafuxymqvR

❤️ 47 likes · 🔁 5 retweets · 🖼️ Media

Ethan Mollick · @emollick · 📅 Sat · 🆔 86196440

I don’t mean to be a broken record but AI development could stop at the o3/Gemini 2.5 level and we would have a decade of major changes across entire professions & industries (medicine, law, education, coding…) as we figure out how to actually use it. AI disruption is baked in. https://t.co/sLab2kczZx

❤️ 3,178 likes · 🔁 384 retweets · 🖼️ Media

Tanishq Mathew Abraham, Ph.D. · @iScienceLuvr · 📅 Sat · 🆔 68243630

o3's combined reasoning and tool use abilities enable it to be excellent at geoguessing. Be careful what pics you share online! (example from @simonw's recent blog post) https://t.co/RysJy5ilWI

❤️ 25 likes · 🖼️ Media

Ethan Mollick · @emollick · 📅 Sun · 🆔 93508264

On one hand the new GPT-4o isn’t doing as many emojis. On the other, it is slowly driving me insane by responding to everything like an overly enthusiastic 1990s teenager. https://t.co/OEK0HhZmH2

❤️ 709 likes · 🔁 18 retweets · 🖼️ Media

🇺🇦 Dzmitry Bahdanau · @DBahdanau · 📅 Sat · 🆔 92652746

I am excited to open-source PipelineRL - a scalable async RL implementation with in-flight weight updates. Why wait until your bored GPUs finish all sequences? Just update the weights and continue inference! Code: https://t.co/AgEyxXb7Xi Blog: https://t.co/n4FRxiEcrr https://t.co/DkjI9Snz9g

❤️ 509 likes · 🔁 114 retweets · 🖼️ Media

Ivan Leo · @ivanleomk · 📅 Sun · 🆔 31874589

haha 20k words in prob the last month https://t.co/8ax4Y4uYsN

❤️ 4 likes · 🖼️ Media

Teknium (e/λ) · @Teknium1 · 📅 Sun · 🆔 14950687

Okay come on.. lmao https://t.co/L4d3sfiSi0

❤️ 382 likes · 🔁 8 retweets · 🖼️ Media

Lior⚡ · @LiorOnAI · 📅 Sat · 🆔 28643541

No more needle in a haystack. LLMs now retrieve over 1M tokens: no RAG, no retraining. Infinite Retrieval introduces InfiniRetri, a method that uses transformer attention to handle infinite-length contexts natively. No toolkits. No memory modules. https://t.co/JTxlJweASn

❤️ 1,070 likes · 🔁 133 retweets · 🖼️ Media

Ethan Mollick · @emollick · 📅 Sun · 🆔 53135140

https://t.co/uktfew0HhF

❤️ 1,015 likes · 🔁 57 retweets · 🖼️ Media

Paweł Huryn · @PawelHuryn · 📅 Sun · 🆔 05401291

AI Evals are the most important skill for AI PMs. But there are many misconceptions. A complete guide on AI Evals: 🧵 https://t.co/H7xOGPzzUH

❤️ 396 likes · 🔁 42 retweets · 🖼️ Media

LlamaIndex 🦙 · @llama_index · 📅 Fri · 🆔 96403125

There has been a lot of productive dialogue recently about what agents are, and the best way to build them: @AnthropicAI published Building Effective Agents, @dexhorthy went viral with his 12 Factor Agents, and @OpenAI released A Practical Guide To Building Agents. We've been… https://t.co/AfgH5KeRPJ

❤️ 33 likes · 🔁 4 retweets · 🖼️ Media

Jerry Liu · @jerryjliu0 · 📅 Fri · 🆔 66039581

There’s been a lot of discussions on the best way to build agents:
- @OpenAI’s general take is that increased model capabilities simplify the SDK
- Others (incl. @Anthropic’s agent pattern guide + @dexhorthy) generally outline a more constrained approach.

We’ve been asked by a… https://t.co/SWyMBDIFKb

❤️ 151 likes · 🔁 21 retweets · 🖼️ Media

Gary Marcus · @GaryMarcus · 📅 Fri · 🆔 36105436

God I wish @elonmusk would have taken this bet. https://t.co/WgzsDG5HwK

❤️ 1,584 likes · 🔁 84 retweets · 🖼️ Media

Joscha Bach · @Plinz · 📅 Fri · 🆔 71247927

As a Gary Marcus, I would be very careful about what I say about LLMs and AGI in 2025 https://t.co/EMmbkoLaC3

❤️ 112 likes · 🔁 2 retweets · 🖼️ Media

Marius · @rasmus1610 · 📅 Tue Apr 22 · 🆔 01301739

Building LLM apps feels different, right? Their opaque nature makes predicting outputs and behavior really tough. 🤔 So how do you move beyond guesswork and actually improve your LLM-based products reliably? https://t.co/gx2oL40GLM

❤️ 27 likes · 🔁 9 retweets · 🖼️ Media