Your curated collection of saved posts and media

Showing 32 posts Β· last 14 days Β· by score
I
Ivan Leo
@ivanleomk
πŸ“…
Tue May 27
πŸ†”05454357
⭐0.47

Lol codex literally re-implemented markdown to HTML parsing rather than download react markdown for nextjs. lmfao https://t.co/AggF2zrkMr

Media 1
❀️7
likes
πŸ–ΌοΈ Media
I
Ivan Leo
@ivanleomk
πŸ“…
Tue May 27
πŸ†”19803639
⭐0.46

Damn @AnthropicAI sonnet 4 isn't messing around. This moves with the mouse btw and rotates Artefact here : https://t.co/YRX4GuKW4I https://t.co/jh6gpGrIxq

Media 1
❀️4
likes
πŸ–ΌοΈ Media
L
LlamaIndex πŸ¦™
@llama_index
πŸ“…
Tue May 27
πŸ†”51796302
⭐0.73

Learn how to build a custom multimodal embedder for LlamaIndex! This guide shows you how to: ➑️ Override LlamaIndex's default embedder for AWS Titan Multimodal support ➑️ Create a custom embedding class handling both text and images ➑️ Integrate it with @pinecone for efficient… https://t.co/jBqn7jrMak

Media 1
❀️17
likes
πŸ”2
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Tue May 27
πŸ†”85800849
⭐0.77

NEW: Mistral AI announces Agents API - code execution - web search - MCP tools - persistent memory - agentic orchestration capabilities Cool to see that Mistral AI has joined the growing number of agent frameworks. More below: https://t.co/tjruOmcDP5

Media 1
❀️515
likes
πŸ”76
retweets
πŸ–ΌοΈ Media
I
Ivan Leo
@ivanleomk
πŸ“…
Tue May 27
πŸ†”15736217
⭐0.42

wow wow finally playing around with operator curious to see where this is https://t.co/7W0FLwwPGp

Media 1
❀️4
likes
πŸ–ΌοΈ Media
A
Andrew Ng
@AndrewYNg
πŸ“…
Tue May 27
πŸ†”79170259

Agentic Document Extraction just got much faster! From previous 135sec median processing time down to 8sec. Extracts not just text but diagrams, charts, and form fields from PDFs to give LLM-ready output. Please see the video for details and some application ideas. https://t.co/29lOKf6UGO

❀️3,844
likes
πŸ”607
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Mon
πŸ†”79589631

Just so everyone knows, we have passed the point where you can tell what is AI at a glance (or even, in many cases, a close look) These were all made by me with text prompts alone using Veo 3. https://t.co/RoZ67BYABr

❀️2,933
likes
πŸ”472
retweets
πŸ–ΌοΈ Media
I
Tanishq Mathew Abraham, Ph.D.
@iScienceLuvr
πŸ“…
Tue May 27
πŸ†”06889312

Long-Context State-Space Video World Models "we propose a novel architecture leveraging state-space models (SSMs) to extend temporal memory without compromising computational efficiency. " https://t.co/7KTQwg4veE

Media 1
❀️270
likes
πŸ”30
retweets
πŸ–ΌοΈ Media
J
jason liu
@jxnlco
πŸ“…
Sun
πŸ†”18064323

Bunch of good conversations in today's RAG office hours. Most teams get RAG wrong because they're obsessed with the AI instead of the data. The real money is in finding what users actually need through data analysis. I've seen $100k/month value unlocked just by identifying… https://t.co/SD56uw6ZZU

Media 1
❀️101
likes
πŸ”6
retweets
πŸ–ΌοΈ Media
C
Lewis
@ctjlewis
πŸ“…
Sun
πŸ†”24253391

this company is still such a mystery to me. https://t.co/0jepqHsrL0

Media 1
❀️440
likes
πŸ”4
retweets
πŸ–ΌοΈ Media
J
Jeremy Howard
@jeremyphoward
πŸ“…
Sun
πŸ†”63328432

Just came across this old GigaOM article. I guessed in 2013 it might take ~10 years until "packages like word2vec can make deep learning even for relatively unsophisticated users". Not bad, I think! :D https://t.co/XxtbbeIlLY https://t.co/6PxNQR82t7

Media 1
❀️90
likes
πŸ–ΌοΈ Media
J
Jeremy Howard
@jeremyphoward
πŸ“…
Sun
πŸ†”68102434

ok google https://t.co/vyp1GcHzP5

Media 1
❀️68
likes
πŸ–ΌοΈ Media
P
Paul Gauthier
@paulgauthier
πŸ“…
Sun
πŸ†”97172151

Claude 4 Opus scored 72% on the aider polyglot coding benchmark. Claude 4 Sonnet scored 61%. Both of those are with 32k think tokens. Sonnet 4 seems to have underperformed 3.7. Full leaderboard: https://t.co/mBVaUPG9ZN https://t.co/tj4p5Pn6Tk

Media 1
❀️641
likes
πŸ”62
retweets
πŸ–ΌοΈ Media
Y
YIFENG LIU
@YIFENGLIU_AI
πŸ“…
Mon
πŸ†”71017265

1/6 We introduce RPG, a principled framework for deriving and analyzing KL-regularized policy gradient methods, unifying GRPO/k3-estimator and REINFORCE++ under this framework and discovering better RL objectives than GRPO: Paper: https://t.co/7xSUj01GIx Code:… https://t.co/0pn5sqhhC7

Media 1
❀️199
likes
πŸ”38
retweets
πŸ–ΌοΈ Media
H
Sepp Hochreiter
@HochreiterSepp
πŸ“…
Mon
πŸ†”52597808

xLSTM for the classification of assembly tasks: https://t.co/l3hQTtQ31e "xLSTM model demonstrated better generalization capabilities to new operators. The results clearly show that for this type of classification, the xLSTM model offers a slight edge over Transformers." https://t.co/g5RWU9Bonf

Media 1
❀️155
likes
πŸ”26
retweets
πŸ–ΌοΈ Media
S
Simon Willison
@simonw
πŸ“…
Sun
πŸ†”75158060

I put together an annotated version of the new Claude 4 system prompt, covering both the prompt Anthropic published and the missing, leaked sections (thanks, @elder_plinius) that describe its various tools It's basically the secret missing manual for Claude 4, it's fascinating! https://t.co/qDKnViikkS

Media 1
❀️3,763
likes
πŸ”370
retweets
πŸ–ΌοΈ Media
L
Lior⚑
@LiorOnAI
πŸ“…
Sun
πŸ†”67369367

The best fine-tuning guide you'll find on arXiv this year. Covers: > NLP basics > PEFT/LoRA/QLoRA techniques > Mixture of Experts > Seven-stage fine-tuning pipeline https://t.co/Z7NSBBFvSS

Media 1
❀️1,606
likes
πŸ”245
retweets
πŸ–ΌοΈ Media
Q
Eric Hartford
@QuixiAI
πŸ“…
Sun
πŸ†”35981382

I released a dataset of 10k prompts which are refused by Qwen3, but answered by Llama3.3. This highly diverse data can be used to train a model to comply with Chinese law (or not), testing, evaluation, and activation steering. https://t.co/aKhFC1Bunb

Media 1
❀️520
likes
πŸ”58
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Mon
πŸ†”51701654

I've been reading Bethany Hughes book on the 7 Wonders & the myths of the Colossus of Rhodes (eg it did not span the harbor) As an experiment, I asked Google Deep Research to come make a prompt to describe the historical Colossus for veo 3. Nice, down to the location & riveting https://t.co/VLGzkQnbC3

❀️250
likes
πŸ”16
retweets
πŸ–ΌοΈ Media
H
Hamel Husain
@HamelHusain
πŸ“…
Mon
πŸ†”31624688

How do I debug multi-turn conversation traces? This is our advice - what has worked for you? https://t.co/XYEJj5vZDg

Media 1
❀️93
likes
πŸ”7
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Tue Jun 03
πŸ†”93748649

Which academic fields transitioned to Blue Sky explains much of the transformation in conversations occurring here, far less humanities & economics discussion happens on X compared to before. (And no, I don’t think this was a net good thing for X, at least if you want new ideas) https://t.co/DttfOKQFfS

Media 1
❀️350
likes
πŸ”42
retweets
πŸ–ΌοΈ Media
J
Jerry Liu
@jerryjliu0
πŸ“…
Tue Jun 03
πŸ†”41701470

Practical Applications of AI Agents in Finance πŸ€–πŸ¦ I'm excited to release a set of slides that gives an overview of assistant and automation-based agent architectures and their applications across a broad range of financial verticals - from investment research to back-office to… https://t.co/VsHGA754bU

Media 1
❀️146
likes
πŸ”38
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Mon
πŸ†”48400840

Reasoning Models Thinking Slow and Fast at Test Time Another super cool work on improving reasoning efficiency in LLMs. They show that slow-then-fast reasoning outperforms other strategies. Here are my notes: https://t.co/79XsaYcR8N

Media 1
❀️286
likes
πŸ”64
retweets
πŸ–ΌοΈ Media
J
Jeremy Howard
@jeremyphoward
πŸ“…
Tue Jun 03
πŸ†”91031600

How @seb_ruder made the ULMFiT paper happen πŸ˜‚ (from 2018 https://t.co/GEOZunWoXj student notes: https://t.co/1dX4HpAF4V) https://t.co/sl1r6heZ3B

Media 1
❀️210
likes
πŸ”17
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Tue Jun 03
πŸ†”72616075

An inexplicable failure of Microsoft & Google's AI tools is that they have access to my email but won't actually use their smarts to help me When I ask for "urgent messages," Google just gives me unread ones and Microsoft literally searches for "urgent" Yet Claude does better. https://t.co/yEQhgkAZxH

Media 1Media 2
+1 more
❀️434
likes
πŸ”19
retweets
πŸ–ΌοΈ Media
J
Jeremy Howard
@jeremyphoward
πŸ“…
Tue Jun 03
πŸ†”77704843

What a nice surprise - the MonsterUI component library for FastHTML on the HN front page :) https://t.co/eUcV2ysrrx

Media 1
❀️192
likes
πŸ”7
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Thu May 29
πŸ†”86148584

Claude 4 Opus gets real weird real fast when you ask it to create something "numinous" (and especially when you ask it to make it "more numinous"). This was just the start. The system card discusses the model's tendency towards producing spiritual themes, definitely noticeable. https://t.co/xKELacUK2C

❀️898
likes
πŸ”67
retweets
πŸ–ΌοΈ Media
L
LlamaIndex πŸ¦™
@llama_index
πŸ“…
Thu May 29
πŸ†”35917385

RAG is dead, long live agentic retrieval! At LlamaIndex we've been saying for a long time that naive RAG is not enough for a modern application. Following from that conviction, we've built agentic strategies directly into LlamaCloud that you can adopt with just a few lines of… https://t.co/7Rh7ohDw3x

Media 1
❀️788
likes
πŸ”142
retweets
πŸ–ΌοΈ Media
D
Derya Unutmaz, MD
@DeryaTR_
πŸ“…
Thu May 29
πŸ†”34809216

The agentic AI system I was testing is from @perplexity_ai Labs, which has just launched. My full prompt and the agent’s outputs are now featured on their website (link below). Overall, it’s an excellent and innovative agentic system that takes about 5–10 minutes to run. I’ll… https://t.co/meumSz7NlN

Media 1
❀️198
likes
πŸ”15
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Thu May 29
πŸ†”98423874

Neat: "Claude 4 create a game with a completely novel mechanic. start with 20 different ideas and narrow them down" Its idea was for players to "steal, store & redistribute physical properties between objects and themselves." It built this demo & fixed a couple of bugs I found. https://t.co/y2G1QsNAIi

❀️290
likes
πŸ”14
retweets
πŸ–ΌοΈ Media
S
Shashwat Goel
@ShashwatGoel7
πŸ“…
Thu May 29
πŸ†”83972675

Confused about recent LLM RL results where models improve without any ground-truth signal? We were too. Until we looked at the reported numbers of the Pre-RL models and realized they were serverely underreported across papers. We compiled discrepancies in a blog belowπŸ§΅πŸ‘‡ https://t.co/Hmn41grrrh

Media 1
❀️879
likes
πŸ”126
retweets
πŸ–ΌοΈ Media
B
Timothy B. Lee
@binarybits
πŸ“…
Thu May 29
πŸ†”76214011

I don't think people appreciate how much Anthropic has been running the table in the market for coding tools over the last year. https://t.co/wVKF6MdsdF

Media 1
❀️140
likes
πŸ”10
retweets
πŸ–ΌοΈ Media