Ivan Leo

@ivanleomk

📅

Tue May 27

🆔05454357

⭐0.47

Lol codex literally re-implemented markdown to HTML parsing rather than download react markdown for nextjs. lmfao https://t.co/AggF2zrkMr

❤️7

likes


Ivan Leo

@ivanleomk

📅

Tue May 27

🆔19803639

⭐0.46

Damn @AnthropicAI sonnet 4 isn't messing around. This moves with the mouse btw and rotates Artefact here : https://t.co/YRX4GuKW4I https://t.co/jh6gpGrIxq

❤️4

likes


LlamaIndex 🦙

@llama_index

📅

Tue May 27

🆔51796302

⭐0.73

Learn how to build a custom multimodal embedder for LlamaIndex! This guide shows you how to: ➡️ Override LlamaIndex's default embedder for AWS Titan Multimodal support ➡️ Create a custom embedding class handling both text and images ➡️ Integrate it with @pinecone for efficient… https://t.co/jBqn7jrMak

❤️17

likes

🔁2

retweets


elvis

@omarsar0

📅

Tue May 27

🆔85800849

⭐0.77

NEW: Mistral AI announces Agents API - code execution - web search - MCP tools - persistent memory - agentic orchestration capabilities Cool to see that Mistral AI has joined the growing number of agent frameworks. More below: https://t.co/tjruOmcDP5

❤️515

likes

🔁76

retweets


Ivan Leo

@ivanleomk

📅

Tue May 27

🆔15736217

⭐0.42

wow wow finally playing around with operator curious to see where this is https://t.co/7W0FLwwPGp

❤️4

likes


Andrew Ng

@AndrewYNg

📅

Tue May 27

🆔79170259

Agentic Document Extraction just got much faster! From previous 135sec median processing time down to 8sec. Extracts not just text but diagrams, charts, and form fields from PDFs to give LLM-ready output. Please see the video for details and some application ideas. https://t.co/29lOKf6UGO

❤️3,844

likes

🔁607

retweets


Ethan Mollick

@emollick

📅

Mon

🆔79589631

Just so everyone knows, we have passed the point where you can tell what is AI at a glance (or even, in many cases, a close look) These were all made by me with text prompts alone using Veo 3. https://t.co/RoZ67BYABr

❤️2,933

likes

🔁472

retweets


Tanishq Mathew Abraham, Ph.D.

@iScienceLuvr

📅

Tue May 27

🆔06889312

Long-Context State-Space Video World Models "we propose a novel architecture leveraging state-space models (SSMs) to extend temporal memory without compromising computational efficiency. " https://t.co/7KTQwg4veE

❤️270

likes

🔁30

retweets


jason liu

@jxnlco

📅

Sun

🆔18064323

Bunch of good conversations in today's RAG office hours. Most teams get RAG wrong because they're obsessed with the AI instead of the data. The real money is in finding what users actually need through data analysis. I've seen $100k/month value unlocked just by identifying… https://t.co/SD56uw6ZZU

❤️101

likes

🔁6

retweets


Lewis

@ctjlewis

📅

Sun

🆔24253391

this company is still such a mystery to me. https://t.co/0jepqHsrL0

❤️440

likes

🔁4

retweets


Jeremy Howard

@jeremyphoward

📅

Sun

🆔63328432

Just came across this old GigaOM article. I guessed in 2013 it might take ~10 years until "packages like word2vec can make deep learning even for relatively unsophisticated users". Not bad, I think! :D https://t.co/XxtbbeIlLY https://t.co/6PxNQR82t7

❤️90

likes


Jeremy Howard

@jeremyphoward

📅

Sun

🆔68102434

ok google https://t.co/vyp1GcHzP5

❤️68

likes


Paul Gauthier

@paulgauthier

📅

Sun

🆔97172151

Claude 4 Opus scored 72% on the aider polyglot coding benchmark. Claude 4 Sonnet scored 61%. Both of those are with 32k think tokens. Sonnet 4 seems to have underperformed 3.7. Full leaderboard: https://t.co/mBVaUPG9ZN https://t.co/tj4p5Pn6Tk

❤️641

likes

🔁62

retweets


YIFENG LIU

@YIFENGLIU_AI

📅

Mon

🆔71017265

1/6 We introduce RPG, a principled framework for deriving and analyzing KL-regularized policy gradient methods, unifying GRPO/k3-estimator and REINFORCE++ under this framework and discovering better RL objectives than GRPO: Paper: https://t.co/7xSUj01GIx Code:… https://t.co/0pn5sqhhC7

❤️199

likes

🔁38

retweets


Sepp Hochreiter

@HochreiterSepp

📅

Mon

🆔52597808

xLSTM for the classification of assembly tasks: https://t.co/l3hQTtQ31e "xLSTM model demonstrated better generalization capabilities to new operators. The results clearly show that for this type of classification, the xLSTM model offers a slight edge over Transformers." https://t.co/g5RWU9Bonf

❤️155

likes

🔁26

retweets


Simon Willison

@simonw

📅

Sun

🆔75158060

I put together an annotated version of the new Claude 4 system prompt, covering both the prompt Anthropic published and the missing, leaked sections (thanks, @elder_plinius) that describe its various tools It's basically the secret missing manual for Claude 4, it's fascinating! https://t.co/qDKnViikkS

❤️3,763

likes

🔁370

retweets


Lior⚡

@LiorOnAI

📅

Sun

🆔67369367

The best fine-tuning guide you'll find on arXiv this year. Covers: > NLP basics > PEFT/LoRA/QLoRA techniques > Mixture of Experts > Seven-stage fine-tuning pipeline https://t.co/Z7NSBBFvSS

❤️1,606

likes

🔁245

retweets


Eric Hartford

@QuixiAI

📅

Sun

🆔35981382

I released a dataset of 10k prompts which are refused by Qwen3, but answered by Llama3.3. This highly diverse data can be used to train a model to comply with Chinese law (or not), testing, evaluation, and activation steering. https://t.co/aKhFC1Bunb

❤️520

likes

🔁58

retweets


Ethan Mollick

@emollick

📅

Mon

🆔51701654

I've been reading Bethany Hughes book on the 7 Wonders & the myths of the Colossus of Rhodes (eg it did not span the harbor) As an experiment, I asked Google Deep Research to come make a prompt to describe the historical Colossus for veo 3. Nice, down to the location & riveting https://t.co/VLGzkQnbC3

❤️250

likes

🔁16

retweets


Hamel Husain

@HamelHusain

📅

Mon

🆔31624688

How do I debug multi-turn conversation traces? This is our advice - what has worked for you? https://t.co/XYEJj5vZDg

❤️93

likes

🔁7

retweets


Ethan Mollick

@emollick

📅

Tue Jun 03

🆔93748649

Which academic fields transitioned to Bluesky explains much of the transformation in conversations occurring here, far less humanities & economics discussion happens on X compared to before. (And no, I don’t think this was a net good thing for X, at least if you want new ideas) https://t.co/DttfOKQFfS

❤️350

likes

🔁42

retweets


Jerry Liu

@jerryjliu0

📅

Tue Jun 03

🆔41701470

Practical Applications of AI Agents in Finance 🤖🏦 I'm excited to release a set of slides that gives an overview of assistant and automation-based agent architectures and their applications across a broad range of financial verticals - from investment research to back-office to… https://t.co/VsHGA754bU

❤️146

likes

🔁38

retweets


elvis

@omarsar0

📅

Mon

🆔48400840

Reasoning Models Thinking Slow and Fast at Test Time Another super cool work on improving reasoning efficiency in LLMs. They show that slow-then-fast reasoning outperforms other strategies. Here are my notes: https://t.co/79XsaYcR8N

❤️286

likes

🔁64

retweets


Jeremy Howard

@jeremyphoward

📅

Tue Jun 03

🆔91031600

How @seb_ruder made the ULMFiT paper happen 😂 (from 2018 https://t.co/GEOZunWoXj student notes: https://t.co/1dX4HpAF4V) https://t.co/sl1r6heZ3B

❤️210

likes

🔁17

retweets


Ethan Mollick

@emollick

📅

Tue Jun 03

🆔72616075

An inexplicable failure of Microsoft & Google's AI tools is that they have access to my email but won't actually use their smarts to help me When I ask for "urgent messages," Google just gives me unread ones and Microsoft literally searches for "urgent" Yet Claude does better. https://t.co/yEQhgkAZxH

❤️434

likes

🔁19

retweets


Jeremy Howard

@jeremyphoward

📅

Tue Jun 03

🆔77704843

What a nice surprise - the MonsterUI component library for FastHTML on the HN front page :) https://t.co/eUcV2ysrrx

❤️192

likes

🔁7

retweets


Ethan Mollick

@emollick

📅

Thu May 29

🆔86148584

Claude 4 Opus gets real weird real fast when you ask it to create something "numinous" (and especially when you ask it to make it "more numinous"). This was just the start. The system card discusses the model's tendency towards producing spiritual themes, definitely noticeable. https://t.co/xKELacUK2C

❤️898

likes

🔁67

retweets


LlamaIndex 🦙

@llama_index

📅

Thu May 29

🆔35917385

RAG is dead, long live agentic retrieval! At LlamaIndex we've been saying for a long time that naive RAG is not enough for a modern application. Following from that conviction, we've built agentic strategies directly into LlamaCloud that you can adopt with just a few lines of… https://t.co/7Rh7ohDw3x

❤️788

likes

🔁142

retweets


Derya Unutmaz, MD

@DeryaTR_

📅

Thu May 29

🆔34809216

The agentic AI system I was testing is from @perplexity_ai Labs, which has just launched. My full prompt and the agent’s outputs are now featured on their website (link below). Overall, it’s an excellent and innovative agentic system that takes about 5–10 minutes to run. I’ll… https://t.co/meumSz7NlN

❤️198

likes

🔁15

retweets


Ethan Mollick

@emollick

📅

Thu May 29

🆔98423874

Neat: "Claude 4 create a game with a completely novel mechanic. start with 20 different ideas and narrow them down" Its idea was for players to "steal, store & redistribute physical properties between objects and themselves." It built this demo & fixed a couple of bugs I found. https://t.co/y2G1QsNAIi

❤️290

likes

🔁14

retweets


Shashwat Goel

@ShashwatGoel7

📅

Thu May 29

🆔83972675

Confused about recent LLM RL results where models improve without any ground-truth signal? We were too. Until we looked at the reported numbers of the Pre-RL models and realized they were severely underreported across papers. We compiled discrepancies in a blog below🧵👇 https://t.co/Hmn41grrrh

❤️879

likes

🔁126

retweets


Timothy B. Lee

@binarybits

📅

Thu May 29

🆔76214011

I don't think people appreciate how much Anthropic has been running the table in the market for coding tools over the last year. https://t.co/wVKF6MdsdF

❤️140

likes

🔁10

retweets
