Your curated collection of saved posts and media

Showing 32 posts Β· last 14 days Β· by score
L
LlamaIndex πŸ¦™
@llama_index
πŸ“…
Wed
πŸ†”47393729

πŸ’ΌπŸ“Š Build a multi-agent financial report generating chatbot from scratch, using LlamaIndex agent workflows πŸ‘‡ The full example from @jerryjliu0's workshop last week is below. In this hands-on Colab, you'll: βœ… Parse & index 10-K filings from Adobe βœ… Use agentic RAG to answer… https://t.co/cisveE9Ry0

Media 1
❀️80
likes
πŸ”18
retweets
πŸ–ΌοΈ Media
_
Philipp Schmid
@_philschmid
πŸ“…
Wed
πŸ†”51675312

Here is my 2 hour long workshop i just finished at the @aiDotEngineer World's fair. This is all you need to know to learn on how to use Gemini 2.5! It is beginner friendly from getting your first API key to multimodality, function calling and MCP. πŸ†“ Completely free - runs… https://t.co/aViZBxvx2M

Media 1
❀️251
likes
πŸ”33
retweets
πŸ–ΌοΈ Media
J
Jeremy Howard
@jeremyphoward
πŸ“…
Wed
πŸ†”16447176

I'm thrilled to see @math_rachel's interesting new article, on a recent deep learning microbiology paper, getting the attention it deserves -- check it out if you haven't seen it yet! https://t.co/K7vhqOaecB https://t.co/4bSn39KCo6

Media 1
❀️362
likes
πŸ”49
retweets
πŸ–ΌοΈ Media
T
Robert Ta
@therobertta_
πŸ“…
Tue Jun 03
πŸ†”55864295

β€œIs RAG dead?” This question pops up every 2 weeks! I just got the scoop from @HamelHusain and @sh_reya in their amazing AI Evals course: https://t.co/hSpuJqh6eM Let’s clear the confusionβ€”and talk about what actually matters when evaluating retrieval-augmented generation… https://t.co/o2P17tZYX3

Media 1
❀️4
likes
πŸ”3
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Tue Jun 03
πŸ†”81422030

πŸ”₯ Introducing Firecrawl /search. @firecrawl_dev just launched an insane feature to search and crawl in one shot. You heard that right! One API call to search the web and scrape any data you need for your AI agents. I took it for a spin in n8n: https://t.co/HViclvq6I1

❀️230
likes
πŸ”33
retweets
πŸ–ΌοΈ Media
S
Standard Completions
@stdcompletions
πŸ“…
Fri
πŸ†”31598379

standard completions dot org https://t.co/DhcwmbEuJH

Media 1
❀️184
likes
πŸ”19
retweets
πŸ–ΌοΈ Media
L
Lerrel Pinto
@LerrelPinto
πŸ“…
Tue Jun 03
πŸ†”07287355

Teaching robots to learn only from RGB human videos is hard! In Feel The Force (FTF), we teach robots to mimic the tactile feedback humans experience when handling objects. This allows for delicate, touch-sensitive tasksβ€”like picking up a raw egg without breaking it. πŸ§΅πŸ‘‡ https://t.co/hshodP8elW

❀️538
likes
πŸ”86
retweets
πŸ–ΌοΈ Media
T
Teknium (e/Ξ»)
@Teknium1
πŸ“…
Wed
πŸ†”00444838

Another RL environment added to Atropos! @MatternJustus released a pydantic schemas dataset that can be used to ask the model to create valid structured outputs of those objects - so I made an environment that asks the model to create JSON, YAML, TOML, etc and validate against… https://t.co/kqiSUXBJYf

Media 1
❀️92
likes
πŸ”8
retweets
πŸ–ΌοΈ Media
A
Andrew Ng
@AndrewYNg
πŸ“…
Tue Jun 03
πŸ†”08113409

Everyone should learn to code with AI! At AI Fund, everyone - not just engineers - can vibe code or use AI assistance to code. This has been great for our creativity and productivity. I hope more teams will empower everyone to build with AI. Please watch the video for details. https://t.co/rsGC1QSKHL

❀️2,288
likes
πŸ”432
retweets
πŸ–ΌοΈ Media
B
Stella Biderman
@BlancheMinerva
πŸ“…
Sat
πŸ†”51973825

Claude-4 Sonnet scores quite well on SPOT, our recent benchmark for identifying errors in academic papers. Its precision of 11.3% is far ahead of its competition, but probably not something you'd want to rely on to report you for fraud... https://t.co/Bwg3beDscd

Media 1
❀️116
likes
πŸ”10
retweets
πŸ–ΌοΈ Media
Z
Ravid Shwartz Ziv
@ziv_ravid
πŸ“…
Thu May 29
πŸ†”03987636

You know all those arguments that LLMs think like humans? Turns out it's not true. 🧠 In our paper "From Tokens to Thoughts: How LLMs and Humans Trade Compression for Meaning" we test it by checking if LLMs form concepts the same way humans do @ylecun @ChenShani2 @jurafsky https://t.co/ctVszZXoDW

Media 1
❀️1,882
likes
πŸ”322
retweets
πŸ–ΌοΈ Media
J
Jerry Liu
@jerryjliu0
πŸ“…
Sat
πŸ†”38983396

How does Sonnet 4.0 compare vs. Gemini 2.5 Pro on document understanding? πŸ‘‡ I’ve found Sonnet 4.0 to be much better at table parsing. Check out the screenshot below πŸ–ΌοΈ - I compared both models’ visual reasoning capabilities over a screenshot of a dense Caltrain schedule packed… https://t.co/rcnv1H3mq0

Media 1
❀️101
likes
πŸ”16
retweets
πŸ–ΌοΈ Media
H
htmx.org / The Net's Smoothest Code Man (same)
@htmx_org
πŸ“…
Sat
πŸ†”76395580

we are now at a point that we can ditch build systems for many projects & many people underestimate the amount weight doing so would lift off their burdened shoulders https://t.co/41vg3fcnXr

Media 1
❀️2,266
likes
πŸ”138
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Sat
πŸ†”21247227

Open-Ended Evolution of Self-Improving Agents Can AI systems endlessly improve themselves? This work shows the potential of self-improving AI, inspired by biological evolution and open-ended exploration. This is a must-read! Here are my notes: https://t.co/KRmNve8pl5

Media 1
❀️916
likes
πŸ”193
retweets
πŸ–ΌοΈ Media
T
Teknium (e/Ξ»)
@Teknium1
πŸ“…
Sun
πŸ†”05116334

FYI @max_paperclips integrated **1,069** new environments to Atropos by porting in @intern_lm's new environment bootcamp - hundreds and hundreds of new tasks - including various task types such as games, logic problems, puzzles, algorithms, and more. We're working on a reasoning… https://t.co/RlB5jzIzps

Media 1
❀️74
likes
πŸ”3
retweets
πŸ–ΌοΈ Media
E
Eugene Yan
@eugeneyan
πŸ“…
Sun
πŸ†”91612354

If you're attending @aiDotEngineer on wed, june 4th, check out the recsys track. I'll be hosting talks from Pinterest, LinkedIn, Netflix, Instacart, Youtube. I'll also share 3 ideas that'll likely drive the next few years in recsys: semantic IDs, llm-augmentation, unified models https://t.co/WbL8yhRw0k

Media 1
❀️110
likes
πŸ”9
retweets
πŸ–ΌοΈ Media
H
Hamel Husain
@HamelHusain
πŸ“…
Sat
πŸ†”01493197

Should I build a custom annotation tool or use something off-the-shelf? https://t.co/TlmrtRrnDk https://t.co/7EsMIEHkxV

Media 1
❀️25
likes
πŸ”3
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Sun
πŸ†”80821645

Just learned about Claude 4's God mode! "Don't hold back. Give it your all!" On a serious note, you'll be surprised just how better the results are when you use clear modifiers, be specific, and demand more from the models. This also works well for other models like o3. https://t.co/7yREQjF33R

Media 1
❀️969
likes
πŸ”91
retweets
πŸ–ΌοΈ Media
H
Hamel Husain
@HamelHusain
πŸ“…
Sun
πŸ†”37330608

Why do you recommend binary (pass/fail) evaluations instead of 1-5 ratings (Likert scales) for applied evals? Links in reply https://t.co/wCi78dH8J2

Media 1
❀️247
likes
πŸ”21
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Sun
πŸ†”03319089

I wrote a history of AI in 32 images of otters using wifi on airplanes, from images to video to code. It shows two big trends: rapid improvements in AI models of all types and the growth of open weights AI models. Link in the comments. https://t.co/PrZDmKaP7D

Media 1
❀️317
likes
πŸ”34
retweets
πŸ–ΌοΈ Media
J
Jerry Liu
@jerryjliu0
πŸ“…
Mon
πŸ†”79365462

A huge (and probably underrated) promise of LLMs is inhaling a million PDFs and making sense of them through automated extraction. The baseline 🟑: Stuff tokens into a function-calling LLM with a Pydantic schema, get back structured JSON. You can do this with most frameworks in… https://t.co/qPCa3NchJp

Media 1
❀️232
likes
πŸ”32
retweets
πŸ–ΌοΈ Media
S
Stefania Druga
@Stefania_druga
πŸ“…
Mon
πŸ†”64595248

On my way to SF from Tokyo. Can't wait to talk about Real-time AI Scientists @aiDotEngineer this year! https://t.co/I80upeWgrW

Media 1Media 2
❀️34
likes
πŸ”1
retweets
πŸ–ΌοΈ Media
T
Teknium (e/Ξ»)
@Teknium1
πŸ“…
Mon
πŸ†”43703919

Decentralized Training Progress - 6/1/2025 https://t.co/y4pjVejNks

Media 1
❀️417
likes
πŸ”29
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Mon
πŸ†”17307514

Veo 3 is really fun to use for historical what-ifs. I put together a 1940s video newsreel as if Project Habakkuk, the World War Two British plan to build a giant aircraft carrier out of pykrete, a mix of ice and woodpulp, had actually happened. https://t.co/kh7wqm3yCB

❀️309
likes
πŸ”19
retweets
πŸ–ΌοΈ Media
M
martin_casado
@martin_casado
πŸ“…
Sun
πŸ†”13450573

Knuth shows us the way. Again: https://t.co/kndEGZGHFr

Media 1
❀️188
likes
πŸ”19
retweets
πŸ–ΌοΈ Media
I
Tanishq Mathew Abraham, Ph.D.
@iScienceLuvr
πŸ“…
Mon
πŸ†”43070775

VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL "we propose VisualSphinx, a first-of-its-kind large-scale synthetic visual logical reasoning training data." "we propose a rule-to-image synthesis pipeline, which extracts and expands puzzle rules from seed… https://t.co/H3b1PuuqPE

Media 1
❀️102
likes
πŸ”14
retweets
πŸ–ΌοΈ Media
H
hardmaru
@hardmaru
πŸ“…
Mon
πŸ†”68762892

Facebook AI Research is the OG β€œOpen” AI https://t.co/awEt2L4ZvD

Media 1
❀️3,047
likes
πŸ”261
retweets
πŸ–ΌοΈ Media
E
Eugene Yan
@eugeneyan
πŸ“…
Wed
πŸ†”59552524

Some thoughts on leadership: https://t.co/FvDlqCGGu3 β€’ What makes an exceptional leader? β€’ What do exceptional leaders do? β€’ Leadership styles: Commando, soldier, police https://t.co/S0eYpGBjxo

Media 1Media 2
+1 more
❀️46
likes
πŸ”3
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Sun
πŸ†”74813672

Agent Zero A personal agentic framework that dynamically grows and learns with you. - It uses the OS as a tool. - Has search and terminal execution too. - It has persistent memory to memorize key information to solve future tasks more reliably. - Multi-agent support. https://t.co/b0PMAvzcrq

Media 1
❀️356
likes
πŸ”59
retweets
πŸ–ΌοΈ Media
H
Hamel Husain
@HamelHusain
πŸ“…
Sun
πŸ†”77885194

Should I build a custom annotation tool or use something off the shelf? P.S. This is a correction - I wasn't opinionated enough before. Links in reply https://t.co/PtIW1AjtYF

Media 1
❀️36
likes
πŸ”3
retweets
πŸ–ΌοΈ Media
H
Hamel Husain
@HamelHusain
πŸ“…
Mon
πŸ†”85708727

Asking in meme format Instead https://t.co/MUn6tqbyBS

Media 1
❀️33
likes
πŸ”1
retweets
πŸ–ΌοΈ Media
S
SkalskiP
@skalskip92
πŸ“…
Mon
πŸ†”99859220

CVPR 2025 starts in less than few weeks; I'm working on a list of must-see CVPR papers / projects any important papers I should add? link: https://t.co/1VlLn2BWxl https://t.co/XX8uLwmWLc

Media 1
❀️269
likes
πŸ”37
retweets
πŸ–ΌοΈ Media