Your curated collection of saved posts and media

Showing 32 posts Β· last 14 days Β· by score
J
Jerry Liu
@jerryjliu0
πŸ“…
Thu Jul 03
πŸ†”50608646

Practical Techniques for Context Engineering πŸ’‘ This is a fantastic blog post from @tuanacelik and @LoganMarkewich on a comprehensive breakdown of the types of context an LLM can interact with, and the core dimensions you have to consider: 1️⃣ Knowledge Base or tool selection -… https://t.co/r6XEtsJpio

Media 1
❀️274
likes
πŸ”62
retweets
πŸ–ΌοΈ Media
T
TwelveLabs (twelvelabs.io)
@twelve_labs
πŸ“…
Wed
πŸ†”40587399

In the 87thΒ session of #MultimodalWeekly, we welcome @garridoq_ (Research Scientist at @metaai) to share his awesome paper titledΒ "Intuitive physics understanding emerges from self-supervised pretraining on natural videos" in collaboration with his Meta AI colleagues. https://t.co/lqfZdXHvkM

Media 1
❀️71
likes
πŸ”23
retweets
πŸ–ΌοΈ Media
N
NIK
@ns123abc
πŸ“…
Fri
πŸ†”55836578

> a16z funds β€œCluely” a startup building AI cheating tools > soham open-sources clone of cluely called β€œCheating Daddy” > YC startup steals cheating daddy code and illegally relicenses as Apache 2.0 saying β€œbuilt in 4 daysβ€¦β€πŸ’€ Absolute state. https://t.co/K6WPZomu0e

Media 1Media 2
+2 more
❀️7,374
likes
πŸ”422
retweets
πŸ–ΌοΈ Media
_
Philipp Schmid
@_philschmid
πŸ“…
Fri
πŸ†”96869924

Gemini CLI Update from last week! We merged 85 PRs from 51 unique contributors. Here are key improvements: - Gemini CLI can now use audio and video (santhoshkumarCodes) - Upgrade to Ink 6 and React 19 (SandyTao520) - GEMINI md can import other markdown files with @.… https://t.co/i4CfZKdJOu

Media 1
❀️1,266
likes
πŸ”132
retweets
πŸ–ΌοΈ Media
J
jason liu
@jxnlco
πŸ“…
Thu Jul 03
πŸ†”63061791

lessons from finetuning rerankers with @lancedb https://t.co/54r2a5Cqdj

Media 1
❀️384
likes
πŸ”32
retweets
πŸ–ΌοΈ Media
H
Hamel Husain
@HamelHusain
πŸ“…
Wed
πŸ†”05856341

Whats a minimal viable eval setup? Error Analysis + Notebooks are all you need (for a while) 1 of 3 https://t.co/nv1ukwZ57s

Media 1
❀️161
likes
πŸ”10
retweets
πŸ–ΌοΈ Media
T
Zach Mueller
@TheZachMueller
πŸ“…
Fri
πŸ†”96238701

What is "SLURM"? How do you utilize a cluster of GPUs effectively? Why does this matter in today's Deep Learning world? On Wednesday at 11AM EST I'll be talking just about that! https://t.co/AXun5mZd5a

Media 1
❀️45
likes
πŸ”4
retweets
πŸ–ΌοΈ Media
H
Hamel Husain
@HamelHusain
πŸ“…
Fri
πŸ†”27327238

I do not have a favorite eval vendor. This is because I use most of them as a db and build tools on top. What seems to make the most impact wrt to success is the support they provide (which varies according to the situation), so I suggest paying attention to that. https://t.co/ZDNArBEaK1

Media 1
❀️17
likes
πŸ”3
retweets
πŸ–ΌοΈ Media
J
Jerry Liu
@jerryjliu0
πŸ“…
Thu Jul 03
πŸ†”79183229

We're all in on context engineering! A related topic that imo is table stakes for every AI engineer/user: workflow engineering πŸ› οΈ A lot of agent use cases revolve around automating work that otherwise a human would have to perform - customer support, legal research, report… https://t.co/Ry2F1IapZp

Media 1
❀️278
likes
πŸ”44
retweets
πŸ–ΌοΈ Media
H
Hamel Husain
@HamelHusain
πŸ“…
Wed
πŸ†”52797148

I have never been more excited about a talk! Why? @ttorres will show: 1. How domain experts (like PMs) can create high quality evals using simple approaches 2. Solving problems, no gatekeeping. 3. A decisive victory for notebooks. In our course: https://t.co/dR23WB2cAl https://t.co/biFMJgwG6t

Media 1
❀️56
likes
πŸ”6
retweets
πŸ–ΌοΈ Media
J
jason liu
@jxnlco
πŸ“…
Wed
πŸ†”11295088

now @skylar_b_payne https://t.co/PDgjLQRcuF

Media 1
πŸ–ΌοΈ Media
H
Hamel Husain
@HamelHusain
πŸ“…
Wed
πŸ†”87707223

What makes a good custom interface for reviewing LLM outputs? (which I recommend most people build!) These are some enhancements we’ve seen work well Screenshots in replies 1/6 https://t.co/W1L8jabX96

Media 1
❀️50
likes
πŸ”8
retweets
πŸ–ΌοΈ Media
D
Nirit Weiss-Blatt, PhD
@DrTechlash
πŸ“…
Wed
πŸ†”39574656

In the current AI talent war, everyone is focused on the big numbers (alleged compensation packages). It misses the bigger picture: the cultural shift following the DeepSeek moment. META is the American leader in open science (publications) and open source (Llama). Both OpenAI… https://t.co/5JwjM36Kjk

Media 1
❀️186
likes
πŸ”32
retweets
πŸ–ΌοΈ Media
V
vik
@vikhyatk
πŸ“…
Wed
πŸ†”10398615

they really are cooked. adults are not in charge any more https://t.co/gcgzSjuCsJ

Media 1
❀️918
likes
πŸ”37
retweets
πŸ–ΌοΈ Media
J
Jerry Liu
@jerryjliu0
πŸ“…
Wed
πŸ†”94580698

RAG POCs are easy, but building production-grade retrieval is legitimately hard. These are things you don’t realize when you’re first starting out building agents - β€œwow my chat over 10 pdfs works in 10 mins!”. We learned these lessons as we built out LlamaCloud and wanted to… https://t.co/NWPgDrF64x

Media 1
❀️237
likes
πŸ”41
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Thu Jul 03
πŸ†”02726463

"Claude 4 Opus, make the most insanely referential thing possible, make it super clever. like really smart. it should be working code" "Make it even more so" https://t.co/OulVDykGyv

❀️281
likes
πŸ”21
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Wed
πŸ†”79693108

Sometimes you get lucky with vibe coding. These days, I rely less on luck and get better results by focusing on context engineering. I built this fully functional deep research agent with Replit Agent and n8n in <10 mins. And it's deployed too! What a time to be alive! https://t.co/kd7K2Kjrb6

Media 1
❀️132
likes
πŸ”17
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Thu Jul 03
πŸ†”34836036

Product idea for OpenAI (I know a lot of you follow me): an entirely paper-based LLM. Just 780 volumes and only 30 person years to do the math for the first token using the paper version of GPT-1 Give the weights actual weight. Plus an excellent setup for science fiction stories https://t.co/iDGetnej4H

Media 1Media 2
+1 more
❀️239
likes
πŸ”14
retweets
πŸ–ΌοΈ Media
Z
Zeyuan Allen-Zhu, Sc.D.
@ZeyuanAllenZhu
πŸ“…
Thu Jul 03
πŸ†”78162555

Facebook AI Research (FAIR) is a small, prestigious lab in Meta. We don't train large models like GenAI or MSL, so it's natural that we have limited GPUs. GenAI or MSL's success or failure, past or future, doesn't reflect the work of FAIR. It is important to make this distinction https://t.co/2aN9ZEou7u

Media 1
❀️846
likes
πŸ”61
retweets
πŸ–ΌοΈ Media
H
Hamel Husain
@HamelHusain
πŸ“…
Thu Jul 03
πŸ†”87759755

How do I evaluate agentic workflows? We recommend a two-phased approach, first do error analysis on end-to-end task success/failure. 1 of 5 https://t.co/ZrfLOuXPWh

Media 1
❀️151
likes
πŸ”18
retweets
πŸ–ΌοΈ Media
J
jason liu
@jxnlco
πŸ“…
Thu Jul 03
πŸ†”81569923

buildign a look at your data agent in claude code https://t.co/Igah74qvCB

Media 1
❀️9
likes
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Thu Jul 03
πŸ†”96187970

AI for Scientific Search AI for Science is where I spend most of my time exploring with AI agents. This 120+ pages report does a good job of highlighting why all the big names like OpenAI and Google DeepMind are pursuing AI4Science. Bookmark it! My notes below: https://t.co/z2gRcVbnV4

Media 1
❀️694
likes
πŸ”154
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Wed
πŸ†”62406505

Threats in LLM-Powered AI Agents Workflows Neat survey of typical threats you encounter when building AI agents. Prompt injections and protocol exploits included. Bookmark this one! https://t.co/WalkxmYRBO

Media 1
❀️644
likes
πŸ”110
retweets
πŸ–ΌοΈ Media
E
evan conrad
@evanjconrad
πŸ“…
Wed
πŸ†”98868649

We've partnered with Modular to create Large Scale Inference (LSI), a new OpenAI-compatible inference service. It's up to 85% cheaper than other offerings & can handle trillion-token scale. We originally created it at the request of a major AI lab to do large scale multimodal… https://t.co/Ad6FXWBSXv

Media 1
❀️485
likes
πŸ”41
retweets
πŸ–ΌοΈ Media
L
LlamaIndex πŸ¦™
@llama_index
πŸ“…
Wed
πŸ†”71530035

Did you know? You can retrieve images and illustrative figures from your LlamaCloud Indexes as well as text! This is great for presentations, reports, and other document types that have rich imagery. Enabling this feature is as simple as toggling the "Multi-modal indexing"… https://t.co/egzmPkOUvv

Media 1
❀️15
likes
πŸ”2
retweets
πŸ–ΌοΈ Media
L
Luke Metro
@luke_metro
πŸ“…
Tue Jul 01
πŸ†”73873419

This is broadly true but the comms here seem very hard to get right β€œTake a pay cut due to the mission, but as a founder I get to be both mission driven and mega rich” feels like a difficult starting point https://t.co/Er38HO6aym

Media 1
❀️1,547
likes
πŸ”54
retweets
πŸ–ΌοΈ Media
S
swyx
@swyx
πŸ“…
Tue Jul 01
πŸ†”49020567

congrats to Boris and Cat for joining @cursor_ai ! Claude Code + Cursor = ???? https://t.co/oh2q8HkWX6

Media 1
❀️622
likes
πŸ”33
retweets
πŸ–ΌοΈ Media
R
Arnaud Bertrand
@RnaudBertrand
πŸ“…
Wed
πŸ†”07655965

This absolutely stunning chart comparison by the NYT might prove to be the most important geopolitical visualization of the 21st century. The two major superpowers are each cornering a competing energy platform. China bets everything on clean energy, the US on fossil fuels.… https://t.co/H9mPeFmQy1

Media 1Media 2
❀️4,167
likes
πŸ”1,265
retweets
πŸ–ΌοΈ Media
K
Kylie Robison
@kyliebytes
πŸ“…
Tue Jul 01
πŸ†”91397675

sorry apple what in the ever loving fuck is this contact drop down https://t.co/xuCYevJbXK

Media 1
❀️1,979
likes
πŸ”35
retweets
πŸ–ΌοΈ Media
W
weber
@weberwongwong
πŸ“…
Tue Jun 24
πŸ†”23888657

enough about forward-deployed engineers we need more forward-deployed angels was thinking about this because one of our angels Umesh Khanna (@forwarddeploy) has just been pulling up on Sundays and going deep with us on the product problems we're dealing with, designing growth… https://t.co/HONqYxYwA9

Media 1
❀️143
likes
πŸ”7
retweets
πŸ–ΌοΈ Media
J
Jerry Liu
@jerryjliu0
πŸ“…
Wed
πŸ†”85199234

Introducing Document Extraction as an MCP Server βœ‚οΈπŸ“‘ A huge use case for AI agents is being able to extract out items from a diverse set of complex documents in a repeatable manner - whether it’s legal contracts, invoices, financial statements, passports, and more. In this… https://t.co/1glV2lCgZd

❀️397
likes
πŸ”44
retweets
πŸ–ΌοΈ Media
I
Tanishq Mathew Abraham, Ph.D.
@iScienceLuvr
πŸ“…
Wed
πŸ†”71709744
⭐0.30

Fun! https://t.co/aDbqirMhbw

Media 1
❀️17
likes
πŸ”1
retweets
πŸ–ΌοΈ Media