Your curated collection of saved posts and media

Showing 32 posts · last 14 days · by score
omarsar0
@omarsar0
📅
Nov 25, 2025
150d ago
🆔64211266

Here's a visual summary of the new guide by Anthropic. It's on how to improve tool use for AI agents. 3 core ideas:
- Tool Search tool to discover tools on-demand to save context
- Programmatic tool calling to orchestrate tools via code
- Tool schema + usage
bookmark it https://t.co/cNBCx0y7AH

omarsar0
@omarsar0
📅
Nov 26, 2025
149d ago
🆔56689789

Interleaved thinking is a game-changer. I built this little deep research agent, and the results are impressive. The agent is just more efficient at reasoning over multiple steps. Huge leverage for self-improving agents. https://t.co/6oCi8ifyKV

dair_ai
@dair_ai
📅
Nov 26, 2025
149d ago
🆔48115915

Multi-agent systems are powerful but expensive. However, the cost isn't in the reasoning itself. It's in the communication. Agents exchange full text messages, consuming tokens for every coordination step. When agents need to collaborate on complex problems, this overhead adds up fast. More agents, more messages, more tokens, more cost. This new research introduces LatentMAS, a framework where agents communicate through compressed latent vectors instead of natural language. The key idea: agents don't need to explain everything in words. They encode task-relevant information into compact hidden representations. Other agents decode what they need. No verbose back-and-forth. The framework operates in three phases:
- Encoding: agents compress their knowledge into low-dimensional vectors.
- Sharing: these vectors replace text messages between agents.
- Decoding: receiving agents reconstruct what matters.
Inspired by how transformers process information internally through hidden states. Now applied to inter-agent communication. What makes this powerful? Communication costs drop significantly while task performance stays intact. The paper provides theoretical guarantees and empirical validation across multiple reasoning benchmarks. This unlocks practical scalability. It will allow AI devs to deploy more agents and run more complex reasoning without proportional cost increases. Paper: https://t.co/nl2OQX7txH Learn to build AI agents in our academy: https://t.co/zQXQt0PMbG

omarsar0
@omarsar0
📅
Nov 26, 2025
149d ago
🆔62960083

As usual, Anthropic just published another banger. This one is on building agents that continue to do useful work for an arbitrarily long time. Great tips on context management. A must-read for AI devs. https://t.co/UXCIr86FS4

omarsar0
@omarsar0
📅
Nov 27, 2025
148d ago
🆔41045739

A cool part of Opus 4.5 that went under the radar: you can now set the effort level to trade off between response thoroughness and token efficiency.
- high: complex reasoning
- medium: agentic tasks with balanced effort
- low: simpler tasks
https://t.co/gTDEvplhUj
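As a rough illustration of the three effort levels named in this post, the sketch below maps a task category to a level. The category names, the `pick_effort` helper, and the fallback to `medium` are assumptions made for illustration. The post does not show the actual API parameter, so no API call is made here.

```python
# Hypothetical task categories mapped to the effort levels named in the post.
EFFORT_BY_TASK = {
    "complex_reasoning": "high",
    "agentic": "medium",
    "simple": "low",
}


def pick_effort(task_kind: str) -> str:
    """Return an effort level for a task category (assumed default: medium)."""
    return EFFORT_BY_TASK.get(task_kind, "medium")


print(pick_effort("agentic"))
```

Keeping the mapping in one table makes the thoroughness versus token-cost trade-off explicit and easy to adjust per workload.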

omarsar0
@omarsar0
📅
Nov 27, 2025
148d ago
🆔23592040

Cool paper from Meta. And another excellent application of multi-agent systems. (bookmark it) Training modern AI models requires massive amounts of high-quality data. However, the bottleneck isn't just quantity. The data is just not diverse enough. Single models generating synthetic data tend to produce homogeneous outputs, repeating patterns, and lacking the nuanced variety found in human-created datasets. This new research from Meta introduces Matrix, a peer-to-peer framework where multiple AI agents collaboratively generate synthetic training data through decentralized interactions. Matrix achieves 2–15× higher data generation throughput under identical hardware resources, without compromising output quality. TL;DR: Instead of one model producing data, specialized agents play distinct roles and interact with each other. One asks questions, another responds, a third evaluates quality. These multi-turn conversations capture complex reasoning and diverse perspectives. What makes Matrix different: no central coordinator. Agents communicate directly in a fully decentralized architecture. This enables scalability without infrastructure bottlenecks. The framework operates through role-based conversation protocols, multi-turn interaction patterns, and built-in quality filtering at each stage. Only data meeting quality thresholds makes it into the final training set. Multi-agent collaboration produces more diverse synthetic data than single-model approaches. The resulting datasets improve downstream model performance across reasoning and instruction-following benchmarks.

dair_ai
@dair_ai
📅
Nov 27, 2025
148d ago
🆔46598076

Banger paper from NVIDIA. Bigger models aren't always the answer. However, the default approach to improving AI systems today remains scaling up. More parameters, more compute, more cost. But many tasks don't require the full power of a massive model. This new research introduces ToolOrchestra, a framework that strategically coordinates multiple AI models with external tools based on task complexity. Instead of routing everything through one large model, an orchestrator decides dynamically. When is a tool necessary? Which model size fits the task? How should components coordinate? The researchers trained Orchestrator-8B, a specialized 8-billion parameter model that makes intelligent routing decisions. It determines when external tools are needed versus when model inference alone suffices. On HLE, Orchestrator achieves a score of 37.1%, outperforming GPT-5 (35.1%) while being 2.5x more efficient. They also release ToolScale, a synthetic dataset of tool usage examples across diverse scenarios for training orchestration capabilities. Why it matters: strategic orchestration of smaller models with targeted tool usage can match or exceed monolithic large model performance while cutting computational overhead. Paper: https://t.co/iNvqIHGTES Learn how to build AI Agents in our academy: https://t.co/zQXQt0PMbG
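To make the routing idea in this post concrete, here is a minimal sketch of an orchestrator that picks a model tier and decides whether a tool is needed. It is a hand-written heuristic, not the trained Orchestrator-8B. The `Route` type, the cue words, the tier names, and the 30-word cutoff are all hypothetical.

```python
from dataclasses import dataclass

# Hypothetical cue words suggesting an external tool would help.
TOOL_CUES = {"calculate", "search", "lookup"}


@dataclass
class Route:
    model: str      # hypothetical tier name: "small" or "large"
    use_tool: bool  # whether to call an external tool


def route(task: str, long_task_words: int = 30) -> Route:
    """Heuristic stand-in for a learned orchestrator's routing decision."""
    words = task.lower().split()
    use_tool = any(word in TOOL_CUES for word in words)
    model = "large" if len(words) > long_task_words else "small"
    return Route(model=model, use_tool=use_tool)


print(route("calculate 17 * 23"))
```

A learned orchestrator would replace both heuristics with model predictions, but the interface stays the same: a task goes in, a routing decision comes out.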

GoogleDeepMind
@GoogleDeepMind
📅
Oct 16, 2025
190d ago
🆔11588666

We’re announcing a research collaboration with @CFS_energy, one of the world’s leading nuclear fusion companies. Together, we’re helping speed up the development of clean, safe, limitless fusion power with AI. ⚛️ https://t.co/5gDqP3WiNe

goodfellow_ian
@goodfellow_ian
📅
Nov 18, 2025
157d ago
🆔31337797

Amazing test of Gemini 3’s multimodal reasoning capabilities: try generating a threejs voxel art scene using only an image as input Prompt: I have provided an image. Code a beautiful voxel art scene inspired by this image. Write threejs code as a single-page

goodfellow_ian
@goodfellow_ian
📅
Nov 18, 2025
157d ago
🆔90875587

Best outputs from Gemini 2.5 Pro vs 3 Pro: this example nicely illustrates the fidelity jump with 3 Pro, through strong multimodal understanding and 3D reasoning https://t.co/0aM8oSgmFA

goodfellow_ian
@goodfellow_ian
📅
Nov 18, 2025
157d ago
🆔11043966

Try it for yourself at https://t.co/yT45AjuoEn https://t.co/wmOv3cj17j

DrJimFan
@DrJimFan
📅
Oct 28, 2025
178d ago
🆔84853998

Listening to Jensen talk about his favorite maths - specs of Vera Rubin chips, and the full stack from lithography to robot fleets assembling physical fabs in Arizona & Houston. Quoting Jensen, “these factories are basically robots themselves”. I visited NVIDIA facilities before and they look absolutely unreal. Sci-fi scenes pale in comparison to the real Matrix, racks over racks fading into the horizon. The art of enchanting rocks to do computation is the greatest craft humanity has mastered. Sometimes I forget I’m at a hardware company with huge muscles to move atoms at unbelievable scale.

yifever
@yifever
📅
Nov 05, 2025
170d ago
🆔62062088

congrats to llama 3 large for winning the LLM trading contest by not participating https://t.co/PsA6hUYQ48

random_walker
@random_walker
📅
Oct 16, 2025
190d ago
🆔12253029

My @PrincetonCITP colleague Mihir Kshirsagar asks: are Microsoft-OpenAI, Amazon-Anthropic, and Google using subsidized capital to lock in enterprise customers right now through aggressive pricing, multi-year contracts, and deep integrations? https://t.co/iQ5sADa6aI https://t.co/tjun9v4cGj

sayashk
@sayashk
📅
Oct 20, 2025
186d ago
🆔18641889

I am on the faculty job market this year! I am seeking tenure-track faculty positions to drive my research agenda on rigorous AI evaluation for science and policy. I am applying broadly across disciplines, and would be grateful to hear of relevant positions. Materials: 🧵 https://t.co/31r7W1FHNO

random_walker
@random_walker
📅
Oct 21, 2025
185d ago
🆔07273887

A lot of AI discourse is magical thinking and ignores the crucial "Phase 2". My bet is that
(1) AI can indeed play a role in transforming various industries, societies, and governments
(2) These transformations, if they happen, will usually be slow and painful
(3) Their nature is not dictated by the logic of the technology itself but by what we collectively choose to do with the tech
(4) In most cases, positive transformation will require institutional reform and hard decisions on issues such as compensating the "losers" of structural shifts
(5) Most of the energy on AI & transformation today (for instance, on how AI can transform science) is misdirected because it doesn’t focus on real bottlenecks. The work of identifying and addressing bottlenecks, which is where the alpha is, has not really begun
(6) This work will have to be done on a sector-by-sector basis, and domain expertise is more important than AI expertise.

random_walker
@random_walker
📅
Nov 10, 2025
165d ago
🆔22501626

The Longitudinal Expert AI Panel is a really well thought out AI forecasting exercise and I have been happy to participate as an advisor. Here is the launch whitepaper: https://t.co/oSLN9AGayN https://t.co/7dLo1QSxQO

AndyMasley
@AndyMasley
📅
Nov 19, 2025
156d ago
🆔92412564

I think that Hao made a bad but honest mistake and I don't mean to attack her overall character as a journalist. In contrast, I would like to take this opportunity to directly attack the journalistic integrity of More Perfect Union, who are much more influential in the AI water conversation than Hao and have made clear, direct decisions to deceive their viewers in basically every video they made about it. I see them as the biggest bad guy in the debate. My interactions with Hao have been very nice. My interaction with the head of More Perfect Union was terrible (logged in the post below) and confirmed to me that they don't care about the truth on this at all. I have a long post on how every single presentation of the AI water issue they make has been intentionally off base here. Would like more attention on this: https://t.co/hbDqRORZsn

llama_index
@llama_index
📅
Nov 06, 2025
169d ago
🆔87140503

Here's a common scenario: Your finance team gets emails all day with invoices from partners and expense reports from employees. Each one needs different handling. Invoices need acknowledgment and payment scheduling. Expenses need budget validation before approval etc. In this example we build an agent that automatically triages incoming emails with attachments, extracts the right information, and takes appropriate action. Our approach uses three of our tools working together:
1️⃣ LlamaClassify handles the first decision point. It looks at each attachment and determines: is this an invoice that needs to be paid out to a partner, or an expense that needs reimbursement? It also provides reasoning for the decision.
2️⃣ LlamaExtract does the heavy lifting on data extraction. We create two specialized agents with different schemas for invoices vs expenses.
3️⃣ Agent Workflows orchestrates the entire process. It connects classification to extraction to business logic: in this case, checking expenses against a budget threshold and generating appropriate email responses via LLM.
Classify incoming documents → extract relevant data → apply business rules → take action. Need to add a new document type? Add a classification rule and an extraction schema. Need different business logic? Modify the workflow steps. The components stay the same. Check out the full example: https://t.co/5qsO6gmBs2
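The classify, extract, apply rules, act flow described in this post can be sketched in plain Python. This is not the LlamaClassify, LlamaExtract, or Agent Workflows API. Every function here is a hypothetical stand-in (keyword matching instead of a classifier, `key: value` parsing instead of schema extraction), and the 500.00 budget threshold is an assumed value.

```python
from dataclasses import dataclass

BUDGET_THRESHOLD = 500.00  # assumed per-expense limit, for illustration only


@dataclass
class Attachment:
    filename: str
    text: str


def classify(att: Attachment) -> tuple[str, str]:
    """Keyword stand-in for the classification step: (label, reasoning)."""
    if "invoice" in att.text.lower():
        return "invoice", "text mentions 'invoice'"
    return "expense", "no invoice marker found, treated as an expense report"


def extract(att: Attachment, label: str) -> dict:
    """Stand-in for schema-based extraction: one schema per document type."""
    pairs = (line.split(":", 1) for line in att.text.splitlines() if ":" in line)
    fields = {key.strip().lower(): value.strip() for key, value in pairs}
    amount = float(fields["amount"])
    if label == "invoice":
        return {"vendor": fields["vendor"], "amount": amount}
    return {"employee": fields["employee"], "amount": amount}


def decide(label: str, data: dict) -> str:
    """Business rules: invoices get scheduled, expenses get a budget check."""
    if label == "invoice":
        return f"acknowledge and schedule payment of {data['amount']:.2f} to {data['vendor']}"
    if data["amount"] <= BUDGET_THRESHOLD:
        return f"approve reimbursement of {data['amount']:.2f} for {data['employee']}"
    return f"escalate: {data['amount']:.2f} exceeds budget threshold"


def triage(att: Attachment) -> str:
    """Classify, extract, then apply business rules to one attachment."""
    label, _reasoning = classify(att)
    return decide(label, extract(att, label))


print(triage(Attachment("inv.pdf", "Invoice\nVendor: Acme\nAmount: 1200.50")))
```

Adding a document type in this sketch means adding a branch in `classify` and a schema in `extract`, which mirrors the post's point that the components stay the same while rules and schemas change.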

llama_index
@llama_index
📅
Nov 07, 2025
168d ago
🆔13855235

We probably shouldn't tell you how to build your own document parsing agents, but we will 😮. AI agents are transforming how we handle messy, real-world documents that break traditional OCR systems. Join our live webinar on December 4th at 9 AM PST where the LlamaParse team reveals industry secrets for parsing complex documents:
📋 Blueprint for building next-generation document parsing workflows using agents instead of OCR alone
🔧 Practical strategies for handling handwriting, rotated scans, nested tables, and visually dense layouts
🤖 Latest LlamaCloud capabilities showing how vision language models automate extraction from previously unparseable PDFs, forms, and images
⚡ When to apply each component in your parsing pipeline and why it matters
We'll show you how to move beyond simple text extraction to actually automate understanding of documents with multi-column layouts, embedded charts, skewed scans, and tables within tables. Register now: https://t.co/Q17V6sC1V1

llama_index
@llama_index
📅
Nov 07, 2025
168d ago
🆔72857356

Trigger your agent workflows directly from your inbox, using our LlamaAgents and @resend webhooks 📧 In this demo, we built a system that:
👉 Receives emails with documents attached
👉 Classifies the attachments as either invoices or expenses using LlamaClassify
👉 Extracts the relevant information through LlamaExtract
👉 Writes an email reply and sends it back to the user
All of this is packaged as an agent workflow and deployed to the cloud through our LlamaAgents! 🚀
🦙 Get started with all our LlamaCloud services now: https://t.co/Ct7pawLEFX
📚 Learn more about our agent workflows: https://t.co/VX6GwKdVMB
⭐ Star the repo on GitHub: https://t.co/vKjGP62fLE

llama_index
@llama_index
📅
Nov 10, 2025
165d ago
🆔48266141

There are Vegas parties and there is Late Shift 🎉 Join us for an exclusive re:Invent afterparty that brings together the best minds in AI and tech for a night you won't forget.
🍸 Cocktails and disco balls at Diner Ross Steakhouse in The LINQ
🤖 Connect with the teams behind @browserbase, @braintrust, @modal_labs, and LlamaIndex
🌙 Late-night tech conversations when the conference sessions end
🎟️ Limited spots with approval-required registration
We're teaming up with our friends at @browserbase, @usebraintrust, and @modal_labs to host the most fun you'll have all conference. After your evening sessions, meet us for cocktails, networking, and the kind of tech chatter that makes re:Invent legendary. RSVP now - spots are limited: https://t.co/sYU6IbKYvg

llama_index
@llama_index
📅
Nov 11, 2025
164d ago
🆔79829204

See how @pathwork scaled their life insurance document processing from 5,000 to 40,000 pages per week using LlamaParse.
📄 Process complex medical records, lab results, and decades-old scanned PDFs with 8x improved throughput
🤖 Automatically extract and index carrier underwriting guidelines to keep risk rules current
⚡ Replace fragile, manual pipelines with robust automation that handles everything from digital forms to 1970s faded scans
🎯 Free up engineering time from maintenance to focus on building new product features
@pathwork's Case Underwriter, Knowledge Assistant, and Pre-App Manager products all rely on transforming unstructured insurance documentation into structured data for faster decision-making. By integrating LlamaParse, they eliminated bottlenecks that were directly limiting customer growth and built future-proof infrastructure that automatically improves over time. Read the full case study: https://t.co/Cla0bDzPji

llama_index
@llama_index
📅
Nov 12, 2025
163d ago
🆔15848401

Build a document understanding agent for SEC filings that uses a multi-step approach with LlamaClassify and Extract to identify the filing type and hand it off to the right extraction agent. Deployed with LlamaAgents.
🔧 Customize extraction schemas to fit your specific data requirements and business logic
📊 Review and correct extractions through an intuitive frontend UI before finalizing results
🚀 Extend the system with additional workflows for downstream data syncing or automated monitoring
⚙️ Get started quickly with our structured template and clear documentation
Check out the complete starter template and begin building your extraction system: https://t.co/a4XtdKvKbs

llama_index
@llama_index
📅
Nov 12, 2025
163d ago
🆔20900654

See the full example here: https://t.co/i5ZzljkdK3

llama_index
@llama_index
📅
Nov 13, 2025
162d ago
🆔51549854

Chart OCR just got a major upgrade with our new experimental "agentic chart parsing" feature in LlamaParse 📈🧪 Most LLMs struggle with converting charts to precise numerical data, so we've created an experimental system that follows contours in line charts and extracts values. Automate chart analysis without spending hours manually correcting extracted values. Try it now in LlamaParse: https://t.co/JHWRvwd93B

llama_index
@llama_index
📅
Nov 17, 2025
158d ago
🆔57950625

Document AI goes beyond traditional OCR to create intelligent systems that read, understand, and act on documents like humans do. Our latest blog post explains how agentic OCR combined with LLM-powered workflows is transforming document automation across industries:
🧠 Agentic OCR interprets visual and semantic structure, achieving 90%+ pass-through rates vs 60-70% with legacy systems
📊 Multimodal understanding processes charts, images, and tables that conventional OCR simply cannot interpret
⚡ Smart workflows use reasoning instead of rigid rules, adapting dynamically when documents deviate from expectations
🔄 Self-correcting error handling attempts corrections and learns from exceptions rather than failing silently
Companies are seeing measurable gains: higher throughput, lower operational costs, faster deployment, and improved compliance with full auditability. We've built the complete Document AI stack with LlamaCloud - from LlamaParse for structure-aware parsing to LlamaExtract for declarative schema extraction, all orchestrated through our Workflows framework. Read the full breakdown of how Document AI operates and its business impact: https://t.co/cAlgQT2Lv2

llama_index
@llama_index
📅
Nov 18, 2025
157d ago
🆔35791697

LlamaAgents is now in open preview - the fastest way to build, serve, and deploy multi-step document agents that combine LlamaCloud's document extraction and parsing power with Agent Workflows orchestration.
🚀 Get started instantly with pre-built templates for SEC filings, invoice processing, document Q&A and more
🛠️ Use llamactl CLI to serve agents locally
☁️ Deploy to production in LlamaCloud with a single command by pointing to your git repository
📊 Build agents that extract structured data, classify documents, and include human-in-the-loop review steps
Perfect for automating complex document workflows where you need both powerful parsing and precise control over the agent's decision-making process. Read the full announcement and see the SEC Insights agent demo: https://t.co/6ELWb5JMkl

llama_index
@llama_index
📅
Nov 18, 2025
157d ago
🆔88855185

The @GoogleDeepMind team just dropped Gemini 3, and we at LlamaIndex have day-zero support! We also made a little demo to show how you can leverage the advanced agentic capabilities and structured output accuracy of Gemini 3 to automate your GitHub workflow around PRs, you just need to run: pip install pr-manager
👩‍💻 Check out the GitHub repo: https://t.co/Txtdi2vmvx
🎥 Or take a look at the demo below 👇

jerryjliu0
@jerryjliu0
📅
Nov 18, 2025
157d ago
🆔40873810

We’ve built one of the most advanced ways to help you automate knowledge work over your documents. A lot of document work depends on encoding custom processes. For instance, enforcing custom validation checks, doing web search, integrating with external systems. LlamaAgents is a full product suite that lets you build and deploy an agentic document extraction workflow, orchestrated purely through code.
🚫 It is not a drag-and-drop builder
✅ It directly integrates with the LlamaCloud suite: document parsing, extraction, classification, indexing.
✅ It lets you orchestrate workflows through code, meaning it’s infinitely customizable
✅ It gives you the app deployment layer out of the box - and you can even customize the app layer!
Come check it out: https://t.co/miJCJPj1BA Docs: https://t.co/nLzTT9hoc4

@llama_index • Tue Nov 18 17:03

LlamaAgents is now in open preview - the fastest way to build, serve, and deploy multi-step document agents that combine LlamaCloud's document extraction and parsing power with Agent Workflows orchestration. 🚀 Get started instantly with pre-built templates for SEC filings, invoi

llama_index
@llama_index
📅
Nov 19, 2025
156d ago
🆔64553959

Agentic Document Workflows are crucial for AI-driven knowledge work and automation, but they are often treated as black boxes, which leads to silent failures and unexpected behaviors. With our Agent Workflows you don't have to worry about not knowing what is happening behind the scenes of your application, thanks to our built-in observability features that you can easily integrate with tracing pipelines. In our latest blog, @itsclelia shows how you can instrument your workflows to gain reliable insights over how your unstructured documents get turned into structured data, using @opentelemetry and @JaegerTracing.
📚 Read the article: https://t.co/FMPHBClMJo
👩‍💻 Check out the code: https://t.co/8cCP8WwCp0

llama_index
@llama_index
📅
Nov 20, 2025
155d ago
🆔23840862

Not another PDF parser 📄 🤯? Here's why AI-powered document parsing is all the rage. AI document parsing has evolved beyond OCR to systems that truly understand documents like humans do 🧠 In our latest blog post, we explore what's changing the game:
📊 Zero-shot semantic layout reconstruction - LLMs can now understand document structure without templates or training data
🔍 Deep multimodal understanding - Processing text, tables, charts, and images together out of the box
🤖 Agent engineering approach - Parsers that plan, reflect, and self-correct through reasoning workflows
⚙️ Enterprise-ready solutions - Moving beyond basic LLM APIs to production systems with metadata, provenance, and reliability
Read the full article: https://t.co/YuI1oXd3Uk
