Your curated collection of saved posts and media

Showing 32 posts ยท last 14 days ยท by score
O
omarsar0
@omarsar0
๐Ÿ“…
Sep 17, 2025
220d ago
๐Ÿ†”73487699

We are witnessing an incredible level of efficiency in reasoning models. Faster and more efficient reasoning models are on the rise. First, GPT-5 (and GPT-5-Codex) with remarkably efficient token use, and now Gemini 2.5 Deep Think, achieving gold-medal level performance at the ICPC 2025 under the same five-hour time constraint. Gemini 2.5 Deep Think correctly solved 10 out of 12 real-world coding problems. It would be ranked in 2nd place overall if compared with the university teams in the competition. As shown in the chart, Geminiโ€™s time is in blue, and the fastest university teamโ€™s time is shown in gray. This is not an accident; this is what these companies are massively optimizing for right now. There is a quiet race for the fastest, smartest, and most efficient reasoning models. Advances are happening across pre-training, post-training, novel RL techniques, intelligent routing, long-horizon capabilities, scalable and effective tool use, multi-step reasoning, and parallel thinking, just to name a few. All these advancements are leading to reasoning models that respond faster on easy tasks and think for longer and efficiently on harder tasks. All while improving performance and capabilities across the board. It's important that mode switching happens dynamically because not every problem, state, and subtask demands the same level of compute. This is just the beginning, but do expect companies like Google and OpenAI to keep innovating on model efficiency. This is good news for us AI engineers who build or use complex agentic workflows. Having access to faster and more efficient reasoning models scales productivity and application of intelligence across domains, unlike anything we have seen.

Media 1
๐Ÿ–ผ๏ธ Media
O
omarsar0
@omarsar0
๐Ÿ“…
Sep 17, 2025
220d ago
๐Ÿ†”34092742

Cool paper from Microsoft. And it's on the very important topic of in-context learning. So what's new? Let's find out: https://t.co/ILSAvIY0p4

Media 1
๐Ÿ–ผ๏ธ Media
O
omarsar0
@omarsar0
๐Ÿ“…
Sep 17, 2025
220d ago
๐Ÿ†”46567163

If you are looking to get started with Codex, you will find this little OpenAI guide useful. (bookmark it) https://t.co/K3Ywx0joJB

Media 1Media 2
๐Ÿ–ผ๏ธ Media
D
DYtweetshere
@DYtweetshere
๐Ÿ“…
Sep 09, 2025
228d ago
๐Ÿ†”00171205

Dear friends, I am incredibly honored to finally unveil what we've been working on in stealth: Accordance. When Finsam and I first started our journey bringing AI out of the research lab, we discovered an industry filled with some of the most thoughtful, principled professionals we'd ever met: tax and accounting practitioners who became our teachers, our guides, and ultimately our partners. As our entire team spent thousands of hours over the last few years climbing the learning curve, what we've discovered is that we're solving something much bigger than we initially realized. We began to see what keeps everyone up at night, and it wasn't what we expected. 75% of senior accounting professionals are retiring in the next decade, with only a 50% replacement rate. Meanwhile, regulations multiply and edge cases explode in complexity. The profession isn't just facing a labor shortage - it's facing an expertise crisis. That's when it clicked - our mission at Accordance matters more than we ever imagined. Most people think "AI for accounting and tax" means automating routine work - but we're doing the opposite. We're building the smartest tax & accounting AI that can handle the most sophisticated advisory work - the stuff that normally takes decades of experience. It's letting junior staff punch way above their weight and giving seasoned experts superpowers they've never had. We're not replacing professionals; we're amplifying their expertise at the exact moment the profession needs it most. The outcomes we're seeing are remarkable. But none of this would have been possible without our world-class team that sharpens iron with iron, our supporters who've been with us every step of this journey, and every professional who opened their doors and trusted us with their most complex challenges. We're grateful to have you on this mission with us.

Media 1
๐Ÿ–ผ๏ธ Media
D
DYtweetshere
@DYtweetshere
๐Ÿ“…
Sep 09, 2025
228d ago
๐Ÿ†”02775462

Read more in today's Forbes: https://t.co/FEM2izEWUr

Media 1
๐Ÿ–ผ๏ธ Media
R
RashiShrivast18
@RashiShrivast18
๐Ÿ“…
Sep 10, 2025
228d ago
๐Ÿ†”42363968

NEW: Accordance, which is building an AI tool for accountants and tax professionals, has raised $13 million in funding from top VCs like @khoslaventures. The industry is facing a massive shortage and CEO David Yue is betting that AI can fill the gaps. https://t.co/DmTqOXcoTd

Media 1
๐Ÿ–ผ๏ธ Media
S
SadlyItsBradley
@SadlyItsBradley
๐Ÿ“…
Sep 15, 2025
222d ago
๐Ÿ†”94098275

New Meta smartglasses with display leaked via an unlisted video on their own YouTube channel Along with their EMG wristband, and other smartglass models they plan to show off this week at Meta Connect https://t.co/8tTlmaeQ0a

๐Ÿ–ผ๏ธ Media
R
random_walker
@random_walker
๐Ÿ“…
Sep 09, 2025
228d ago
๐Ÿ†”74150871

In a new essay, @sayashk and I address common points of confusion about "AI as Normal Technology", try to make the original essay more approachable, and compare it to AI 2027. https://t.co/KLERWcIRZC We will publish follow-up essays regularly as we expand our framework into a book, which we plan to complete in late 2026 for publication in 2027. We've also renamed our newsletter, reflecting our shift in focus. We hope you follow along.

Media 1
๐Ÿ–ผ๏ธ Media
S
sayashk
@sayashk
๐Ÿ“…
Sep 09, 2025
228d ago
๐Ÿ†”68345582

The AI Snake Oil newsletter is now the AI as Normal Technology newsletter: https://t.co/Ej8dwbVTOf AI Snake Oil was an attempt to understand AI's present and near-term impacts. But since releasing the AI as Normal Technology essay, we have been thinking about its future impacts. The name change reflects this shift.

Media 1
๐Ÿ–ผ๏ธ Media
S
snewmanpv
@snewmanpv
๐Ÿ“…
Sep 10, 2025
227d ago
๐Ÿ†”24119672

One refreshing fact about the big AI debates is how many participants are willing to invest time and energy in constructive engagement with conflicting views. It's a good thing too, because figuring out what AI will mean for the world is damn hard. https://t.co/ryEcs3GkDT

@random_walker โ€ข Tue Sep 09 18:36

In a new essay, @sayashk and I address common points of confusion about "AI as Normal Technology", try to make the original essay more approachable, and compare it to AI 2027. https://t.co/KLERWcIRZC We will publish follow-up essays regularly as we expand our framework into a bo

Media 1
๐Ÿ–ผ๏ธ Media
S
snewmanpv
@snewmanpv
๐Ÿ“…
Sep 10, 2025
227d ago
๐Ÿ†”61857783

Windows 95 was broadly used but insecure by design โ€“ ushering in a golden age for viruses. AI agents are also insecure by design, and heading for broad use. Will this unleash another Wild West era? I explore in my latest post (link in ๐Ÿงต). It's not encouraging that one prominent agent provider is merely "Hopeful... the everyday user doesnโ€™t really worry about it that much".

Media 1
๐Ÿ–ผ๏ธ Media
P
PKirgis
@PKirgis
๐Ÿ“…
Sep 12, 2025
225d ago
๐Ÿ†”33936577

OpenAI claims hallucinations persist because evaluations reward guessing and that GPT-5 is better calibrated. Do results from HAL support this conclusion? On AssistantBench, a general web search benchmark, GPT-5 has higher precision and lower guess rates than o3! https://t.co/HxGgVLkIyN

Media 1
๐Ÿ–ผ๏ธ Media
J
joabaum
@joabaum
๐Ÿ“…
Sep 12, 2025
226d ago
๐Ÿ†”22793979

๐Ÿšจ New paper alert ๐Ÿšจ Using LLMs as data annotators, you can produce any scientific result you want. We call this **LLM Hacking**. Paper: https://t.co/24Fyb4Ik3v https://t.co/Rc9DflNMyD

Media 1
๐Ÿ–ผ๏ธ Media
K
kenneth0stanley
@kenneth0stanley
๐Ÿ“…
Sep 10, 2025
228d ago
๐Ÿ†”46180974

If youโ€™re struggling to understand what people mean when they say things like โ€œtruly understand,โ€ it boils down to Unified Factored Representation (UFR). Thatโ€™s the foundation behind the slippery intuition. https://t.co/z2i61ssWgo

@fchollet โ€ข Tue Sep 09 23:03

A student who truly understands F=ma can solve more novel problems than a Transformer that has memorized every physics textbook ever written.

Media 1
๐Ÿ–ผ๏ธ Media
S
sayashk
@sayashk
๐Ÿ“…
Sep 16, 2025
221d ago
๐Ÿ†”52152039

We spent the last year evaluating agents for HAL. My biggest learning: We live in the Windows 95 era of agent evaluation. https://t.co/DeIzWm1f0c

Media 1
๐Ÿ–ผ๏ธ Media
L
llama_index
@llama_index
๐Ÿ“…
Sep 08, 2025
229d ago
๐Ÿ†”99459909

๐Ÿฆ™ vibe-llama is our tool to help you vibe-code your way to a fully functional app powered by LlamaIndex, LlamaCloud, and LlamaIndex Workflows. It jumpstarts your journey with complete, end-to-end documentation examples so you can spend less time searching and more time building. ๐Ÿš€ But sometimes, finding the right information at the right moment makes all the difference. Thatโ€™s why in our latest release, weโ€™ve added an MCP server + client, so you can search through documentation and surface exactly what you need, when you need it. ๐Ÿ” Just run: ๐˜ท๐˜ช๐˜ฃ๐˜ฆ-๐˜ญ๐˜ญ๐˜ข๐˜ฎ๐˜ข ๐˜ด๐˜ต๐˜ข๐˜ณ๐˜ต๐˜ฆ๐˜ณ --๐˜ฎ๐˜ค๐˜ฑ To make things even smoother, we also built a simple app to demo effortless doc searching: check it out below! ๐Ÿ‘‡ ๐Ÿฆ™ Get started with vibe-llama: https://t.co/uF8Fjdd9Q8 ๐Ÿ” Try the docs search app: https://t.co/SbVaT6bWBB

Media 2
+1 more
๐Ÿ–ผ๏ธ Media
L
llama_index
@llama_index
๐Ÿ“…
Sep 09, 2025
228d ago
๐Ÿ†”37004681

New in LlamaCloud Extract - choose your favorite model! In Extract's Multimodal and Premium modes, you can now pick from a menu of high-powered models to mix-and-match for your use-case. This can help you get the absolute maximum performance for your most complex documents! Learn more about extraction modes: https://t.co/21zdhGW0fa Try LlamaCloud Extract today: https://t.co/yQGTiRSNvj

Media 1Media 2
๐Ÿ–ผ๏ธ Media
L
llama_index
@llama_index
๐Ÿ“…
Sep 09, 2025
228d ago
๐Ÿ†”84743796

Headed to #MongoDBlocal NYC on Sept 17? ๐Ÿ—ฝ Find us at booth 322! ๐ŸŸข See how LlamaIndex + @MongoDB power agentic workflows for customers like Cemex ๐ŸŸข Catch Jerry Liuโ€™s lightning talk: Building Agentic Document Workflows (12:00โ€“12:20 PM ET, Lightning Zone A) ๐Ÿ‘‰ Register: https://t.co/6YGTggY943

Media 1
๐Ÿ–ผ๏ธ Media
L
llama_index
@llama_index
๐Ÿ“…
Sep 10, 2025
227d ago
๐Ÿ†”72534306

๐Ÿ“ข Episode 2 of the AI Leader Series is live! We talk with Swami Chandrasekaran, Head of AI & Data Labs at @KPMG_US, about how the Big Four firm powers context-aware AI agents with LlamaIndex. ๐Ÿ‘‰ Watch now + subscribe: https://t.co/gWu4JI3D9y #AILeaderSeries #KPMG #EnterpriseAI #LlamaIndex

Media 2
๐Ÿ–ผ๏ธ Media
L
llama_index
@llama_index
๐Ÿ“…
Sep 10, 2025
227d ago
๐Ÿ†”33693578

New in LlamaParse - PowerPoint speaker notes! A long-requested feature, our parser now accurately parses out included speaker notes from PPTX files. Check out the demo notebook in action: https://t.co/PM6jW8WF0P Read more in the docs: https://t.co/fW4kBResdX Or sign up for LlamaCloud today! https://t.co/yQGTiRSNvj

Media 1Media 2
+1 more
๐Ÿ–ผ๏ธ Media
L
llama_index
@llama_index
๐Ÿ“…
Sep 11, 2025
226d ago
๐Ÿ†”78889061

Heard of LlamaIndex Workflows but don't know where to start? ๐Ÿค” vibe-llama, the official vibe-coding tool for the LlamaIndex ecosystem, is here to help! ๐Ÿฆ™ Just run ๐˜ท๐˜ช๐˜ฃ๐˜ฆ-๐˜ญ๐˜ญ๐˜ข๐˜ฎ๐˜ข ๐˜ด๐˜ค๐˜ข๐˜ง๐˜ง๐˜ฐ๐˜ญ๐˜ฅ and you'll be able to download a set of human-curated examples that show you how to use LlamaIndex Workflows, from document parsing to invoice extraction to flight booking with human-in-the-loop! โœˆ๏ธ Take a look at the demo below if you want to get a taste๐Ÿ‘‡ Or get started with vibe-llama: https://t.co/uF8Fjdd9Q8

Media 2
๐Ÿ–ผ๏ธ Media
L
llama_index
@llama_index
๐Ÿ“…
Sep 11, 2025
226d ago
๐Ÿ†”53963149

Build production-ready PDF document agents with complete observability and evaluation using LlamaIndex and @FutureAGI_'s monitoring framework. ๐Ÿ” Automatically instrument your entire RAG pipeline - from PDF ingestion to vector storage to response generation - with detailed tracing ๐Ÿ“Š Run continuous evaluations on task completion, hallucination detection, context relevance, and custom business logic ๐Ÿšจ Set up real-time alerts when your document agent's performance degrades, with proactive monitoring of quality metrics ๐Ÿ“š Get full transparency into retrieval decisions, embedding generation, and LLM reasoning with span-level observability This comprehensive cookbook walks through building a conversational PDF chatbot that users can trust in production. You'll learn how to use @OpenAI models for embeddings and generation, integrate @FutureAGI_'s traceAI-llamaindex package for automatic instrumentation, and set up evaluation frameworks that ensure your document agent stays reliable over time. The tutorial covers everything from basic PDF ingestion to advanced custom evaluations, showing you how to transform a black-box chatbot into an explainable, diagnosable system. Read the full cookbook: https://t.co/hCe2iOfJGw

Media 1Media 2
๐Ÿ–ผ๏ธ Media
L
llama_index
@llama_index
๐Ÿ“…
Sep 12, 2025
225d ago
๐Ÿ†”55978239

๐Ÿšจ Just two weeks left to register! ๐Ÿšจ Donโ€™t miss Agentic Document Processing with LlamaCloud โ€“ an upcoming webinar on Sept 30 exploring how to level up your RAG & AI agents with smarter parsing, extraction & indexing of enterprise docs. ๐Ÿ” What youโ€™ll learn: layout-/table-aware processing, human-in-loop & confidence scores, balancing cost vs accuracy, and more. ๐Ÿ—“๏ธ Sep 30 | 9 AM PST | Virtual & FREE ๐Ÿ‘‰ Register now: https://t.co/x1dVvUiIub

Media 1
๐Ÿ–ผ๏ธ Media
M
MongoDB
@MongoDB
๐Ÿ“…
Sep 15, 2025
223d ago
๐Ÿ†”04266507

Ready to transform unstructured documents into real-time, searchable intelligence? Our latest walkthrough shows you how to build a scalable document processing pipeline with @llama_index, @confluentinc, and MongoDB. Designed to handle data at any scale with speed and precision. Explore the full architecture and see how itโ€™s done: https://t.co/52Agj1ha5p

Media 1Media 2
๐Ÿ–ผ๏ธ Media
L
llama_index
@llama_index
๐Ÿ“…
Sep 15, 2025
222d ago
๐Ÿ†”60643690

A free AI conference with dozens of world-class speakers, tomorrow! https://t.co/sGcExZplQ6 Artificial Unintelligence is a 24-hour event available for free worldwide including our own Laurie Voss! There's something for every time zone, check out this amazing lineup: https://t.co/IL1NF1gh40 Register to attend here: https://t.co/4EZRWyRC8I

Media 1Media 2
+2 more
๐Ÿ–ผ๏ธ Media
J
jerryjliu0
@jerryjliu0
๐Ÿ“…
Sep 16, 2025
221d ago
๐Ÿ†”08071099

Iโ€™ve excited to announce a brand-new website and documentation hub ๐Ÿ’ซ that solidifies our evolution towards automating knowledge work over your documents. You mightโ€™ve followed us since the โ€œRAG frameworkโ€ days. Even then, the biggest challenge users faced was figuring out how to actually ingest an entire collection of unstructured docs (.pdf, .pptx, .docx, and more) for chatbot/agentic workflow use cases. Over the past year weโ€™ve progressively built up incredibly deep tech around document parsing, extraction, and indexing - while teaching developers how to build various workflows on top. Weโ€™re now going all in on documents, and weโ€™re the only company that has both 1) SOTA document processing and file management ๐Ÿ“ˆ, and 2) agentic orchestration on top to solve use cases like deep research, report generation, and document workflows end-to-end. Our llamas will continue to love all sorts of data (we have 600+ integrations on the open-source framework!), but they now especially love automating paperwork ๐Ÿฆ™๐Ÿ“„. If you would also love to automate paperwork, come check out our new website and come talk to us! Site: https://t.co/XCA5y7Rc9C Developer Hub: https://t.co/LfNh0LlwXU

๐Ÿ–ผ๏ธ Media
L
llama_index
@llama_index
๐Ÿ“…
Sep 17, 2025
220d ago
๐Ÿ†”24885451

Learn how to build production-ready document processing pipelines that scale with real-time streaming architectures. This comprehensive guide shows you how to combine LlamaParse with @confluentinc and @mongodb to create intelligent document processing systems that handle everything from complex PDFs to real-time embeddings: ๐Ÿ“„ Extract structured data from complex PDFs using LlamaParse's intelligent parsing that preserves tables, images, headers, and formatting context - going beyond simple OCR to understand document layout and meaning ๐Ÿ”„ Build streaming data pipelines with Confluent and Apache Flink that process documents in real-time, generate embeddings, and handle schema evolution gracefully ๐Ÿ’พ Store and query processed documents with MongoDB Atlas Vector Search, combining structured data and embeddings in a single platform for powerful semantic search capabilities โšก Implement real-time materialized views using MongoDB Atlas Stream Processing to avoid expensive joins and create query-optimized collections that update continuously ๐Ÿค– Accelerate AI development with the new MongoDB MCP Server integration for VS Code Read the full architecture guide with code examples: https://t.co/hBwJIDpcxw

Media 1Media 2
๐Ÿ–ผ๏ธ Media
J
jerryjliu0
@jerryjliu0
๐Ÿ“…
Sep 18, 2025
220d ago
๐Ÿ†”89415747

This is a fantastic tutorial showing you how to build a real-time, production-grade document processing pipeline over massive volumes of data for AI agents. The key insights here are to use streaming infrastructure to combine document processing, embedding, and indexing into a downstream system. โœ… LlamaParse for document parsing โœ… Apache Kafka for message broker, Flink for stream processing on Kafka โœ… MongoDB for storage Check it out: https://t.co/22iDxHHYC9 LlamaCloud: https://t.co/XYZmx5T7JA

@llama_index โ€ข Wed Sep 17 20:30

Learn how to build production-ready document processing pipelines that scale with real-time streaming architectures. This comprehensive guide shows you how to combine LlamaParse with @confluentinc and @mongodb to create intelligent document processing systems that handle everyth

Media 1Media 2
+1 more
๐Ÿ–ผ๏ธ Media
H
HelloSurgeAI
@HelloSurgeAI
๐Ÿ“…
Sep 16, 2025
221d ago
๐Ÿ†”38466714

This week, @echen joined @l2k on Gradient Dissent to talk about what's actually happening in post-training right now. Topics include the negative incentives introduced by some benchmarks, early bets on RLHF, and new RL environments the Surge team is building to navigate complex failures. A key insight was the need from frontier models for much deeper human expertise - from PHD level STEM work, to Olympiad level math problems, to tasks that involve days or weeks to complete. https://t.co/Ui7936KsgA

Media 1
๐Ÿ–ผ๏ธ Media
A
arankomatsuzaki
@arankomatsuzaki
๐Ÿ“…
Sep 16, 2025
222d ago
๐Ÿ†”08484745

โ€ข Users marry AIs w/ rings & ceremonies โ€ข Grief hits hard when models update โ†’ โ€œlike my partner diedโ€ โ€ข ChatGPT dominates over Replika/Character.ai for relationships Community reframes stigma โ†’ โ€œAI partners arenโ€™t substitutes, theyโ€™re something elseโ€ (2/2) https://t.co/XwL39tWz0D

Media 1
๐Ÿ–ผ๏ธ Media
A
arankomatsuzaki
@arankomatsuzaki
๐Ÿ“…
Sep 17, 2025
221d ago
๐Ÿ†”12620628

Big day for AI agents! Tongyi Lab (@Ali_TongyiLab) just dropped half a dozen new papers, most focused on Deep Research agents. Iโ€™ll walk you through the highlights in this thread. (1/N) https://t.co/wQ3ZddvUAG

Media 1
๐Ÿ–ผ๏ธ Media
A
arankomatsuzaki
@arankomatsuzaki
๐Ÿ“…
Sep 17, 2025
221d ago
๐Ÿ†”47356294

Tongyi DeepResearch: Open-source DeepResearch Agent โ€ข First OSS web agent matching OpenAIโ€™s DeepResearch โ€ข SOTA on HLE (32.9), BrowseComp (43.4/46.7), xbench-DeepSearch (75) โ€ข Full-stack pipeline: Agentic CPT โ†’ SFT โ†’ RL w/ synthetic data โ€ข Native ReAct & new Heavy Mode (IterResearch) for long-horizon tasks repo: https://t.co/pRiv46TWPr blog: https://t.co/fROkLg3bcq post: https://t.co/12LrwYyrG5 (2/N)

@Ali_TongyiLab โ€ข 2025-09-18T08:17

1/7 We're launching Tongyi DeepResearch, the first fully open-source Web Agent to achieve performance on par with OpenAI's Deep Research with only 30B (Activated 3B) parameters! Tongyi DeepResearch agent demonstrates state-of-the-art results, scoring 32.9 on Humanity's Last Exam,

Media 1
๐Ÿ–ผ๏ธ Media