Your curated collection of saved posts and media

Showing 32 posts Β· last 14 days Β· by score
I
Tanishq Mathew Abraham, Ph.D.
@iScienceLuvr
πŸ“…
Tue Apr 22
πŸ†”73913332

Think Deep, Think Fast: Investigating Efficiency of Verifier-free Inference-time-scaling Methods "This work conducts a comprehensive analysis of inference-time scaling methods for both reasoning and non-reasoning models on challenging reasoning tasks." "Non-reasoning models… https://t.co/g8tqp3whNQ

Media 1
❀️215
likes
πŸ”37
retweets
πŸ–ΌοΈ Media
W
wordgrammer
@wordgrammer
πŸ“…
Mon
πŸ†”11145627

o4-mini is the single greatest search engine that I have ever used. It can scan the entirety of PyTorch’s github to find the exact operation I am interested in https://t.co/tbC3jcKDCO

Media 1
❀️530
likes
πŸ”25
retweets
πŸ–ΌοΈ Media
O
Omar Sanseviero
@osanseviero
πŸ“…
Sun
πŸ†”73405738

We're building Gemma as a community and developer-centric project πŸ€—Please share your asks, feedback, and pain points We announced PaliGemma 2 Mix, Gemma 3, Gemma 3 QAT, TxGemma, and DolphinGemma. And there's much more to come! We're learning, hearing, and improving. Let's go! https://t.co/niICRSnNLC

Media 1
❀️284
likes
πŸ”27
retweets
πŸ–ΌοΈ Media
H
Hamel Husain
@HamelHusain
πŸ“…
Sun
πŸ†”31787957

Utterly confusing, funny and sad https://t.co/wOMoh4MYd5

Media 1
❀️2,732
likes
πŸ”81
retweets
πŸ–ΌοΈ Media
L
Lior⚑
@LiorOnAI
πŸ“…
Sun
πŸ†”44340421

One of the most impressive AI demo I've seen. This is the future of customer service. Agents that can understand text, speech, images and even live video. Soon to be all open-source. https://t.co/kKoNTgTJ1T

❀️3,071
likes
πŸ”377
retweets
πŸ–ΌοΈ Media
H
Harrison Chase
@hwchase17
πŸ“…
Sun
πŸ†”61506421

OpenAI recently released a guide on building agents which contains some misguided takes There's a lot of FUD, confusion, hype, and noise around agents I wrote a blog on how to think about agent frameworks. Includes: Background Info - What is an agent? - What is hard about… https://t.co/7VKe9VMBad

Media 1
❀️927
likes
πŸ”126
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Sun
πŸ†”09812801

Interesting how much specifically instructions are given for making games. AI labs are optimizing for viral use cases. https://t.co/PWQdkuz0zw

Media 1
❀️437
likes
πŸ”27
retweets
πŸ–ΌοΈ Media
M
matt palmer
@mattppal
πŸ“…
Sat
πŸ†”72889504

Pydantic quietly dropped the most straightforward framework for building AI Agents. This ~28 liner builds an agent that can fetch URLs with a "fetch" MCP server. https://t.co/enGmnuOZqE

Media 1
❀️1,542
likes
πŸ”135
retweets
πŸ–ΌοΈ Media
I
Ivan Leo
@ivanleomk
πŸ“…
Sun
πŸ†”77897966

haha signed up for @elevenlabsio just to play with their conversational AI agent https://t.co/U1iET1d0mu

Media 1
❀️2
likes
πŸ–ΌοΈ Media
I
Ivan Leo
@ivanleomk
πŸ“…
Sun
πŸ†”35961411

Voice agent simulations let employees perfect interactions hundreds of times with personalised feedback before meeting real customers. Using @elevenlabsio's new conversational AI, watch as I try my best to explain the in-flight menu to a fictional customer :) https://t.co/1fjIGUwrBH

❀️8
likes
πŸ–ΌοΈ Media
L
Lior⚑
@LiorOnAI
πŸ“…
Sun
πŸ†”40463034

Ilya Sutskever (OpenAI cofounder) top 30 must-read research papers. "If you really learn all of these, you’ll know 90% of what matters today" https://t.co/VprtIapiFF

Media 1
❀️4,114
likes
πŸ”420
retweets
πŸ–ΌοΈ Media
J
Jerry Liu
@jerryjliu0
πŸ“…
Sun
πŸ†”54551851

I’m releasing a set of slides I’ve used for various talks which lays out the architecture for agentic document workflows - how LLMs can parse, reason over, and act on PDFs, Excel etc. In general we’re really excited about using AI agents to automate knowledge work over… https://t.co/yxWRY6z08t

Media 1
❀️413
likes
πŸ”48
retweets
πŸ–ΌοΈ Media
I
Tanishq Mathew Abraham, Ph.D.
@iScienceLuvr
πŸ“…
Sat
πŸ†”55397632

I hear a lot of talk about zero-knowledge proofs from crypto folks and I had no idea what it was until I watched this very intuitive video, it's actually quite interesting! https://t.co/ale4fQUhNv

Media 1
❀️116
likes
πŸ”7
retweets
πŸ–ΌοΈ Media
H
Hamel Husain
@HamelHusain
πŸ“…
Sat
πŸ†”08592719

It’s definitely worth reading this post for anyone using Claude code. TIL that the word β€œultrathink” will result in maximum thinking Lots of other great tips in here https://t.co/ETS1eaqh5V https://t.co/yxUhR1tkXL

Media 1
❀️481
likes
πŸ”38
retweets
πŸ–ΌοΈ Media
T
Tanishq Kumar
@tanishqkumar07
πŸ“…
Wed
πŸ†”87278163

trained a nanoGPT? feeling behind before o4-mini? 🚨🚨i'm open-sourcing beyond-nanoGPT, an internal codebase to help people go from LLM basics to research-level understanding. 🚨🚨 it contains thousands of lines of from-scratch, annotated pytorch implementing advanced… https://t.co/51165pg73q

Media 1
❀️318
likes
πŸ”48
retweets
πŸ–ΌοΈ Media
L
Lior⚑
@LiorOnAI
πŸ“…
Sat
πŸ†”05874030

You can now run 100B parameter models on your local CPU without GPUs. Microsoft finally open-sourced their 1-bit LLM inference framework called bitnet.cpp: > 6.17x faster inference > 82.2% less energy on CPUs > Supports Llama3, Falcon3, and BitNet models https://t.co/AGPOsUjlyB

❀️6,166
likes
πŸ”794
retweets
πŸ–ΌοΈ Media
L
LlamaIndex πŸ¦™
@llama_index
πŸ“…
Sat
πŸ†”60060975

We’re excited to feature ZapGit πŸ’« - an all-in-one place for you to manage @github issues and PRs through a natural language client πŸ§‘β€πŸ’» Made possible by MCP (@zapier servers) and plugged into an agent workflow (@llama_index) 1. Choose the @github action and the repo 2. Agent… https://t.co/qkp50i2SOc

❀️23
likes
πŸ”3
retweets
πŸ–ΌοΈ Media
M
Maxim Lott
@maximlott
πŸ“…
Sat
πŸ†”01439290

ChatGPT's new o3 model set a new IQ record, based on my site https://t.co/MKlEC93EK4 It got an IQ of 136! That's top 1% for humans. Thread: https://t.co/iHGpBlpRuD

Media 1
❀️95
likes
πŸ”20
retweets
πŸ–ΌοΈ Media
E
eric zakariasson
@ericzakariasson
πŸ“…
Fri
πŸ†”67417085

rolling out @cursor_ai 0.49 here’s what’s new ↓ https://t.co/5WWwdOPL4f

Media 1
❀️3,020
likes
πŸ”162
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Fri
πŸ†”31066719

Gemini 2.5 Flash is here! It's Google's first hybrid model, which allows you to turn thinking on or off. It has a new parameter, thinking_budget (i.e., max # of thinking tokens), to control quality, cost, and latency. Flash also leads in the price-to-performance ratio. https://t.co/QHqy1h2BOG

Media 1
❀️106
likes
πŸ”12
retweets
πŸ–ΌοΈ Media
L
LlamaIndex πŸ¦™
@llama_index
πŸ“…
Fri
πŸ†”63432348

A full-stack JavaScript web app using LlamaExtract to perform financial analysis! LlamaExtract, part of LlamaCloud, allows you to create agents by defining reusable schemas that precisely define what structured data you want extracted from complex documents. In this example,… https://t.co/AgvOLKk4Pd

❀️43
likes
πŸ”7
retweets
πŸ–ΌοΈ Media
A
Aravind Srinivas
@AravSrinivas
πŸ“…
Fri
πŸ†”97908399

Perplexity serves MoEs like post-trained versions of DeepSeek-v3. These models can be made to utilize GPUs efficiently in multi-node settings, achieving high throughput and low latency simultaneously, compared to single-node deployments. https://t.co/pZwOaRb0oZ

Media 1
❀️385
likes
πŸ”17
retweets
πŸ–ΌοΈ Media
K
kwindla
@kwindla
πŸ“…
Fri
πŸ†”46224780

Announcing: Voice AI course and online community ... @swyx and I are hosting a month-long technical deep dive into Voice AI and Voice Agents. Our goals are to: ➑️ cover all the lessons we've learned over the last two years building realtime, conversational AI, ➑️host fun… https://t.co/E68FivL4y0

Media 1
❀️239
likes
πŸ”31
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Fri
πŸ†”47265986

I am frequently asked: "when will AI work with well spreadsheets?" Quietly, Google has come some way in realizing this vision, using AI at both the cell & sheet level (and running code). Here are examples of me taking fake startup financials I use in class and it spotting issues https://t.co/MkFqKnmMQK

Media 1Media 2
❀️717
likes
πŸ”51
retweets
πŸ–ΌοΈ Media
S
swyx
@swyx
πŸ“…
Fri
πŸ†”55264328

sooo @aiDotEngineer now has an official MCP server :) (and a @jeremyphoward llms.txt) try adding this to your friendly neighborhood VS Code Fork: and then convert your codebase into a talk with natural language inside your IDE: happy to share that @threepointone and i vibe… https://t.co/HGufVdeRju

Media 1Media 2
❀️89
likes
πŸ”22
retweets
πŸ–ΌοΈ Media
J
Jonathan Whitaker
@johnowhitaker
πŸ“…
Fri
πŸ†”26963275

Dang it, I made an eval I thought I'd trounce LLMs at: identifying species in photos I've taken over the years, given ~5 plausible options. TIL 1) I don't know my latin names as well as I thought, and 2) 4o apparently does πŸ˜‚ Writeup once I do the human baseline score + polish https://t.co/iEVirKt3Vz

Media 1Media 2
❀️9
likes
πŸ”1
retweets
πŸ–ΌοΈ Media
I
Isaac Flath
@isaac_flath
πŸ“…
Fri
πŸ†”81509063

πŸŽ‰ Finally Cursor has notebook support! https://t.co/pNLRU8ZU7d

❀️28
likes
πŸ”7
retweets
πŸ–ΌοΈ Media
B
Ben ClaviΓ©
@bclavie
πŸ“…
Mon
πŸ†”01323950

Multimodal RAG: Just use ColPali/DSE then pass your screenshots to the LLM This is the dream, but how well do LLMs read text contained in images? We wanted to know, so we tried a simple thing: do results change on evals when using screenshots rather than text as input? Yes. https://t.co/j23rObYcG0

Media 1
❀️439
likes
πŸ”82
retweets
πŸ–ΌοΈ Media
A
ARC Prize
@arcprize
πŸ“…
Tue Jun 10
πŸ†”80395332

o3 Pro on ARC-AGI Semi Private Eval Results ARC-AGI-1: * Low: 44%, $1.64/task * Medium: 57%, $3.18/task * High: 59%, $4.16/task ARC-AGI-2: * All reasoning efforts: <5%, $4-7/task Takeaways: * o3-pro in line with o3 performance * o3's new price sets the ARC-AGI-1 Frontier https://t.co/ihTP82ue4D

Media 1
❀️740
likes
πŸ”96
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Tue Jun 10
πŸ†”77794146

That Altman essay… One thing you can definitely say about him and Dario is that they are making very bold, very testable predictions. We will know whether they are right or wrong in a remarkably short time https://t.co/4NQCIHrBSQ

Media 1
❀️1,079
likes
πŸ”99
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Tue Jun 10
πŸ†”46196673

This was less than almost every estimate I have seem: according to the latest Sam Altman post, the average ChatGPT query uses about the same amount of power as the average Google search in 2009 (the last time they released a per-search number)… 0.0003 kWh https://t.co/AgVQB7zkOu

Media 1Media 2
❀️861
likes
πŸ”85
retweets
πŸ–ΌοΈ Media
C
Chris Albon
@chrisalbon
πŸ“…
Wed
πŸ†”72600877

As someone who was formally trained in applied statistics, this book legitimately changed my life. It's old now, but fundamentally it's the intellectual bridge between statistics and machine learning. And I crossed it. https://t.co/2YSfFbQ5gM

Media 1
❀️395
likes
πŸ”19
retweets
πŸ–ΌοΈ Media