Deleted this post because Kevin is right - it isn't clear that a better prompted LLM, or a consensus or pass experiment with multiple attempts, would not be able to solve more of these problems. Testing LLMs is challenging for these reasons. https://t.co/nM6wc4Rg7b
Game changer for scraping. This repo lets you easily scrape web pages and have the output in LLM-friendly formats (JSON, cleaned HTML, markdown). • Supports crawling multiple URLs simultaneously • Extracts and returns all media tags (Images, Audio, and Video) • Extracts all… https://t.co/bVx1OUeEIB
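The core conversion step such a scraper automates can be sketched with the standard library alone. This is a hypothetical illustration of HTML-to-markdown cleanup plus media-tag extraction, not the repo's actual API:

```python
from html.parser import HTMLParser

class MarkdownExtractor(HTMLParser):
    """Crude HTML -> markdown-ish text converter that also collects media tags."""
    def __init__(self):
        super().__init__()
        self.parts = []   # markdown fragments, joined at the end
        self.media = []   # (tag, src) pairs for img/audio/video

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag in ("img", "audio", "video") and "src" in attrs:
            self.media.append((tag, attrs["src"]))
        elif tag == "h1":
            self.parts.append("# ")
        elif tag == "li":
            self.parts.append("- ")

    def handle_endtag(self, tag):
        if tag in ("h1", "p", "li"):
            self.parts.append("\n")

    def handle_data(self, data):
        self.parts.append(data.strip())

def to_markdown(html):
    """Return (markdown_text, media_tags) for an HTML string."""
    p = MarkdownExtractor()
    p.feed(html)
    return "".join(p.parts), p.media
```

A real crawler would fetch pages concurrently and handle far more tags; this only shows the shape of the output an LLM-friendly scraper produces.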
Axolotl v0.8.0 is out today! Major features include support for Sequence Parallelism, Gemma3, Multimodal (beta), the Muon optimizer, and a major expansion to our docs! We've worked to make sure that our features are composable, leading to 3.6x speedups over vanilla HF+FA2 with >50%… https://t.co/WCZ0IJaqcz
Deleted this. The key point is true (all the major labs offer secure versions that will not train on your data, governed by the same rules as other cloud services), and it is also true that Claude will not train on your data. However, the policies for free Gemini are not that clear. https://t.co/yfhOY8dhZz
Pretty cool "Multi-Head Attention Shape Transformations (Cheat Sheet)" shared by a reader: https://t.co/9Nprk4XHgJ https://t.co/VBft7wtuR7
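The shape transformations such a cheat sheet covers can be traced in a few lines of NumPy. A sketch of the standard multi-head attention reshapes, with toy dimensions chosen for illustration:

```python
import numpy as np

batch, seq, d_model, n_heads = 2, 5, 16, 4
d_head = d_model // n_heads  # 16 / 4 = 4

x = np.random.randn(batch, seq, d_model)
Wq = np.random.randn(d_model, d_model)

# 1) project: (B, S, D) @ (D, D) -> (B, S, D)
q = x @ Wq
# 2) split heads: (B, S, D) -> (B, S, H, Dh) -> (B, H, S, Dh)
q = q.reshape(batch, seq, n_heads, d_head).transpose(0, 2, 1, 3)
k = v = q  # self-attention: K and V follow the same shape path
# 3) scores: (B, H, S, Dh) @ (B, H, Dh, S) -> (B, H, S, S)
scores = q @ k.transpose(0, 1, 3, 2) / np.sqrt(d_head)
weights = np.exp(scores - scores.max(-1, keepdims=True))
weights /= weights.sum(-1, keepdims=True)  # softmax over the last axis
# 4) mix values: (B, H, S, S) @ (B, H, S, Dh) -> (B, H, S, Dh)
out = weights @ v
# 5) merge heads: (B, H, S, Dh) -> (B, S, H, Dh) -> (B, S, D)
out = out.transpose(0, 2, 1, 3).reshape(batch, seq, d_model)
```

The two transposes (split then merge) are where most shape bugs hide; printing `.shape` after each step reproduces the cheat sheet.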
We've implemented a simple toolkit for fine-tuning powerful coding models using only RL with an entirely local, zero-setup sandboxed code interpreter. We found very promising results using a tiny fraction of data & training time vs SFT. Check out our blogpost for more details! … https://t.co/IMiRO3LS3C
AI 2027? AI predictions are wild... what do you think? https://t.co/HbEz3UmmT6
This was pretty impressive for a one-shot from Gemini 2.5 with the only prompt being: "the poem lepanto, but about the war of 1812" https://t.co/OxF5Kz8rFP
This is very cool, and a really impressive step forward. I do think the classic cartoon format makes some of the less coherent storytelling and prompt-following seem less relevant (which the authors acknowledge); this is still the weakness of AI video. For example, this is the… https://t.co/d3c5j5XGJL
Working with multimodal content has never been easier with our new Image, Audio and PDF classes. Added 200,000 new downloads with the new launch! Check it out at https://t.co/ch3nc9HEXd https://t.co/ARwf3aZhDW
Going with a crazy marketing copy for Parlance Labs. Is it too crazy? https://t.co/c5kFtRPYkI
Google might've created the successor of the Transformer architecture. It's a new architecture that pairs attention with a learnable long-term memory module. Attention handles short-term context with accurate dependency modeling. The memory module stores and retrieves… https://t.co/HO1cS3RIG1
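A heavily simplified toy of the short-term/long-term split: sliding-window attention over recent tokens plus a linear associative memory updated online. This is my own sketch of the general idea, not the paper's architecture:

```python
import numpy as np

d, window = 8, 4
M = np.zeros((d, d))   # long-term memory: toy linear associative map
history = []           # short-term context window

def step(x, lr=0.1):
    """Process one token embedding: attention over the recent window (short-term),
    a read from the memory module (long-term), then a 'surprise' update to memory."""
    global M
    history.append(x)
    ctx = np.stack(history[-window:])        # (<=W, d) recent tokens only
    scores = ctx @ x / np.sqrt(d)            # attention logits against current token
    w = np.exp(scores - scores.max())
    w /= w.sum()                             # softmax
    short = w @ ctx                          # short-term attention readout
    long_read = M @ x                        # long-term memory readout
    M += lr * np.outer(x - long_read, x)     # nudge memory toward new evidence
    return short + long_read
```

The real module is a learned neural memory trained end-to-end; the point here is only that attention sees a bounded window while the memory accumulates state across the whole stream.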
We're excited to introduce a brand-new layout agent within LlamaParse that gives you best-in-class document parsing and extraction with precise visual citations. It uses SOTA VLMs to 1) detect all the blocks on a page (tables/charts/paragraphs), and 2) dynamically… https://t.co/2WRRXxIRa1
Our brand-new layout agent uses state-of-the-art LLMs ranging from faster/cheaper to larger/better (Flash 2.0 to Sonnet-3.7) to dynamically parse a page in a layout-aware way. The layout agent first parses the overall layout of the document and breaks it into chunks. It then… https://t.co/mh8lXJGUYq
I still receive several consulting requests for optimizing prompts for RAG and agentic systems. It's usually the same techniques that work really well. So I've packaged prompting best practices in this 4hr course (all code). Learn it once and you're good going forward. https://t.co/xcMvdNoSVX
// Tracing LLM Outputs Back to Trillions of Training Tokens // Presents OLMOTRACE, the first system that can trace LLM outputs verbatim back to their entire multi-trillion-token training sets in real time! https://t.co/Xs4R7vJcx7
Some interesting points: - They are now data-constrained, not compute-constrained. Future progress relies on algos w/ better sample-efficiency. - Training GPT-4 now requires only 5-10 people. - Expect 10M+ GPU training runs, potentially "semi-synchronous" or decentralized. https://t.co/dV82TY0S7P
I can confirm that Gemini 2.5 Pro is really great at creative writing. Already using it for agentic systems that require editing, reviewing, writing, and refining outputs. https://t.co/pdXQI09UsD
Multimodal foundation models will enable new workflows in medical imaging (and healthcare more broadly), especially compared to the conventional approach. This is what we focus on at @SophontAI https://t.co/E3ujIBnqXw
I built a small voice agent with @livekit that goes out and searches the web using GPT-4o's web browsing when you ask it for recommendations. Tool calling is a bit funky, but otherwise I'm quite happy with the progress so far. https://t.co/6gqZoDFuxn
As we all know by now, reasoning models often generate longer responses, which raises compute costs. Now, this new paper (https://t.co/SwxBs8RsTq) shows that this behavior comes from the RL training process, not from an actual need for long answers for better accuracy. The RL… https://t.co/JnTmDNiVgg
people used to pay me like 20k just for explaining and helping people set up generative benchmarking. Come check out this lightning lesson where Kelly explains why it's so important that evaluations are used as we move systems from prototype to production https://t.co/nN8Z3p3sNk https://t.co/YuJ0bkil2P
made the mistake of continuing down this rabbit hole: apparently babistories itself is a synthetically generated dataset from the memory mosaics paper (https://t.co/lYKd4dPsBH), which is a synthetic copy of tinystories, which itself is also synthetically generated so the "strong… https://t.co/gN4bUzwJDT
New video + post + experiment. What if you could take a dumb model and a smart model and interpolate between them? Then extrapolate out to get an even better response? I talk about some related papers, then try out an idea, documenting the process and the (negative) result. https://t.co/ofOR118TwW
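The interpolate-then-extrapolate idea can be sketched as plain linear blending of checkpoint weights. A hypothetical helper, assuming both models share an identical parameter layout:

```python
import numpy as np

def interpolate_weights(weak, strong, alpha):
    """Linear interpolation (alpha in [0, 1]) or extrapolation (alpha > 1)
    between two checkpoints stored as name -> array dicts."""
    return {name: (1 - alpha) * weak[name] + alpha * strong[name]
            for name in weak}

# toy checkpoints standing in for the "dumb" and "smart" models
weak = {"w": np.array([1.0, 2.0])}
strong = {"w": np.array([3.0, 4.0])}

mid = interpolate_weights(weak, strong, 0.5)     # halfway blend
beyond = interpolate_weights(weak, strong, 1.5)  # extrapolate past "strong"
```

alpha > 1 continues along the weak-to-strong direction, which is the "extrapolate out for an even better response" bet; as the (negative) result suggests, nothing guarantees quality keeps improving off that line.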
Google quietly released a powerful recommender systems library optimized for JAX and TPUs, based on Keras. It's called RecML. It has native support for SparseCore (latest hardware for handling large distributed embeddings) https://t.co/IBNlKXUwcz
Excited to share that we won the AI Math Olympiad competition on @kaggle with a mind-blowing score of solving 34/50 student-level math problems using an LLM. Summary of our solution below. https://t.co/kpNygidHAV
Starting today, memory in ChatGPT can now reference all of your past chats to provide more personalized responses, drawing on your preferences and interests to make it even more helpful for writing, getting advice, learning, and beyond. https://t.co/s9BrWl94iY
from my notes on the childhoods of people who went on to do exceptional work https://t.co/cIxmlAh3et
AI agents fact-checking each other reduce hallucinations by over 2,800%. This new research paper introduces a 4-agent NLP pipeline that flags, explains, and rewrites hallucinated content. Each agent runs a different LLM and focuses on a distinct task: generation, review,… https://t.co/h5KKXvrcib
which VLM is the best? we are building @roboflow VLM playground to find out - test multiple VLMs in parallel and for free - open-source VLMs like PaliGemma and DeepSeek-VL coming soon - we had GPT-4o as well, but @OpenAI banned us for "distillation"; wtf? link:β¦ https://t.co/G9NyC2J726
added Gemini 2.5 Pro support to Claude Code. Feels faster and smarter than Sonnet 3.7. My go-to local coding assistant now. Link below ⬇️ https://t.co/Q2nnIM2Dzo
You can now get `fastkmeans` at your nearest PyPI reseller. It serves just one (1) purpose: run GPU-accelerated k-means that can do 200k+ clusters without going OOM and without any installation pain. (bonus: the API mimics both faiss & sklearn, so it slots in just about anywhere.) https://t.co/WlfzN4t89k
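Since the API mimics sklearn, usage presumably follows the classic fit/predict pattern. A minimal NumPy stand-in showing that interface (my own toy implementation for illustration, not fastkmeans itself):

```python
import numpy as np

class MiniKMeans:
    """Tiny NumPy k-means with an sklearn-like fit/predict API."""
    def __init__(self, n_clusters, n_iter=20, seed=0):
        self.n_clusters, self.n_iter = n_clusters, n_iter
        self.rng = np.random.default_rng(seed)

    def fit(self, X):
        # initialize centers from random data points
        idx = self.rng.choice(len(X), self.n_clusters, replace=False)
        self.cluster_centers_ = X[idx].copy()
        for _ in range(self.n_iter):
            labels = self.predict(X)
            for k in range(self.n_clusters):
                pts = X[labels == k]
                if len(pts):  # keep old center if a cluster goes empty
                    self.cluster_centers_[k] = pts.mean(axis=0)
        return self

    def predict(self, X):
        # squared distance of every point to every center -> nearest center
        d = ((X[:, None, :] - self.cluster_centers_[None]) ** 2).sum(-1)
        return d.argmin(axis=1)
```

A drop-in like `MiniKMeans(2).fit(X).predict(X)` is the whole surface; the GPU-accelerated version's appeal is that the same two calls scale to 200k+ clusters.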