Your curated collection of saved posts and media

Showing 32 posts Β· last 14 days Β· by score
E
Ethan Mollick
@emollick
πŸ“…
Tue Apr 01
πŸ†”42054607
⭐1.00

Deleted this post because Kevin is right - it isn’t clear that a better prompted LLM, or a consensus or pass experiment with multiple attempts, would not be able to solve more of these problems. Testing LLMs is challenging for these reasons. https://t.co/nM6wc4Rg7b

Media 1
❀️220
likes
πŸ”10
retweets
πŸ–ΌοΈ Media
L
Lior⚑
@LiorOnAI
πŸ“…
Thu Apr 03
πŸ†”16856550
⭐0.83

Game changer for scraping. This repo lets you easily scrape web pages and have the output in LLM-friendly formats (JSON, cleaned HTML, markdown). β€’ Supports crawling multiple URLs simultaneously β€’ Extracts and returns all media tags (Images, Audio, and Video) β€’ Extracts all… https://t.co/bVx1OUeEIB

❀️321
likes
πŸ”49
retweets
πŸ–ΌοΈ Media
W
Wing Lian (caseus)
@winglian
πŸ“…
Wed
πŸ†”08959943
⭐0.73

Axolotl is out v0.8.0 today! Major features include support for Sequence Parallelism, Gemma3, Multimodal (beta), Muon optimizer, and a major expansion to our docs! We've worked to make sure that our features are composable leading to 3.6x speedups over vanilla HF+FA2 with >50%… https://t.co/WCZ0IJaqcz

Media 1
❀️211
likes
πŸ”40
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Wed
πŸ†”64052249
⭐1.00

Deleted this. The key point is true (all the major labs offer secure versions that will not train on your data governed by the same rules as other cloud services) and also it is true Claude will not train on your data. However, the policies for free Gemini are not that clear. https://t.co/yfhOY8dhZz

Media 1
❀️178
likes
πŸ”6
retweets
πŸ–ΌοΈ Media
R
Sebastian Raschka
@rasbt
πŸ“…
Wed
πŸ†”93490831
⭐0.73

Pretty cool "Multi-Head Attention Shape Transformations (Cheat Sheet)" shared by a reader: https://t.co/9Nprk4XHgJ https://t.co/VBft7wtuR7

Media 1
❀️622
likes
πŸ”90
retweets
πŸ–ΌοΈ Media
W
Wing Lian (caseus)
@winglian
πŸ“…
Thu Apr 03
πŸ†”13366927
⭐0.83

We've implemented a simple toolkit for fine-tuning powerful coding models using only RL with an entirely local, zero-setup sandboxed code interpreter. We found very promising results using a tiny fraction of data & training time vs SFT. Check out our blogpost for more details! πŸ‘‡β€¦ https://t.co/IMiRO3LS3C

❀️184
likes
πŸ”28
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Fri
πŸ†”67150340
⭐0.91

AI 2027? πŸ€” AI predictions are wild... what do you think? https://t.co/HbEz3UmmT6

Media 1
❀️150
likes
πŸ”18
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Mon
πŸ†”80073466
⭐0.82

This was pretty impressive for a one-shot from Gemini 2.5 with the only prompt being: "the poem lepanto, but about the war of 1812" https://t.co/OxF5Kz8rFP

Media 1Media 2
❀️121
likes
πŸ”7
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Tue Apr 08
πŸ†”40063708
⭐1.00

This is very cool, and a really impressive step forward. I do think the classic cartoon format makes some of the less coherent storytelling and prompt-following seem less relevant (which the authors acknowledge), this is still the weakness of AI video. For example, this is the… https://t.co/d3c5j5XGJL

❀️249
likes
πŸ”27
retweets
πŸ–ΌοΈ Media
I
Ivan Leo (SF 1 - 19 June )
@ivanleomk
πŸ“…
Sun
πŸ†”05672047
⭐0.58

Working with multimodal content has never been easier with our new Image, Audio and PDF classes. Added 200,000 new downloads with the new launch! Check it out at https://t.co/ch3nc9HEXd https://t.co/ARwf3aZhDW

Media 1
❀️3
likes
πŸ–ΌοΈ Media
H
Hamel Husain
@HamelHusain
πŸ“…
Mon
πŸ†”03404569
⭐0.58

Going with a crazy marketing copy for Parlance Labs. Is it too crazy? https://t.co/c5kFtRPYkI

Media 1
❀️55
likes
πŸ–ΌοΈ Media
L
Lior⚑
@LiorOnAI
πŸ“…
Wed
πŸ†”32281610
⭐0.83

Google might've created the successor of the Transformer architecture. It's a new architecture that pairs attention with a learnable long-term memory module. Attention handles short-term context with accurate dependency modeling. The memory module stores and retrieves… https://t.co/HO1cS3RIG1

Media 1
❀️514
likes
πŸ”77
retweets
πŸ–ΌοΈ Media
L
LlamaIndex πŸ¦™
@llama_index
πŸ“…
Mon
πŸ†”34506590
⭐1.00

We’re excited to introduce a brand-new layout agent within LlamaParse that gives you the best-in-class document parsing and extraction with precise visual citations. It uses SOTA VLM models to 1) detect all the blocks on a page (tables/charts/paragraphs), and 2) dynamically… https://t.co/2WRRXxIRa1

❀️205
likes
πŸ”31
retweets
πŸ–ΌοΈ Media
J
Jerry Liu
@jerryjliu0
πŸ“…
Tue Apr 08
πŸ†”86684428
⭐0.78

Our brand-new layout agent uses state-of-the-art LLMs ranging from faster/cheaper to larger/better (Flash 2.0 to Sonnet-3.7) to dynamically parse a page in a layout aware way. The layout agent first parses the overall layout of the document and breaks it into chunks. It then… https://t.co/mh8lXJGUYq

Media 1
❀️46
likes
πŸ”8
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Thu Apr 10
πŸ†”41935995
⭐0.91

I still receive several consulting requests for optimizing prompts for RAG and agentic systems. It's usually the same techniques that work really well. So I've packaged prompting best practices in this 4hr course (all code). Learn it once and you're good going forward. https://t.co/xcMvdNoSVX

❀️155
likes
πŸ”23
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Thu Apr 10
πŸ†”03262316
⭐1.00

// Tracing LLM Outputs Back to Trillions of Training Tokens // Presents OLMOTRACE, the first system that can trace LLM outputs verbatim back to their entire multi-trillion-token training sets in real time! https://t.co/Xs4R7vJcx7

Media 1
❀️281
likes
πŸ”54
retweets
πŸ–ΌοΈ Media
A
Aran Komatsuzaki
@arankomatsuzaki
πŸ“…
Fri
πŸ†”45069211
⭐0.97

Some interesting points: - They are now data-constrained, not compute-constrained. Future progress relies on algos w/ better sample-efficiecy. - Training GPT-4 now requires only 5-10 people. - Expect 10M+ GPU training runs, potentially β€œsemi-synchronous” or decentralized. https://t.co/dV82TY0S7P

❀️363
likes
πŸ”46
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Thu Apr 10
πŸ†”43902198
⭐0.81

I can confirm that Gemini 2.5 Pro is really great at creative writing. Already using it for agentic systems that require editing, reviewing, writing, and refining outputs. https://t.co/pdXQI09UsD

❀️589
likes
πŸ”58
retweets
πŸ–ΌοΈ Media
I
Tanishq Mathew Abraham, Ph.D.
@iScienceLuvr
πŸ“…
Sun
πŸ†”34636382
⭐0.83

Multimodal foundation models will enable new workflows in medical imaging (and healthcare more broadly), especially compared to the conventional approach This is what we focus on at @SophontAI https://t.co/E3ujIBnqXw

❀️316
likes
πŸ”46
retweets
πŸ–ΌοΈ Media
I
Ivan Leo (SF 1 - 19 June )
@ivanleomk
πŸ“…
Sun
πŸ†”79303063
⭐0.68

I built a small voice agent that goes out and searches the web using GPT-4o's web browsing when you ask it for recommendations with @livekit Tool Calling a bit funky but otherwise quite happy with the progress so far. https://t.co/6gqZoDFuxn

❀️6
likes
πŸ–ΌοΈ Media
R
Sebastian Raschka
@rasbt
πŸ“…
Sun
πŸ†”01986135
⭐0.98

As we all know by now, reasoning models often generate longer responses, which raises compute costs. Now, this new paper (https://t.co/SwxBs8RsTq) shows that this behavior comes from the RL training process, not from an actual need for long answers for better accuracy. The RL… https://t.co/JnTmDNiVgg

❀️1,201
likes
πŸ”191
retweets
πŸ–ΌοΈ Media
J
jason liu
@jxnlco
πŸ“…
Sat
πŸ†”19880209
⭐0.94

people used to pay me like 20k just explaining and helping people set up generative benchmarking Come check out this lightning lesson where kelly explains why it's so important that evaluations are used as we move systems from prototype to production https://t.co/nN8Z3p3sNk https://t.co/YuJ0bkil2P

Media 1
❀️76
likes
πŸ–ΌοΈ Media
S
Susan Zhang
@suchenzang
πŸ“…
Sat
πŸ†”28975457
⭐0.83

made the mistake of continuing down this rabbit πŸ•³οΈ apparently babistories itself is a synthetically generated dataset from the memory mosaics paper (https://t.co/lYKd4dPsBH), which is a synthetic copy of tinystories, which itself is also synthetically generated so the "strong… https://t.co/gN4bUzwJDT

❀️435
likes
πŸ”38
retweets
πŸ–ΌοΈ Media
J
Jonathan Whitaker
@johnowhitaker
πŸ“…
Sat
πŸ†”20912511
⭐0.78

New video + post + experiment. What if you could take a dumb model and a smart model and interpolate between them? Then extrapolate out to get an even better response? I talk about some related papers, then try out an idea, documenting the process and the (negative) result. https://t.co/ofOR118TwW

Media 1
❀️34
likes
πŸ”3
retweets
πŸ–ΌοΈ Media
F
FranΓ§ois Chollet
@fchollet
πŸ“…
Sat
πŸ†”62190554
⭐0.89

Google quietly released a powerful recommender systems library optimized for JAX and TPUs, based on Keras. It's called RecML. It has native support for SparseCore (latest hardware for handling large distributed embeddings) https://t.co/IBNlKXUwcz

Media 1
❀️1,106
likes
πŸ”124
retweets
πŸ–ΌοΈ Media
K
Dieter
@kagglingdieter
πŸ“…
Fri
πŸ†”38886923
⭐0.88

πŸŽ‰ Excited to share that we won the AI Math Olympiad competition on @kaggle with a mind-blowing score of solving 34/50 student-level math problems using a LLM. Summary of our solution below. https://t.co/kpNygidHAV

Media 1
❀️1,303
likes
πŸ”88
retweets
πŸ–ΌοΈ Media
O
OpenAI
@OpenAI
πŸ“…
Thu Apr 10
πŸ†”72212636

Starting today, memory in ChatGPT can now reference all of your past chats to provide more personalized responses, drawing on your preferences and interests to make it even more helpful for writing, getting advice, learning, and beyond. https://t.co/s9BrWl94iY

❀️14,825
likes
πŸ”1,932
retweets
πŸ–ΌοΈ Media
P
Henrik Karlsson
@phokarlsson
πŸ“…
Fri
πŸ†”50279234
⭐0.68

from my notes on the childhoods of people who went on to do exceptional work https://t.co/cIxmlAh3et

Media 1
❀️5,177
likes
πŸ”418
retweets
πŸ–ΌοΈ Media
L
Lior⚑
@LiorOnAI
πŸ“…
Thu Apr 10
πŸ†”25135342
⭐0.83

AI agents fact-checking each other reduce hallucinations by over 2,800%. This new research paper introduces a 4-agent NLP pipeline that flags, explains, and rewrites hallucinated content. Each agent runs a different LLM and focuses on a distinct taskβ€”generation, review,… https://t.co/h5KKXvrcib

Media 1
❀️813
likes
πŸ”123
retweets
πŸ–ΌοΈ Media
S
SkalskiP
@skalskip92
πŸ“…
Thu Apr 10
πŸ†”70460714
⭐0.83

which VLM is the best? we are building @roboflow VLM playground to find out - test multiple VLMs in parallel and for free - open-source VLMs like PaliGemma and DeepSeek-VL coming soon - we had GPT-4o as well, but @OpenAI banned us for "distillation"; wtf? link:… https://t.co/G9NyC2J726

❀️129
likes
πŸ”21
retweets
πŸ–ΌοΈ Media
R
rahul
@rahulgs
πŸ“…
Wed
πŸ†”78345238
⭐0.63

added gemini 2.5 pro support to Claude Code feels faster and smarter than Sonnet 3.7 my go to local coding assistant now link below ⬇️ https://t.co/Q2nnIM2Dzo

Media 1
❀️919
likes
πŸ”47
retweets
πŸ–ΌοΈ Media
B
Ben ClaviΓ©
@bclavie
πŸ“…
Wed
πŸ†”45835569
⭐0.83

You can now get `fastkmeans` at your nearest PyPi reseller. It serves just one (1) purpose: run GPU-accelerated k-means that can do 200k+ clusters without going OOM without any installation pain. (bonus: the API mimics both faiss & sklearn, so it slots in just about anywhere.) https://t.co/WlfzN4t89k

Media 1
❀️163
likes
πŸ”26
retweets
πŸ–ΌοΈ Media