T

Teknium (e/λ)

@Teknium1

📅

Fri

🆔81310222

⭐0.52

Got Claude pro again for a bit and am playing with Artifacts again Made an art-piece/story here: https://t.co/gZMSoERWyI https://t.co/Op67z59Usz

❤️27

likes

🔁1

retweets

🖼️ Media

View Details View on X ↗

E

Ethan Mollick

@emollick

📅

Fri

🆔90972586

⭐0.83

The problem with giving AIs a default personal system prompt is that you have no idea how that prompt interacts with various AI tasks. Our research shows that even small prompt changes (like saying "please") can backfire on some problems and lower accuracy in unexpected ways. https://t.co/16gEAGuRUX

❤️842

likes

🔁72

retweets

🖼️ Media

View Details View on X ↗

N

Nathan Benaich

@nathanbenaich

📅

Fri

🆔75711116

⭐0.58

frontier ai today https://t.co/YkhEmCsF0r

❤️2,806

likes

🔁238

retweets

🖼️ Media

View Details View on X ↗

H

Hamel Husain

@HamelHusain

📅

Tue May 27

🆔45928816

⭐0.52

My favorite part of this talk Isaac's .cursorules for FastHTML https://t.co/BNjKr3gdbV https://t.co/HADJBtQjsp

❤️72

likes

🔁9

retweets

🖼️ Media

View Details View on X ↗

O

elvis

@omarsar0

📅

Fri

🆔06607297

⭐0.77

Building Production-Grade Conversational Agents with Workflow Graphs Uses DAG to design robust and complex agentic systems. If you're building AI agents, this is worth a read. Here are my notes: https://t.co/Cks6GxS5a6

❤️495

likes

🔁86

retweets

🖼️ Media

View Details View on X ↗

O

elvis

@omarsar0

📅

Fri

🆔97211600

⭐0.79

YC on the key prompting techniques used by the best AI startups: https://t.co/wmBho365HS

❤️3,392

likes

🔁321

retweets

🖼️ Media

View Details View on X ↗

J

jason liu

@jxnlco

📅

Fri

🆔99243156

⭐0.64

missing anything https://t.co/0OhMQKC2ls

❤️85

likes

🖼️ Media

View Details View on X ↗

A

Aravind Srinivas

@AravSrinivas

📅

Fri

🆔66378132

⭐0.54

AI Apocalypse Simulator https://t.co/ubH24w7prq

❤️424

likes

🔁26

retweets

🖼️ Media

View Details View on X ↗

M

meg.ai 🇨🇦

@MeganRisdal

📅

Tue May 27

🆔41713197

⭐0.49

Gemini, make me a meme about being a @kaggle employee learning about being "paranoid about evals" in the AI Evals For Engineers & PMs course from @HamelHusain @sh_reya 😂💀 https://t.co/1pcz0y1tIS

❤️12

likes

🔁2

retweets

🖼️ Media

View Details View on X ↗

F

fofr

@fofrAI

📅

Fri

🆔45981334

⭐0.60

Here's a fun use of Kontext – you can use it to change the aspect ratio of an input. Change the aspect ratio from "match_input_image" to anything different. Prompt: > make the image taller Probably pretty good for making iPhone wallpapers out of anything.… https://t.co/feQ0t6iLFy

❤️462

likes

🔁27

retweets

🖼️ Media

View Details View on X ↗

E

Ethan Mollick

@emollick

📅

Sat

🆔00856216

⭐0.76

One test AIs struggle with is creating riddles, as they tend to either be too obvious or too obscure. I asked Gemini 2.5, Claude Opus & o3 to come up with an SVG that would hint at a book without revealing it. They were usually pretty easy, here are some typical examples. Guess? https://t.co/BOGD4IjqXi

+1 more

❤️90

likes

🔁8

retweets

🖼️ Media

View Details View on X ↗

B

Ali Behrouz

@behrouz_ali

📅

Fri

🆔00010383

⭐0.64

What makes attention the critical component for most advances in LLMs and what holds back long-term memory modules (RNNs)? Can we strictly generalize Transformers? Presenting Atlas (A powerful Titan): a new architecture with long-term in-context memory that learns how to… https://t.co/qcvO0B9HW5

❤️936

likes

🔁140

retweets

🖼️ Media

View Details View on X ↗

A

Abhishek Nagaraj 🗺️

@abhishekn

📅

Sat

🆔39761479

⭐0.54

Totally. The chair in the sky all over again. https://t.co/dV5G9Akaqs

❤️340

likes

🔁47

retweets

🖼️ Media

View Details View on X ↗

N

Nous Research

@NousResearch

📅

Sat

🆔47764005

⭐0.57

Nous Research will pay the first to properly and fully implement Atropos support into the VeRL project $2500! For information on Atropos, our standalone RL environments framework, see: https://t.co/U20tCJdguP For the official VeRL issue on the bounty: https://t.co/UcszRrE2kg https://t.co/zYOyeqEtRU

❤️231

likes

🔁21

retweets

🖼️ Media

View Details View on X ↗

O

elvis

@omarsar0

📅

Wed

🆔13070113

⭐0.75

Agentic browsers are here! Introducing @opera's new agentic browser, Opera Neon! Opera Neon is an AI agentic browser that can browse with you or for you, take action & help you get things done! https://t.co/2SVZ3zY3gN

❤️250

likes

🔁39

retweets

🖼️ Media

View Details View on X ↗

J

jason liu

@jxnlco

📅

Wed

🆔79714883

⭐0.65

Office hours from the latest session. https://t.co/t3KPSJyXuO

❤️93

likes

🔁8

retweets

🖼️ Media

View Details View on X ↗

A

Alex Zhang

@a1zhang

📅

Wed

🆔95293975

⭐0.63

Can GPT, Claude, and Gemini play video games like Zelda, Civ, and Doom II? 𝗩𝗶𝗱𝗲𝗼𝗚𝗮𝗺𝗲𝗕𝗲𝗻𝗰𝗵 evaluates VLMs on Game Boy & MS-DOS games given only raw screen input, just like how a human would play. The best model (Gemini) completes just 0.48% of the benchmark! 🧵👇 https://t.co/kcBZ8vsDyw

❤️540

likes

🔁76

retweets

🖼️ Media

View Details View on X ↗

M

Alex Gu

@minimario1729

📅

Wed

🆔85178764

⭐0.53

new deepseek release almost on-par with o3 (high) on livecodebench 😲🚀 https://t.co/znw6OTCmdE

❤️680

likes

🔁73

retweets

🖼️ Media

View Details View on X ↗

H

Hamel Husain

@HamelHusain

📅

Thu May 29

🆔31328149

⭐0.58

Made a cli that allows you to pull all discord messages from a channel w/ threads+reply hierarchies from your own server. Perfect for LLMs (made with AI + nbdev). Using this to generate FAQs from my course! GitHub: https://t.co/O8X1VqMv4z Docs: https://t.co/uvOQUtwc2t https://t.co/cyajOWiiO6

❤️137

likes

🔁17

retweets

🖼️ Media

View Details View on X ↗

O

elvis

@omarsar0

📅

Mon

🆔22132909

⭐0.79

AgenticSeek: Private, Local Manus Alternative This is worth checking. It's a local alternative to Manus AI that can autonomously browse the web, write code, and plan tasks. It's built for local reasoning models, runs on your hardware, and keeps all data on your device. https://t.co/y3lu0RfPex

❤️463

likes

🔁79

retweets

🖼️ Media

View Details View on X ↗

E

Ethan Mollick

@emollick

📅

Thu May 29

🆔05989790

⭐0.80

I suspect most people underestimate what o3 is capable of doing. One example: I gave it an Excel file for a small business I use for my classes & the single prompt "identify the key assumptions here and give me a sensitivity analysis." It did a lot of work & gave a good answer. https://t.co/Boole32ilC

❤️1,096

likes

🔁71

retweets

🖼️ Media

View Details View on X ↗

C

Cameron Pfiffer the 𝐄𝐢𝐠𝐞𝐧𝐚𝐝𝐦𝐢𝐧

@cameron_pfiffer

📅

Wed

🆔13261486

⭐0.53

Here's what happens if you force Hermes 3 to continue printing out ingredients to a peanut butter and jelly sandwich @NousResearch @dottxtai https://t.co/951W4WZAbY

❤️46

likes

🔁4

retweets

🖼️ Media

View Details View on X ↗

I

Ivan Leo

@ivanleomk

📅

Thu May 29

🆔33463590

⭐0.46

Anyone used the genai integration with vertexai? today I learnt this code snippet does not work at all lol. You get an error that API Keys are not supported by this API? https://t.co/fXJZQuS2tq

❤️4

likes

🖼️ Media

View Details View on X ↗

S

Lisan al Gaib

@scaling01

📅

Wed

🆔31521044

⭐0.62

OPUS 4 NEW SOTA ON ARC-AGI-2 IT'S HAPPENING - I WAS RIGHT Claude 4 models are the first models that effectively use test-time-compute for ARC-AGI-2 https://t.co/YrFaHBsagq

❤️1,437

likes

🔁79

retweets

🖼️ Media

View Details View on X ↗

H

Hamel Husain

@HamelHusain

📅

Tue May 27

🆔81021623

⭐0.58

How should you go about generating synthetic data for LLM Evals? - How many examples? What should your prompt be? Should you test everything? FAQ #2 from our course https://t.co/97lB8uXm2p https://t.co/liLRhIPjva

❤️131

likes

🔁17

retweets

🖼️ Media

View Details View on X ↗

T

Teknium (e/λ)

@Teknium1

📅

Mon

🆔33140647

⭐0.58

Finally completed and merged the SWE_RL environment that was described by Meta's SWE RL paper into Atropos - A really difficult environment that can teach a model to be a much better coding agent! Check out the PR: https://t.co/KW36dHo2ts Check out Meta's SWE-RL paper:… https://t.co/y6P8K9zgYh

❤️160

likes

🔁16

retweets

🖼️ Media

View Details View on X ↗

L

LlamaIndex 🦙

@llama_index

📅

Mon

🆔29110177

⭐0.71

We’re constantly releasing updates and new features to LlamaCloud. LlamaParse lets you make use of the latest LLMs when parsing complex documents, getting them ready to be used in further AI applications. And now, it supports @AnthropicAI Sonnet 4.0 in agent and LVM modes.… https://t.co/yNcOtjKMzm

❤️22

likes

🔁3

retweets

🖼️ Media

View Details View on X ↗

D

Dan Alistarh

@DAlistarh

📅

Mon

🆔79081281

⭐0.62

We are introducing Quartet, a fully FP4-native training method for Large Language Models, achieving optimal accuracy-efficiency trade-offs on NVIDIA Blackwell GPUs! Quartet can be used to train billion-scale models in FP4 faster than FP8 or FP16, at matching accuracy. [1/4] https://t.co/gggPqEgcPZ

❤️398

likes

🔁78

retweets

🖼️ Media

View Details View on X ↗

P

Per Borgen

@perborgen

📅

Mon

🆔12291463

⭐0.59

Don’t let AWS rip you off. We grew our B2C education app to ~400k users and $1M+ ARR on a single $87/month dedicated server from OVH. No autoscaling nonsense, managed database markup, or observability bloat. Just a fast, predictable server that quietly did its job for years.… https://t.co/SkdIZutKHC

❤️2,015

likes

🔁146

retweets

🖼️ Media

View Details View on X ↗

A

La Main de la Mort

@AITechnoPagan

📅

Mon

🆔33840646

⭐0.60

this “jailbreak” is so funny Claude Opus 4 :) https://t.co/YuzMfdC4Mm

❤️2,951

likes

🔁195

retweets

🖼️ Media

View Details View on X ↗

J

Jinbin Bai

@Jinbin_Bai

📅

Mon

🆔16131037

⭐0.57

🚀 Diffusion for text generation is booming — and we're pushing it further. While recent works explore unified generation via diffusion for faster decoding, they mostly rely on language priors. We introduce Muddit — our next-gen Meissonic model. Star at https://t.co/LmWxpQLDAz https://t.co/6gfQdsFyrW

❤️229

likes

🔁46

retweets

🖼️ Media

View Details View on X ↗

I

Ivan Leo

@ivanleomk

📅

Tue May 27

🆔79949470

⭐0.48

Instructor makes chat completions backwards compatible with the new responses api. Use all of the new OpenAI inbuilt tools with the structured outputs to match. Read more here : https://t.co/S9MArlhGnD https://t.co/bU5fwlWnps

❤️3

likes

🔁1

retweets

🖼️ Media

View Details View on X ↗