Your curated collection of saved posts and media

Showing 9 posts ยท last 14 days ยท by score
โž• Add New Post
๐Ÿ”Tim_Dettmers retweeted
L
Liang Chen
@liangchen5518
๐Ÿ“…
Apr 12, 2026
12d ago
๐Ÿ†”57047769
โญ0.36

GLM 5.1 from @Zai_org ranks as the top open model on the newly released Monthly-SWEBench by @UniPat_AIโ€”second only to Claude-Opus-4.6. Congrats to the team! ๐Ÿš€Explore the benchmark: https://t.co/jh4fTw0dIE https://t.co/sIerMtwDFb

โค๏ธ77
likes
๐Ÿ”13
retweets
C
ClementDelangue
@ClementDelangue
๐Ÿ“…
Apr 14, 2026
10d ago
๐Ÿ†”55901911
โญ0.38

Is there somewhere a collection of the best agent/coding harnesses for each models, especially open-source and local ones? In my opinion, the biggest reason why people are struggling with open/local models these days is that the agent/coding harnesses in most open agent are not designed for them and expect it to magically work when they switch models from the default.

๐Ÿ”unknown_user retweeted
U
unknown_user
@unknown_user
๐Ÿ“…
Apr 17, 2026
7d ago
๐Ÿ†”20431022
โญ0.36

This paper makes a strong case for open-world evaluations as a complement to traditional benchmarks, particularly for realistic, long-horizon, open-ended settings! Glad the AISI SoE team could contribute to this effort.

โค๏ธ18
likes
๐Ÿ”5
retweets
A
arcinstitute
@arcinstitute
๐Ÿ“…
Apr 13, 2026
11d ago
๐Ÿ†”19823515

Most genomic AI models use fixed rules to process DNA into chunks, imposing arbitrary boundaries on a sequence with its own biological structure. @arnavshah0, @victor_ljz, and team developed dnaHNet, a tokenizer-free foundation model that learns its own segmentation from scratch, supervised by @_albertgu, @genophoria, and @BoWang87.

Media 1
๐Ÿ–ผ๏ธ Media
C
claudeai
@claudeai
๐Ÿ“…
Apr 14, 2026
10d ago
๐Ÿ†”60309790

Now in research preview: routines in Claude Code. Configure a routine once (a prompt, a repo, and your connectors), and it can run on a schedule, from an API call, or in response to an event. Routines run on our web infrastructure, so you don't have to keep your laptop open. https://t.co/m2XJWYqkf8

Media 1
๐Ÿ–ผ๏ธ Media
A
Adam_Fish
@Adam_Fish
๐Ÿ“…
Apr 17, 2026
7d ago
๐Ÿ†”01580794
โญ0.38

Webflow's CMS API can't publish code blocks. Tables aren't in the API at all!? So I built a Playwright robot that clicks buttons in the Designer for us. In 2026, your API is your product. https://t.co/sA7Csu21uO

H
hardmaru
@hardmaru
๐Ÿ“…
Apr 13, 2026
11d ago
๐Ÿ†”81204232

ไธ€ใคใฎใƒ‹ใƒฅใƒผใƒฉใƒซใƒใƒƒใƒˆใซ็ฌฆๅทใจ่จ˜ๅทใฏๅ‰ต็™บใ—ใ†ใ‚‹ใ‹๏ผŸ ใ€ŒNeural Computersใ€่ซ–ๆ–‡ใ‹ใ‚‰่€ƒใˆใ‚‹ @rmaruy https://t.co/JZPE0bdCeK

Media 1
๐Ÿ–ผ๏ธ Media
C
ClementDelangue
@ClementDelangue
๐Ÿ“…
Apr 16, 2026
8d ago
๐Ÿ†”25424326

You can now visualize Pi traces that you upload on @huggingface! Let's make sharing agent traces 10x more common to make agent AI more open and collaborative! Also, because it's fun to analyze @badlogicgames's traces ๐Ÿ˜‚๐Ÿ˜‚๐Ÿ˜‚ https://t.co/LLclFIZeWS

Media 1Media 2
๐Ÿ–ผ๏ธ Media
H
Heyhassan
@Heyhassan
๐Ÿ“…
Apr 16, 2026
8d ago
๐Ÿ†”45131317

This is crazy good Grok Code built a full e-commerce website in less than an hour. Here is how i do this full tutorial + prompts: โ†“ https://t.co/bAmlxqEoOv

๐Ÿ–ผ๏ธ Media