Your curated collection of saved posts and media
GLM 5.1 from @Zai_org ranks as the top open model on the newly released Monthly-SWEBench by @UniPat_AI, second only to Claude-Opus-4.6. Congrats to the team! Explore the benchmark: https://t.co/jh4fTw0dIE https://t.co/sIerMtwDFb
Is there a collection somewhere of the best agent/coding harnesses for each model, especially open-source and local ones? In my opinion, the biggest reason people are struggling with open/local models these days is that the harnesses in most open agents are not designed for them, and people expect things to magically work when they switch models from the default.
This paper makes a strong case for open-world evaluations as a complement to traditional benchmarks, particularly for realistic, long-horizon, open-ended settings! Glad the AISI SoE team could contribute to this effort.
Most genomic AI models use fixed rules to process DNA into chunks, imposing arbitrary boundaries on a sequence with its own biological structure. @arnavshah0, @victor_ljz, and team developed dnaHNet, a tokenizer-free foundation model that learns its own segmentation from scratch, supervised by @_albertgu, @genophoria, and @BoWang87.
Now in research preview: routines in Claude Code. Configure a routine once (a prompt, a repo, and your connectors), and it can run on a schedule, from an API call, or in response to an event. Routines run on our web infrastructure, so you don't have to keep your laptop open. https://t.co/m2XJWYqkf8
Webflow's CMS API can't publish code blocks. Tables aren't in the API at all!? So I built a Playwright robot that clicks buttons in the Designer for us. In 2026, your API is your product. https://t.co/sA7Csu21uO
Can codes and symbols emerge within a single neural network? Thoughts prompted by the "Neural Computers" paper. @rmaruy https://t.co/JZPE0bdCeK
You can now visualize Pi traces that you upload on @huggingface! Let's make sharing agent traces 10x more common to make agent AI more open and collaborative! Also, because it's fun to analyze @badlogicgames's traces https://t.co/LLclFIZeWS

This is crazy good: Grok Code built a full e-commerce website in less than an hour. Here is how I do it, full tutorial + prompts: https://t.co/bAmlxqEoOv