Your curated collection of saved posts and media
@gmiller @sama Gandhiโs thinking on this is such an inspiration. For those unfamiliar: https://t.co/2sLf80oeTf
Most serving stacks run FLUX.2 as four separate stages with Python overhead between each one. We collapsed all four into a single fused execution graph using MLIR-based compilation. On @AMD MI355X, that means a 3.8x speedup over torch.compile, 1024x1024 images in under 3.5 seconds, and a deployment container under 700MB. We ran the same pipeline on Blackwell, too. AMD delivers equivalent generation quality at a 5.5x lower cost. @clattner_llvm is presenting the full breakdown at AMD AI DevDay. Register: https://t.co/Pa1e36BTZn
you can now control things with your brain. literally. we're building the most wearable BCI on the planet, with @sabicap, backed by @khoslaventures @accel @initialized & @kevinweil. we collected the worldโs largest neural dataset and trained the most capable Brain Foundation Model. then we invented a new class of biosensors powered by custom ASICs. type without typing. click without clicking. a cap that lets your brain do the work. weโre sabi.
This is crazy good Grok Code built a full e-commerce website in less than an hour. Here is how i do this full tutorial + prompts: โ https://t.co/bAmlxqEoOv
Grok might be behind the Anthropics and OpenAIs but when it hooks code up to lists then we can create things like my news site inside of X: https://t.co/kiuZ7QXLzb That's when things great really fun here on X.
This is crazy good Grok Code built a full e-commerce website in less than an hour. Here is how i do this full tutorial + prompts: โ https://t.co/bAmlxqEoOv
The computer is for you https://t.co/xYIJfN5FAS
Seedance 2.0 Advancing Video Generation for World Complexity paper: https://t.co/v0TZmavCUr https://t.co/pWuFZOiX7g
this is the year of 3D World models ๐ฅ > Lyra 2.0 by NVIDIA: image โ 3D world with Gaussians, 14B params, built on WAN-14B > HY-World 2.0 by Tencent: text/image/video 3D โ editable world (meshes + Gaussians) drop-in to Blender/Unity/Unreal weights on the next one โก๏ธ https://t.co/MTSdHmRE2P
GLM-5.1 Tool Calling Issue Fix & Chat Template Update If you are running GLM-5.1 with vLLM/SGLang and using tool calling, please update your chat template. https://t.co/XyyCucws82 Issue When using tool calling, frameworks including vLLM automatically convert plain-text tool message content into an array of content parts (`[{"type": "text", "text": "..."}]`) before passing it to the chat template. The original template only supported string-formatted tool content, causing array-formatted tool outputs to render empty. As a result, the model does not receive tool results and repeatedly triggers the same tool call in a loop. Affected Models All GLM-5.1 variants deployed with vLLM or SGLang. Fix Simply replace your existing `chat_template.jinja` with the updated version from the repository.
This part of the 4.7 Opus system card is pretty neat and seems potentially worth emulating (Anthropic showed Mythos the private discussions/evidence underlying the system card and asked Mythos if the Opus system card accurately characterized that private evidence) https://t.co/4pf666ZB6m
@TheZvi Less juicy overall than last time, but I was happy we got to fit in section 6.1.3: https://t.co/8KYbb2DKgX
Nearly 1/3 of surveyed people in Anthropic now think entry-level engineers and researchers are likely replaced by Mythos within 3 months https://t.co/QUozBxLUrR
This is really bad. The scary part in the US is that it doesnโt matter whether you are the CEO of OpenAI or just a regular PhD. There are paid online websites that can find your address and phone number. I donโt know how such personal info got out. https://t.co/jyDMhyXIZC
why do the Japanese like their buns askew? https://t.co/6be3BqSJ7d
GameWorld Towards Standardized and Verifiable Evaluation of Multimodal Game Agents paper: https://t.co/IfbTgfNnSM https://t.co/gL3BURxzkV
New insane model from Jackrong on @huggingface ๐คฏ Qwen3.5-9B-GLM5.1-Distill-v1 ๐ง Distilled on GLM-5.1 reasoningโจโ๏ธ Deeper thinking than base modelโจ๐งช Benchmarks coming soon โ Fits on 8GB VRAM โ๏ธ New model after Qwopus/Gemopus After distilling Claude Opus 4.6, heโs now back on the strongest open-source model! An MLX ๏ฃฟ version is also available on his huggingface page 27B model incoming? https://t.co/893FvH51jb
Today we're releasing Personal Computer. Personal Computer integrates with the Perplexity Mac App for secure orchestration across your local files, native apps, and browser. Weโre rolling this out to all Perplexity Max subscribers and everyone on the waitlist starting today. https://t.co/kxgFQFo7BB
@vicberggren Whew. I'm not gonna stop doing them anyway. Even if it costs me a few hundred bucks a month. My AI that builds https://t.co/kiuZ7QXLzb learns from my reshares what to look for.
Qwen just released Qwen 3.6 on Hugging Face A 35B MoE vision-language model with 3B active parameters, featuring advanced agentic coding capabilities and thinking preservation. https://t.co/NmwjwxyN3m
Parcae Scaling Laws For Stable Looped Language Models paper: https://t.co/hUYU2x8STk https://t.co/ravMv2kbR3
Geometric Context Transformer for Streaming 3D Reconstruction paper: https://t.co/3ad6iyi0cG https://t.co/Y0k4csiC11
Claude remains irreducibly Claude. If you know, you know. (The fact that models have distinct personalities that are consistent across generations is technically interesting, it also makes it very easy to use new releases when they come along, because they feel very similar). https://t.co/imyGcPsYBI
โก Meet Qwen3.6-35B-A3B๏ผNow Open-Source๏ผ๐๐ A sparse MoE model, 35B total params, 3B active. Apache 2.0 license. ๐ฅ Agentic coding on par with models 10x its active size ๐ท Strong multimodal perception and reasoning ability ๐ง Multimodal thinking + non-thinking modes Efficient. Powerful. Versatile. Try it now๐ Blog๏ผhttps://t.co/EXx5y466su Qwen Studio๏ผhttps://t.co/bg4tAU1p74 HuggingFace๏ผhttps://t.co/w4pDX14DZS ModelScope๏ผhttps://t.co/SuRyLzdQiO API๏ผโQwen3.6-Flashโ on Model Studio๏ผ๏ผComing soon๏ฝ Stay tuned

today we launched the Legal AGI Lab. AI agents are beginning to operate in highly regulated environments like healthcare and financial services. but existing legal frameworks arenโt ready for this. And this creates a bottleneck for the agentic economy. so we are conducting interdisciplinary legal & AI research on how agents should be governed, held liable, and measured and defining the legal architecture required for autonomous agents to operate safely in high-stakes environments. Norm sits at a unique intersection: we build AI agents, we deploy them with institutional clients, and we power Norm Law, an AI-native law firm operating on live legal work. that feedback loop between building, testing, and deploying is what makes this research different.
@_winter_wonders A frequently cited example is OpenAIโs Sora, which reportedly incurred extremely high compute costs, widely estimated in the range of up to around $1 million per day, while struggling to match that with sustainable revenue. The product has since been pulled back, often framed as a mismatch between cutting-edge capability and viable unit economics. This connects to the broader pattern in AI commercialization, including the earlier cybersecurity discussion: significant spending is increasingly justified under โsafety,โ โrisk,โ or โcapabilityโ narratives, even when the underlying economic returns remain uncertain. Source: https://t.co/VTYh7hipAY
What you need to know about Opus 4.7 * Takes instructions literally * Better vision means improved computer use and producing slides and other visual artifacts * Optimized for large-scale real-world analysis * Better at using file system-based memory https://t.co/tEywxsCxSV
Introducing Claude Opus 4.7, our most capable Opus model yet. It handles long-running tasks with more rigor, follows instructions more precisely, and verifies its own outputs before reporting back. You can hand off your hardest work with less supervision. https://t.co/PtlRdpQcG
Viktor Orbรกnโs electoral loss in Hungary is as much a defeat for Trump and JD Vance. "Seldom have American leaders intervened so overtly in a foreign election, and seldom has their preferred candidate fared so badly." https://t.co/DbLhMz65ja
๐ค Take the stage at #PyTorchCon North America! We are looking for technical deep dives & production stories for our return to San Jose this Oct 20-21. Check out our "Preparing to Submit" guide to help craft your proposal. ๐๏ธ Deadline: June 7 Apply now: https://t.co/hLlKK7WxLD https://t.co/leYJj7nDfR
๐ @AnthropicAI's Claude Opus 4.7 is now generally available and rolling out in GitHub Copilot. Early testing shows โก๏ธ It has stronger multi-step task performance and more reliable agentic execution โก๏ธ Meaningful improvement in long-horizon reasoning and complex workflows Try it out in @code or Copilot CLI. https://t.co/8QFLkf0RqR
โFurnitureโ opens at Salon 94 NYC Thursday, April 23, 2026 from 7-9pm EDT. See you there! On view from April 23 - June 20, 2026 Salon 94: 3 E 89th St, New York, NY 10128 Hours: Wednesday - Saturday 11:00am - 6:00pm EDT https://t.co/N3MIlW3bv3

๐ฟ๐ฆ A senior South African politician just got caught lying about Starlink to protect mobile network operators. Parliamentary communications chair Khusela Diko claimed Starlink "doesn't move the needle" on school connectivity. Here's what she left out: 16,000 schools still have no internet after 12 years, a missed deadline, and mobile operators billions over budget. Starlink offered to connect 5,000 schools for free and was turned away. A rural mobile tower costs around $61,000 and can serve just a single school. The numbers don't lie. The politician did. Source: MyBroadband
Accurate

Introducing Claude Opus 4.7, our most capable Opus model yet. It handles long-running tasks with more rigor, follows instructions more precisely, and verifies its own outputs before reporting back. You can hand off your hardest work with less supervision. https://t.co/PtlRdpQcG5
T-7 days ๐ซ๐ท Open-source AI Art takes over Paris. 3 days. Hackathons. Art. Talks. 120 spots per day. https://t.co/f9RpcI3Cs5