Your curated collection of saved posts and media
🚀 Our free LongCat API’s backend model has been updated! A new LongCat-Flash-Chat update is arriving! Building on "extreme efficiency" & "lightning-fast response", this upgrade reinforces long-context capability and real-task performance. 🔺 256K Context: Doubled from the previous 128K, with top-tier info capture accuracy. 🤖 Agentic coding: Built for developers who actually ship. Full-stack apps across mobile, desktop, and cross-platform. ⚙️ SOTA Tool Use: High precision and robust execution even in non-thinking modes. 🌏 Multilingual Support: Enhanced support for 9 new languages (ES, FR, AR, PT, RU, ID, DE, KO, JA). 💭 Enhanced Reasoning: Significant improvements in Math and Complex Instruction Following. ✅ Good News: The LongCat-Flash API is fully updated now! 🔗 Check API Docs & Get Started (up to 5M free tokens/day): https://t.co/7TlSHfiCyp
I’m excited to share a new repo: Agent Skills for Context Engineering. Instead of just offering a library of black-box tools, it acts as a "Meta-Agent" knowledge base. It provides a standard set of skills, written in markdown and code, that you can feed to an agent so it understands how to manage its own cognitive resources. https://t.co/vWwrYPAd8k Most agent failures are not model failures; they are context failures. This is still an experimental project. The goal is to establish a platform-agnostic standard for context engineering that can be used in Cursor, Claude Code, Copilot or Codex. skills/: context-fundamentals (what context is, why it matters); context-degradation (how context fails: lost-in-middle, poisoning); context-optimization (compaction, masking, caching); multi-agent-patterns (orchestrator, swarm, hierarchical); memory-systems (vector RAG, knowledge graphs, Zep); tool-design (building tools agents can use); evaluation (testing and measuring agent systems). I believe this is a good start, showing developers how to approach context engineering rather than relying on ready-made tools. You will also find the aggregated research documents I used to build these skills in the repo. The skills are synthesized from technical blogs on context and prompt engineering that I bookmarked, AI labs' documentation, and Anthropic Skills examples. Try the 7 skills, created using Anthropic's Skills template format. Experiment with the provided scripts and references, and feel free to contribute to the repo.
It’s actually a good question; the difference is subtle but structural. I usually frame it like this: AGENTS[.]md acts as the declarative context. You write this for every repo (and nested directories) to define the project structure, persona, and coding rules. Skills are the

Had my first successful experience using a browser agent to solve a real problem - in this case I had the Claude in Chrome extension help me find some configuration I had lost deep within the Cloudflare control panel https://t.co/fpgx8hpnce
OpenAI and Anthropic engineers don't prompt like everyone else. I've been reverse-engineering their techniques for 2.5 years across all AI models. Here are 5 prompting methods that get you AI engineer-level results: https://t.co/9k64cbQ5I4
New York’s New AI Law Creates Oversight Office, Standards https://t.co/TwOFyrn7wt @govtechnews
Splat's app uses AI to turn your photos into coloring pages for kids https://t.co/TJTDgCWE9h @SarahPerezTC @techcrunch
We have a new best open source model to close out 2025: GLM 4.7! It's been ~6mos since the first closed source model, Opus 4, broke 73% on SWE-Bench and GLM does 73.8%! It's fantastic at math and coding and beats DeepSeek / Kimi. Very cheap at $0.6/M in, $2.2/M out, 200k context, and fast too: 70tok/s! The bigger story here is how the gap between closed and open source models has evolved. This time last year, OpenAI launched o1-preview and the closest OSS thing we had was DeepSeek R1-lite. We're still ~6mos off but there are at least 4 close open-source competitors.
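As a rough illustration of the pricing quoted above ($0.6 per million input tokens, $2.2 per million output tokens, about 70 tokens/s), here is a minimal sketch of the arithmetic. The function names and the example request size are hypothetical, and it assumes flat per-token pricing with no caching discounts or minimum charges, which the post does not confirm.

```python
from decimal import Decimal

# Prices as quoted in the post, in USD per one million tokens.
INPUT_USD_PER_M = Decimal("0.6")
OUTPUT_USD_PER_M = Decimal("2.2")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request, assuming flat per-token pricing."""
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_M
    return (input_cost + output_cost) / TOKENS_PER_M


def generation_seconds(output_tokens: int, tokens_per_second: int = 70) -> float:
    """Estimate decode time at the quoted throughput (ignores prompt processing)."""
    return output_tokens / tokens_per_second


if __name__ == "__main__":
    # Hypothetical request: 150,000 input tokens and 4,000 output tokens.
    cost = request_cost_usd(150_000, 4_000)
    seconds = generation_seconds(4_000)
    print(f"estimated cost: ${cost:.4f}, estimated generation time: {seconds:.1f}s")
```

Under these assumptions, a long-context request that nearly fills the 200k window is dominated by input cost, since the input side carries far more tokens even at the lower per-token price.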
Woah, SAM3 just reached 1 million downloads on HuggingFace! https://t.co/zcIBDPWPOI
"May my heart be kind, my mind fierce, and my spirit brave.” ~K. Forsyth https://t.co/vWMZ7ItNkA

From complex customer feedback, @SurveySparrow is enabling real-time insights. Using robust, compliant AWS infrastructure & genAI solutions, the startup is helping marketers get value from data & scale with confidence. https://t.co/GINfqOZeLD
Experts say that a fraud that has long existed has now become both easier and faster. https://t.co/VPyLolODNd @CSOonline
AI translation is replacing interpreters in GP care: here’s why that’s troubling https://t.co/VV71XxDurD @ConversationUS @ConversationUK
AI 'friend' could help you plan your next vacation https://t.co/EqK69XGfiY @FuturityNews
The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text https://t.co/MAsRMGjV1R
discuss: https://t.co/uefcEJi9cw
nice https://t.co/WbiKCQ8UAg

Starlink is connecting more than 9M active customers with high-speed internet across 155 countries, territories and many other markets. Thank you to all our customers around the world! 🛰️❤️🌎 → https://t.co/lJSdYGR9qN https://t.co/HpnDaKmJyL

Source Control in VS Code now displays Git stashes 📦 Your hidden stashes just became easy to manage! https://t.co/o8TBYbBo7W
“Our wounds are often the openings into the best and most beautiful part of us.” ~ D. Richo https://t.co/X3fflIc0cJ

60 of our biggest AI announcements in 2025 https://t.co/xYbrL2uFlT @google
Yes, AI Is Really Impacting The Job Market. Here's What To Do. https://t.co/1zzOzer5tf @Josh_Bersin
Introducing Vector Lab, a new type of AI image editor Vector Lab exposes control of the underlying features in the diffusion process, allowing you to nudge an image around latent space. https://t.co/Or5t6VbsoB
I partnered with Dreamina to test out their new 3.5 Video model (Seedance 1.5), which now comes with native audio. I created this short scene to put the model through its paces. I was most impressed with the synthetic performance, which was more subtle and less uncanny than Veo 3.1. It was easy to prompt consistent voices for my characters. Also, check out the video quality and detail, especially elements like the teeth, which other models struggle with. All sound, apart from the Atmos track and the music, was generated in the model with the video. It's a great model that has already earned its place in my toolkit. Nano Banana for the image prompts, Topaz upscale and Suno music. @dreamina_ai
My beloved father has passed away. He fought with extraordinary courage until the very end with grace, strength, and a quiet dignity that defined his life. Even in his final days, when words were gone, his presence spoke louder than anything else ever could. Papa was my anchor, my example, and my definition of resilience. His love, values, and spirit will live on through all of us who were blessed to know him. I will share more when I’m able. For now, I ask for a moment of reflection and remembrance for a remarkable life.

[Report] An AI agent wins an optimization programming contest for the first time 🐟 Blog: https://t.co/BJut3OjtLA At the "ALGO ARTIS Programming Contest 2025 December (AtCoder Heuristic Contest 058)", held on December 14, 2025, "ALE-Agent" (AtCoder account name: fishylene), an AI agent developed by Sakana AI, took first place among 804 participants. With the cooperation of AtCoder Inc., Sakana AI has developed ALE-Bench, a benchmark for algorithm engineering tasks, and has continuously entered its own ALE-Agent in multiple contests. Coming after several months of improvement and trial and error, we consider this result an important milestone, showing that AI has reached a stage where it can perform on par with top human experts even on advanced optimization tasks that span several hours. ALE-Agent made full use of AI's strengths in the amount of computation and the number of reasoning attempts, while also showing very interesting behavior, such as discovering on its own an efficient neighborhood operation for simulated annealing that the problem setters had not anticipated. For details, see the blog and the logs and technical analysis below. Logs and technical analysis: https://t.co/Up6rNz2njG We are deeply grateful to the organizer, ALGO ARTIS Co., Ltd., for providing a wonderful problem, and to AtCoder Inc. for its continued cooperation. Going forward, we will continue to position AI as a partner that extends human exploratory capabilities and to pursue research aimed at solving complex real-world problems.
excited to support this awesome evals work by @SophontAI: the largest evaluation suite for assessing medical capabilities of LLMs across 15+ environments is now live on our hub! https://t.co/k59EIwOZzH
We're releasing Medmarks v0.1, the largest completely open-source automated evaluation suite for assessing the medical capabilities of LLMs! Developed in our @MedARC_AI community, w/ support from @PrimeIntellect So far we’ve explored 46 models to figure out the best! https://t.
We're excited to support @SophontAI's Medmarks release. The largest completely open-source evaluation suite for medical capabilities. Built using verifiers and published on the Environments Hub: https://t.co/hFNFZgz0TR
US mortgage lenders insure against artificial intelligence screening errors https://t.co/ePOp7O2ohn @leee_harris @ft
Why AI is a nightmare for the EU https://t.co/1Y8RxBo6eL @calder_mchugh @politicomag @POLITICOEurope
This is huge. First signs of incoming 1000x acceleration of scientific progress. "Researchers found that when scientists use AI, their productivity soared. The biggest jump was in the social sciences and humanities, where output increased by 59.8%, while biology and life sciences saw a 52.9% increase." "Meanwhile, in physics and math, the scientists report a 36.2% boost." "LLM adoption is associated with a large increase in researchers' scientific output," wrote the team.