Your curated collection of saved posts and media
We’re announcing Kos-1 Lite, a medical model that achieves SOTA on HealthBench Hard at 46.6%. As a medium-sized language model (~100B), it achieves these results at a fraction of the serving cost of frontier trillion-parameter models. https://t.co/27sxAHPgZM
SWE-rebench V2 A language-agnostic pipeline that automatically harvests 32,000+ executable real-world software engineering tasks across 20 programming languages. Built for large-scale RL training of code agents with reproducible Docker environments. https://t.co/JJ0vLH5N7B
Image Generation with a Sphere Encoder https://t.co/6I2FbpogaC
Utonia Toward One Encoder for All Point Clouds paper: https://t.co/AJFPivgBm9 https://t.co/Xbux4iY1QV
BeyondSWE Can Current Code Agent Survive Beyond Single-Repo Bug Fixing? paper: https://t.co/IrLgJJomQU
Beyond Language Modeling An Exploration of Multimodal Pretraining paper: https://t.co/GmtPAQDo8T
Beyond Length Scaling Synergizing Breadth and Depth for Generative Reward Models https://t.co/25QhR93OKK
Kiwi-Edit Versatile Video Editing via Instruction and Reference Guidance https://t.co/s9xlDgXhfc
BBQ-to-Image Numeric Bounding Box and Qolor Control in Large-Scale Text-to-Image Models paper: https://t.co/54U6zmx2ZA https://t.co/fW8zbIrE19
Video world models today have a very limited context length. Mode Seeking meets Mean Seeking (MMM) unlocks long-context, persistent video world models through a unified representation. 1/8 🧵 https://t.co/XXMic82qoc
The Faster-Qwen3-TTS demo just passed the official Qwen3-TTS demo in Hugging Face trending (last 7 days). Now the #5 most trending Space https://t.co/8Ar622nKU9
Helios Real Real-Time Long Video Generation Model paper: https://t.co/ae0ZH4zPzn https://t.co/kCnNfF3ImI
Heterogeneous Agent Collaborative Reinforcement Learning https://t.co/ASb1VwtCeK
Proact-VL A Proactive VideoLLM for Real-Time AI Companions https://t.co/GkHdSKxSvi
CubeComposer Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video paper: https://t.co/mnDM1VrYn7 https://t.co/iHtlZJCo1w
LTX-2.3 is out on Hugging Face model: https://t.co/te5nwPL1LE https://t.co/biO7szxFGz
Tencent released HY-WU on Hugging Face An Extensible Functional Neural Memory Framework and An Instantiation in Text-Guided Image Editing model: https://t.co/jAnic8Z9i1 https://t.co/LsLpyjMVQT
New model updates from iquestlab. If you're trying to find an inference model that you can run offline, this is probably the one you're looking for. - 7B and 14B coding models - Optimized for tool use, CLI agents and HTML generation - 128k context length - Explicit and detailed prompting works best - MIT license with a requirement to display the logo - Available on @huggingface
With the help of @huggingface we (w/ @RisingSayak) are building an ML Club India 🇮🇳 What we want to do: 1. Online talks 2. IST-compatible timing 3. Open to all More to come this week! Watch this space. 🤗 Special thanks to @LysandreJik who motivated me to keep working on this. 🔥
Thanks @AnthropicAI. Thanks @huggingface for letting me work on Diffusers and other open-source projects across the fleet. https://t.co/R2qCNHk5lb
🔥 Learn how to build your own tool-calling agent with @huggingface TRL + @Alibaba_Qwen Qwen3.5 on @Azure Machine Learning! - @NousResearch hermes-function-calling-v1, 500 single-turn samples - SFT with TRL on Qwen3.5 2B (released today!) on a single NVIDIA H100 - Everything on Azure, from Container Registry to Machine Learning! Step-by-step in the thread 🧵
agentic RL hackathon this weekend! mentors from @PyTorch, @huggingface, and @UnslothAI will guide you to build agentic environments to win from a pool of $100K prizes + free compute and token credits just for attending! lock in mar 7-8 in SF. https://t.co/erZRAJrgrA