@JianZhangCS
@Nvidia Nemotron 3 Nano is live! Nemotron 3 Nano is the world's most efficient open MoE, with a hybrid MoE architecture and 1M context length.

🔥 Strong in reasoning, agentic, and chat tasks, with leading accuracy on the AA Index, Tau2, and SWE-Bench.
🔥 Up to 3.3x higher throughput compared to other open MoE models of similar size.
🔥 A fully open recipe, with data and infra released to the community.

Check out the new model architecture and the reinforcement learning technologies we used below:

Hugging Face: https://t.co/UX9L9QmuWJ
Research blog: https://t.co/NeTb5xANxR
NeMo RL & NeMo Gym (RL environment orchestration): https://t.co/fD78eabCZv & https://t.co/E3Q67AIA4j

Kudos to the teams for months of hard work! We are excited to keep building the Nemotron 3 model family and empowering the community.