@rasbt
This interesting week started with DeepSeek V3.2! I just wrote up a technical tour of the predecessors and components that led up to this: π https://t.co/JSAd9cx2s6 - Multi-Head Latent Attention - RLVR - Sparse Attention - Self-Verification - GRPO Updates https://t.co/5f965hR70I