@johnowhitaker
Wondering what the recent 'hybrid linear attention' buzz is about? I recorded a quick video looking at Jet-Nemotron, Gated DeltaNet and related pieces, prompted by the next Qwen possibly being a nice-looking hybrid model :) Hope it's useful: https://t.co/aCVgaxxESR