@arankomatsuzaki
LongCat-Next: Lexicalizing Modalities as Discrete Tokens - Matches or beats SOTA across multimodal benchmarks - SotA audio: strong on both recognition and TTS accuracy - No trade-offs: adds vision/audio without hurting core language performance https://t.co/g2LKPI6mnp