@teortaxesTex
Unpopular take: the LLM community is *coping* about quantization. Any real test of reasoning shows k-quants ≤ Q5 fail. Perplexity and evals are misleading: do you care that it's only a 1% loss if it takes out 99% of the hardest skills? We need kernels for AWQ, SpQR, or better, for all platforms. https://t.co/5RNOScIzgj
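
To make the "1% loss vs. 99% of the hardest skills" arithmetic concrete, here is a minimal sketch. All of the numbers are made up for illustration (a hypothetical 10,000-item benchmark with 100 hard reasoning items); they are not measurements of any real model or quant format. The point is only that an aggregate score can hide a collapse on a small, hard subset.

```python
# Hypothetical benchmark: 10,000 items, 100 of which are "hard" reasoning items.
EASY_TOTAL, HARD_TOTAL = 9_900, 100

# Made-up counts of correct answers (illustrative assumptions, not real results).
baseline = {"easy": 9_900, "hard": 100}   # full-precision model
quantized = {"easy": 9_900, "hard": 1}    # quantized model

def report(correct):
    """Return (aggregate accuracy, hard-subset accuracy) for a dict of correct counts."""
    overall = (correct["easy"] + correct["hard"]) / (EASY_TOTAL + HARD_TOTAL)
    hard = correct["hard"] / HARD_TOTAL
    return overall, hard

base_overall, base_hard = report(baseline)
q_overall, q_hard = report(quantized)

overall_loss = base_overall - q_overall   # drop in the headline number
hard_loss = base_hard - q_hard            # drop on the hard subset only

print(f"aggregate accuracy loss:   {overall_loss:.2%}")
print(f"hard-subset accuracy loss: {hard_loss:.2%}")
```

Under these assumed counts the headline number moves by about one point while the hard subset is almost entirely lost, which is why reporting the hard subset separately matters more than the average.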