@emollick
What I find very funny about these “leaks” is that they don’t even bother to get ballpark benchmarks to feed into the image generators. Ask the model to look up real data, at least. It’s easy! Like GPQA is over 90% for all recent models. https://t.co/XljT8L3QCJ