@emollick
Unappreciated fact is the second scaling law does not seem to completely plateau in many tasks: throw more tokens at a reasoning AI model and get better answers, especially with a simple harness. Benchmark performance is actually limited by token usage. https://t.co/jP3TgIsT2T https://t.co/DQs6OJvNTy