@emollick
I hate to keep bringing this up, but studies cannot lump reasoners with earlier models when considering AI abilities. And while studies don’t always need to use the latest models, they should test whether there are trends in ability as model size scales, to anticipate the future. https://t.co/t1iO9w2E0N