@HelloSurgeAI
Teaching LLMs to follow instructions? Step 1.
Teaching them to have taste? That's the endgame.

An 8-line poem about the moon can check every box:
✅ Moon: mentioned
✅ Lines: 8
✅ Rhymes: yes!

...and still be completely forgettable.

The models that win aren't the most obedient. They're the ones that understand quality. Nuance. Voice. The ones that take your breath away.

This is what separates the best frontier labs from the rest: understanding the difference between "technically correct" and "actually good."

(That's why a lot of post-training researchers aren't just scientists. They're also... Artists.)

Our CEO @echen talked about this on Gradient Dissent a couple months ago: training for taste, scaling the complexities of human judgment, and the art and science of post-training.

https://t.co/Ui7936KsgA