@seb_ruder
MENLO framework includes: 📊 6,423 human-labeled prompt-response preference pairs 🌐 47 language varieties 🧭 4 structured quality dimensions (fluency, tone, etc.) ✅ High inter-annotator agreement ⚖️ Pairwise judgments → better signal https://t.co/SW5nX5FwrX