@rao2z
In the year since LRMs ("reasoning models") hit the scene, we have been trying to understand, analyze and demystify them.. Here are our efforts to date--conveniently all in one place (First..) Evaluation of LRMs on Planning 📜https://t.co/uu3aZzVSX4 (9/24)& 📜https://t.co/dRg7qa3uoz (TMLR) 🧵 https://t.co/CvHuWhlKNj Semantics of Intermediate Tokens (CoT's) Study on Mazes: 📜https://t.co/4LGWfiCZ5e 🧵 https://t.co/y3BthniqSG Study on CoTemp Q&A: 📜 https://t.co/Cnlb96mqKd 🧵 https://t.co/CaeVu0ex46 Analysis of RL on LRMs 📜https://t.co/021pXx842x 🧵https://t.co/XbqAyJIyB4 Interpretability of Intermediate tokens 📜https://t.co/e2J5pQLhGj 🧵https://t.co/74FSZvQ7c2 Intermediate tokens and problem complexity 📜https://t.co/C5y772QIue 🧵 https://t.co/UKgCwgHKeQ Perspective on LRMs 📜https://t.co/Skv2WIKyZY (also at https://t.co/d2fIIX82NT) (Position against anthropomorphization of Intermediate Tokens) 📜https://t.co/4f5eg5vRnA 🧵https://t.co/f6E3c2j4dm Relevant recent talks https://t.co/6lyhPLYVcY TBC..