@omarsar0
Knowledge or Reasoning? Evaluation matters, and even more so when using reasoning LLMs. Look at final response accuracy, but also pay attention to thinking trajectories. Lots of good findings on this one. Here are my notes: https://t.co/88Sk9LP7n6