@HamelHusain
How do you write evals to test your AI's ability to express uncertainty or refrain from answering when it shouldn't? Answer: assemble the right dataset and use it as a test harness + look at the data 😅 . Links in reply https://t.co/b678c2vsBf