@johnowhitaker
Dang it, I made an eval I thought I'd trounce LLMs at: identifying species in photos I've taken over the years, given ~5 plausible options. TIL 1) I don't know my Latin names as well as I thought, and 2) 4o apparently does 😂 Writeup coming once I've done the human baseline score + some polish https://t.co/iEVirKt3Vz