@BlancheMinerva
Claude-4 Sonnet scores quite well on SPOT, our recent benchmark for identifying errors in academic papers. Its precision of 11.3% is far ahead of its competition, but probably not something you'd want to rely on to report you for fraud... https://t.co/Bwg3beDscd