@arafatkatze
Turns out @openblocklabs is a complete fraud who gamed their Terminal bench SOTA score. They cheated by putting the result verifier values INSIDE the binary before running the eval and then publicly reported that score as their SOTA score. Read the breakdown here