@arankomatsuzaki
LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming? - A benchmark composed of problems from Codeforces, ICPC, and IOI that are continuously updated - The best model achieves only 53% pass@1 on medium-difficulty problems and 0% on hard problems https://t.co/uuTsU7xw5J