@iScienceLuvr
We have a holiday surprise for y'all! Introducing Medmarks v0.1! At Sophont, we're interested in pushing forward the medical capabilities of LLMs but we realized open benchmarking is still quite lacking. So we created an evaluation suite! We spent the past 3 months working with our @MedARC_AI research community and @PrimeIntellect to build the Medmarks leaderboard. We hope you find it interesting!