@SophontAI
We're releasing Medmarks v0.1, the largest completely open-source automated evaluation suite for assessing the medical capabilities of LLMs! Developed in our @MedARC_AI community, w/ support from @PrimeIntellect So far we’ve explored 46 models to figure out the best! https://t.co/Hfrwm12cnW