@iScienceLuvr
OwkinZero: Accelerating Biological Discovery with AI "We introduce a new benchmark of eight datasets with over 300,000 verifiable Q&A pairs, designed to test complex problem-solving across the drug discovery pipeline." "We demonstrate that specialized models, post- trained via reinforcement learning, substantially outperform larger, state-of-the-art commercial LLMs on our biological benchmarks." "We uncover insights into cross-task generalization, where specialist models trained on a single task show improved performance on unseen, out-of- domain tasks compared to their base models." "Our OwkinZero models, trained on a mixture of datasets, amplify this effect, achieving broader cross-task generalization and outperforming single- task specialists even on their respective in-domain tasks."