@iScienceLuvr
CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning

"We introduce CURIE, a scientific long-Context Understanding, Reasoning and Information Extraction benchmark to measure the potential of Large Language Models (LLMs) in scientific… https://t.co/ErIGSJNpEv