@rasbt
Since both phi-1.5 and mistral 7B are now supported in Lit-GPT, I just ran it through a random selection of Evaluation Harness tasks. It's pretty good at random arithmetic benchmarks. Otherwise, I'd say it's good but maybe not that suspiciously good. https://t.co/UzsghzXKuW