@emollick
Interesting attempt by Salesforce to create a benchmark for realistic business tasks - we need more of these! Worth tracking over time (though I would love to see an contest, ARC-AGI style, to ask people to try to beat these benchmarks and see if they can with prompts & tools) https://t.co/eWokRVlFHk