@AhmedHAwadallah
Fara-7B is our first agentic small language model for computer use. We learned a lot, and looking forward to next steps: *Agentic models can be small, yet remain capable *Unlike solutions that rely on chat model wrappers, even small agentic models can process screenshots and perform direct GUI actions such as scrolling, typing, and clicking. *Simulation-driven multi-agent synthetic data to automates task generation, trajectory generation and validation is a way to address the agentic data scarcity gap, and in our case costs < $1 per task. *Evaluating CUA is hard ; we release WebTailBench, a new eval set with diverse tasks not found in other benchmarks, and work with an external party, Browserbase, to independently assessed Fara-7B using human annotators. Model available on Foundry and HuggingFace and can run on device on Copilot+ PC