@reach_vb
๐จ Apple just released FastVLM on Hugging Face - 0.5, 1.5 and 7B real-time VLMs with WebGPU support ๐คฏ > 85x faster and 3.4x smaller than comparable sized VLMs > 7.9x faster TTFT for larger models > designed to output fewer output tokens and reduce encoding time for high resolution images Bonus: works in REALTIME directly in your browser powered by transformers.js and WebGPU ๐ฅ Try it out on the demo below ๐