@reach_vb
Welcome Bunny - MLLM! 📸 Smol models ftw! > Lightweight & powerful multimodal model. > Plug-and-play vision encoders. > Vision encoders - EVA-CLIP, SigLIP > LLM - Phi-1.5, StableLM-2 and Phi-2 > Bunny-3B (SigLIP and Phi-2) - beats 13B models. https://t.co/vE7wg7Ai0h