@arankomatsuzaki
Google presents PaLM2-VAdapter SoTA visual understanding and multi-modal reasoning capabilities with 30∼70% fewer parameters than the SoTA VLM, marking a significant efficiency improvement https://t.co/lvC4UJ0DqM https://t.co/K1zS8Mu95L