@ADarmouni
Honestly FineVision is a pretty impressive work of aggregation 200 training sets condensed in a dataset of 18B images, segmented in 9 different subcategories, multi-turn, with quality rating and very documented ablation studies? As always, @huggingface delivers in open data https://t.co/5DmP1ZaG6J