@kohjingyu
You can now try out πGILL on HuggingFace Spaces! GILL is a model capable of conditioning on interleaved image + text inputs to generate image + text outputs. Try the demo now: https://t.co/VlWCIs0ifQ Project page: https://t.co/w6VgcntGl2 More cool examples below π§΅π https://t.co/R9QU0lswnW