HuggingFaceM4/idefics2-8b
Image-Text-to-Text
•
Updated
•
40.9k
•
308
Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation.