Norm's Collections
Visual Document Understanding
Updated Sep 22, 2023
A collection of papers on image-encoder plus text-decoder models for document AI.
Kosmos-2.5: A Multimodal Literate Model
Paper • 2309.11419 • Published Sep 20, 2023 • 49