Visual Document Understanding - a Norm Collection

Visual Document Understanding

Updated Sep 22, 2023

A collection of papers on document AI models that pair an image encoder with a text decoder.

Kosmos-2.5: A Multimodal Literate Model

Paper • 2309.11419 • Published Sep 20, 2023