Collections

Discover the best community collections!

Collections including paper arxiv:2310.03744
Vision Language Models Papers 🖼️💬📝
Papers about vision-language models, most important ones are on top of the list.
LLaVa-NeXT
LLaVa-NeXT (also known as LLaVa-1.6) improves upon the 1.5 series by incorporating higher image resolutions and more reasoning/OCR datasets.
Multimodal Papers
Collection by 28 days ago
to_read
Collection by 15 days ago
VLMs for 3D reconstructions and their evaluation
List of papers to help with developing a model that reviews a photogrammetry scan and evaluates its quality