OpenGVLab
/

InternVL-Chat-V1-1

Visual Question Answering

feature-extraction

Model card Files Files and versions Community

czczup commited on Apr 26

Commit

a449055

•

1 Parent(s): ccc339e

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ pipeline_tag: visual-question-answering
   <img src="https://cdn-uploads.huggingface.co/production/uploads/64119264f0f81eb569e0d569/4IG0h_KJ2cvpp9Kdm0Jf7.webp" alt="Image Description" width="300" height="300">
 </p>
-\[[Paper](https://arxiv.org/abs/2312.14238)\]  \[[GitHub](https://github.com/OpenGVLab/InternVL)\] \[[Chat Demo](https://internvl.opengvlab.com/)\] \[[中文解读](https://zhuanlan.zhihu.com/p/675877376)]
 We released InternVL-Chat-V1.1, featuring a structure similar to LLaVA, including a ViT, an MLP projector, and an LLM. In this version, we explored increasing the resolution to 448x448, enhancing OCR capabilities, and improving support for Chinese conversations.

   <img src="https://cdn-uploads.huggingface.co/production/uploads/64119264f0f81eb569e0d569/4IG0h_KJ2cvpp9Kdm0Jf7.webp" alt="Image Description" width="300" height="300">
 </p>
+\[[InternVL 1.5 Technical Report](https://arxiv.org/abs/2404.16821)\]  \[[Paper](https://arxiv.org/abs/2312.14238)\]  \[[GitHub](https://github.com/OpenGVLab/InternVL)\] \[[Chat Demo](https://internvl.opengvlab.com/)\] \[[中文解读](https://zhuanlan.zhihu.com/p/675877376)]
 We released InternVL-Chat-V1.1, featuring a structure similar to LLaVA, including a ViT, an MLP projector, and an LLM. In this version, we explored increasing the resolution to 448x448, enhancing OCR capabilities, and improving support for Chinese conversations.