Multimodal Augmentation for Documents: Recovering “Comprehension” in “Reading and Comprehension” task
The Document AI team (@Molbap, @rwightman, @danaaubakirova) at Hugging Face is developing a new multimodal data augmentation pipeline utilising both visual and textual aspects of document images. Check out my latest blog post for more details: https://huggingface.co/blog/danaaubakirova/doc-augmentation Please share your thoughts and suggestions with us. And stay tuned for updates!