PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma ā¢ 11 items ā¢ Updated 3 days ago ā¢ 91
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation Paper ā¢ 2404.19427 ā¢ Published 20 days ago ā¢ 65
š¤ Facial Expressions Recognition Collection Embrace the future of Facial Expressions Recognition with the latest AI-powered technologies! š ā¢ 4 items ā¢ Updated 8 days ago ā¢ 6
Russian speaking 7B models Collection There is some my 7B models good speak and understand Russian language. Approved by some data-set my own tests. Will be link to github repo soon...šŖ¬ ā¢ 7 items ā¢ Updated 3 days ago ā¢ 3
š¤ Big Five Personality Traits Collection The latest AI technologies usher in a new era of Big Five personality assessment š ā¢ 4 items ā¢ Updated 19 days ago ā¢ 2
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 ā¢ 127
š¤ LLM Spaces Collection A collection of applications demonstrating large language models (LLMs) š ā¢ 13 items ā¢ Updated 14 days ago ā¢ 6
PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations Paper ā¢ 2404.04421 ā¢ Published Apr 5 ā¢ 14
GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image Paper ā¢ 2404.02152 ā¢ Published Apr 2 ā¢ 3
Efficient 3D Implicit Head Avatar with Mesh-anchored Hash Table Blendshapes Paper ā¢ 2404.01543 ā¢ Published Apr 2 ā¢ 3
Audio-Visual Compound Expression Recognition Method based on Late Modality Fusion and Rule-based Decision Paper ā¢ 2403.12687 ā¢ Published Mar 19 ā¢ 3
VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis Paper ā¢ 2403.08764 ā¢ Published Mar 13 ā¢ 34
š Avatars Collection The latest AI-powered technologies usher in a new era of realistic avatars! š ā¢ 33 items ā¢ Updated 6 days ago ā¢ 49
š¼ļø Image Enhancement Collection Embrace the future of Image Enhancement with the latest AI-powered technologies! š ā¢ 1 item ā¢ Updated 19 days ago ā¢ 5
š Speech Enhancement Collection Unlocking a new era in Speech Enhancement, powered by the latest AI technologies, for superior audio quality improvements! š ā¢ 8 items ā¢ Updated 19 days ago ā¢ 7
PixArt-Ī£: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation Paper ā¢ 2403.04692 ā¢ Published Mar 7 ā¢ 35
YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information Paper ā¢ 2402.13616 ā¢ Published Feb 21 ā¢ 44
Vision-Based Hand Gesture Customization from a Single Demonstration Paper ā¢ 2402.08420 ā¢ Published Feb 13 ā¢ 7
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages Paper ā¢ 2309.09400 ā¢ Published Sep 17, 2023 ā¢ 77