Dmitry Ryumin's picture

Dmitry Ryumin

DmitryRyumin

·

https://dmitryryumin.github.io

AI & ML interests

Machine Learning and Applications, Multi-Modal Understanding

Organizations

DmitryRyumin's activity

upvoted a collection 6 days ago

PaliGemma Release

Pretrained and mix checkpoints for PaliGemma • 11 items • Updated 3 days ago • 91

upvoted a paper 19 days ago

InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation

Paper • 2404.19427 • Published 20 days ago • 65

upvoted a collection 20 days ago

🤔 Facial Expressions Recognition

Embrace the future of Facial Expressions Recognition with the latest AI-powered technologies! 🚀 • 4 items • Updated 8 days ago • 6

upvoted an article 25 days ago

Article

Custom architectures with HuggingFace 🤗

By

•

28 days ago

• 20

upvoted a collection 27 days ago

Russian speaking 7B models

There is some my 7B models good speak and understand Russian language. Approved by some data-set my own tests. Will be link to github repo soon...🪬 • 7 items • Updated 3 days ago • 3

upvoted an article 29 days ago

Article

Fine-tune Llama 3 with ORPO

By

•

28 days ago

• 179

upvoted a collection about 1 month ago

🤗 Big Five Personality Traits

The latest AI technologies usher in a new era of Big Five personality assessment 🚀 • 4 items • Updated 19 days ago • 2

upvoted an article about 1 month ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15

• 127

upvoted a collection about 1 month ago

🤖 LLM Spaces

A collection of applications demonstrating large language models (LLMs) 🚀 • 13 items • Updated 14 days ago • 6

upvoted a paper about 1 month ago

PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations

Paper • 2404.04421 • Published Apr 5 • 14

upvoted 4 papers about 2 months ago

GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image

Paper • 2404.02152 • Published Apr 2 • 3

Efficient 3D Implicit Head Avatar with Mesh-anchored Hash Table Blendshapes

Paper • 2404.01543 • Published Apr 2 • 3

Adversarial AutoMixup

Paper • 2312.11954 • Published Dec 19, 2023 • 2

ViTAR: Vision Transformer with Any Resolution

Paper • 2403.18361 • Published Mar 27 • 48

upvoted 2 papers 2 months ago

Audio-Visual Compound Expression Recognition Method based on Late Modality Fusion and Rule-based Decision

Paper • 2403.12687 • Published Mar 19 • 3

VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis

Paper • 2403.08764 • Published Mar 13 • 34

upvoted 3 collections 2 months ago

🎭 Avatars

The latest AI-powered technologies usher in a new era of realistic avatars! 🚀 • 33 items • Updated 6 days ago • 49

🖼️ Image Enhancement

Embrace the future of Image Enhancement with the latest AI-powered technologies! 🚀 • 1 item • Updated 19 days ago • 5

🔊 Speech Enhancement

Unlocking a new era in Speech Enhancement, powered by the latest AI technologies, for superior audio quality improvements! 🚀 • 8 items • Updated 19 days ago • 7

upvoted a paper 2 months ago

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Paper • 2403.04692 • Published Mar 7 • 35

upvoted a collection 3 months ago

OpenCodeInterpreter

18 items • Updated Mar 3 • 73

upvoted 3 papers 3 months ago

Neural Network Diffusion

Paper • 2402.13144 • Published Feb 20 • 92

YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Paper • 2402.13616 • Published Feb 21 • 44

Vision-Based Hand Gesture Customization from a Single Demonstration

Paper • 2402.08420 • Published Feb 13 • 7

upvoted a collection 7 months ago

ICCV 2023 Demos

Demos for ICCV 2023 papers • 38 items • Updated Oct 5, 2023 • 10

upvoted a paper 8 months ago

CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

Paper • 2309.09400 • Published Sep 17, 2023 • 77