Anton Obukhov

toshas

AI & ML interests

None yet

Organizations

Posts 3

view post
Post
1844
Another gem from our lab β€” DGInStyle! We use Stable Diffusion to generate semantic segmentation data for autonomous driving and train domain-generalizable networks.

πŸ“Ÿ Website: https://dginstyle.github.io
🧾 Paper: https://arxiv.org/abs/2312.03048
πŸ€— Hugging Face Paper: DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control (2312.03048)
πŸ€— Hugging Face Model: yurujaja/DGInStyle
πŸ™ Code: https://github.com/yurujaja/DGInStyle

In a nutshell, our pipeline overcomes the resolution loss of Stable Diffusion latent space and the style bias of ControlNet, as shown in the attached figures. This allows us to generate sufficiently high-quality pairs of images and semantic masks to train domain-generalizable semantic segmentation networks.

Team: Yuru Jia ( @yurujaja ), Lukas Hoyer, Shengyu Huang, Tianfu Wang ( @Tianfwang ), Luc Van Gool, Konrad Schindler, and Anton Obukhov ( @toshas ).
view post
Post
1897
Introducing Marigold-LCM 🌼 β€” a FAST version of the now popular state-of-the-art depth estimator! Thanks to the latent consistency distillation, it retains the precision of the original Marigold but reaches the solution in just a few steps!

Check out the teaser video attached below and play with the new demo - it accepts videos now! Also, meet the new team member: Tianfu Wang ( @Tianfwang )

πŸ€— Demo: prs-eth/marigold-lcm
πŸ€— Model: prs-eth/marigold-lcm-v1-0
πŸ€— Original Marigold post: https://huggingface.co/posts/toshas/656973498012745
πŸ€— Paper: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation (2312.02145)
🌐 Website: https://marigoldmonodepth.github.io
πŸ‘Ύ Code: https://github.com/prs-eth/marigold
πŸ‘Ύ Code: pip install diffusers

models

None public yet

datasets

None public yet