Compositional Text-to-Image Generation with Dense Blob Representations Paper • 2405.08246 • Published 20 days ago • 11
Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning Paper • 2405.08054 • Published 21 days ago • 21
Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory Paper • 2405.08707 • Published 20 days ago • 26
Xmodel-VLM: A Simple Baseline for Multimodal Vision Language Model Paper • 2405.09215 • Published 19 days ago • 14
CAT3D: Create Anything in 3D with Multi-View Diffusion Models Paper • 2405.10314 • Published 18 days ago • 38
Chameleon: Mixed-Modal Early-Fusion Foundation Models Paper • 2405.09818 • Published 18 days ago • 96
Infinite-ID: Identity-preserved Personalization via ID-semantics Decoupling Paradigm Paper • 2403.11781 • Published Mar 18 • 17
IDAdapter: Learning Mixed Features for Tuning-Free Personalization of Text-to-Image Models Paper • 2403.13535 • Published Mar 20 • 20
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation Paper • 2403.04692 • Published Mar 7 • 36
Inst-Inpaint: Instructing to Remove Objects with Diffusion Models Paper • 2304.03246 • Published Apr 6, 2023 • 2