PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 11 items • Updated 3 days ago • 91
Article Multimodal Augmentation for Documents: Recovering “Comprehension” in “Reading and Comprehension” task By danaaubakirova • 4 days ago • 15
Article Synthetic dataset generation techniques: Self-Instruct By davanstrien • 5 days ago • 3
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation Paper • 2405.01434 • Published 18 days ago • 44
RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion Paper • 2404.07199 • Published Apr 10 • 22
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation Paper • 2404.19427 • Published 20 days ago • 65
Edit Your Image! Collection Find all the trending and useful Gradio demos that you can use to edit your images. • 21 items • Updated 24 days ago • 21
FABLES: Evaluating faithfulness and content selection in book-length summarization Paper • 2404.01261 • Published Apr 1 • 3
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis Paper • 2404.13686 • Published 29 days ago • 25
Article The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare Apr 19 • 64
Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent 28 days ago • 71
Factorized Diffusion: Perceptual Illusions by Noise Decomposition Paper • 2404.11615 • Published Apr 17 • 2
[lecture artifacts] aligning open language models Collection artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin • 63 items • Updated Apr 17 • 43
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback Paper • 2404.07987 • Published Apr 11 • 46
HF-curated models available on Workers AI Collection A collection of models curated with Hugging Face that can be run on Cloudflare's Workers AI serverless inference platform. • 15 items • Updated Apr 2 • 48
SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions Paper • 2403.16627 • Published Mar 25 • 20
Transparent Image Layer Diffusion using Latent Transparency Paper • 2402.17113 • Published Feb 27 • 5
LayerDiffusion: Layered Controlled Image Editing with Diffusion Models Paper • 2305.18676 • Published May 30, 2023 • 1
DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models Paper • 2402.19481 • Published Feb 29 • 16
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation Paper • 2402.10210 • Published Feb 15 • 28
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27 • 566
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper • 2402.14905 • Published Feb 22 • 80
Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem Paper • 2205.01954 • Published May 4, 2022 • 1
Differential Diffusion: Giving Each Pixel Its Strength Paper • 2306.00950 • Published Jun 1, 2023 • 2
SDXL-Lightning: Progressive Adversarial Diffusion Distillation Paper • 2402.13929 • Published Feb 21 • 24
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated 6 days ago • 304
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated Apr 3 • 77
Text-to-Image Base Models Collection All text-to-image open source base models, with their respective license • 28 items • Updated 10 days ago • 17
L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects Paper • 2402.09052 • Published Feb 14 • 16
Self-Discover: Large Language Models Self-Compose Reasoning Structures Paper • 2402.03620 • Published Feb 6 • 102
OLMo Suite Collection Artifacts for the first set of OLMo models. • 12 items • Updated 5 days ago • 35
AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning Paper • 2402.00769 • Published Feb 1 • 17
Diffuse to Choose: Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All Paper • 2401.13795 • Published Jan 24 • 64
Tweets to Citations: Unveiling the Impact of Social Media Influencers on AI Research Visibility Paper • 2401.13782 • Published Jan 24 • 2
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data Paper • 2401.10891 • Published Jan 19 • 53
MAGNeT Collection Masked Audio Generation using a Single Non-Autoregressive Transformer • 9 items • Updated Apr 4 • 30
Diffusion DPO LoRA Collection How to train: https://github.com/huggingface/diffusers/tree/main/examples/research_projects/diffusion_dpo • 4 items • Updated Jan 12 • 4
PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models Paper • 2401.05252 • Published Jan 10 • 43
Optimizing diffusion models Collection Provides a list of papers focusing on optimizing T2I diffusion models, targeting fewer timesteps, architecture optimization, and more. • 20 items • Updated 24 days ago • 12
Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization Paper • 2308.14469 • Published Aug 28, 2023 • 6