radames (Radamés Ajna)

upvoted a collection 4 days ago

PaliGemma Release

Collection

Pretrained and mix checkpoints for PaliGemma • 11 items • Updated 3 days ago • 91

upvoted 2 articles 4 days ago

Article

Multimodal Augmentation for Documents: Recovering “Comprehension” in “Reading and Comprehension” task

By

•

4 days ago

• 15

Article

Synthetic dataset generation techniques: Self-Instruct

By

•

5 days ago

• 3

upvoted an article 5 days ago

Article

2024-04-22 - Hub Incident Post Mortem

By

•

3 days ago

• 15

upvoted 2 papers 6 days ago

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

Paper • 2405.01434 • Published 18 days ago • 44

RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion

Paper • 2404.07199 • Published Apr 10 • 22

upvoted 2 papers 18 days ago

Iterative Reasoning Preference Optimization

Paper • 2404.19733 • Published 20 days ago • 41

InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation

Paper • 2404.19427 • Published 20 days ago • 65

upvoted a paper 20 days ago

ZeST: Zero-Shot Material Transfer from a Single Image

Paper • 2404.06425 • Published Apr 9 • 4

upvoted a collection 24 days ago

Edit Your Image!

Collection

Find all the trending and useful Gradio demos that you can use to edit your images. • 21 items • Updated 24 days ago • 21

upvoted a collection 25 days ago

OpenELM Instruct Models

Collection

4 items • Updated Apr 12 • 96

upvoted 2 papers 27 days ago

FABLES: Evaluating faithfulness and content selection in book-length summarization

Paper • 2404.01261 • Published Apr 1 • 3

Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis

Paper • 2404.13686 • Published 29 days ago • 25

upvoted 3 articles 28 days ago

Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Apr 19

• 64

Article

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

28 days ago

• 71

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18

• 239

upvoted a paper about 1 month ago

Factorized Diffusion: Perceptual Illusions by Noise Decomposition

Paper • 2404.11615 • Published Apr 17 • 2

upvoted an article about 1 month ago

Article

AI Apps in a Flash with Gradio's Reload Mode

Apr 16

• 16

upvoted a collection about 1 month ago

[lecture artifacts] aligning open language models

Collection

artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin • 63 items • Updated Apr 17 • 43

upvoted 2 papers about 1 month ago

ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback

Paper • 2404.07987 • Published Apr 11 • 46

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11 • 79

upvoted a collection about 1 month ago

CodeGemma Release

Collection

16 items • Updated 6 days ago • 59

upvoted an article about 1 month ago

Article

Outpainting II - Differential Diffusion

By

•

27 days ago

• 24

upvoted a collection about 2 months ago

HF-curated models available on Workers AI

Collection

A collection of models curated with Hugging Face that can be run on Cloudflare's Workers AI serverless inference platform. • 15 items • Updated Apr 2 • 48

upvoted 2 papers about 2 months ago

SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions

Paper • 2403.16627 • Published Mar 25 • 20

ReNoise: Real Image Inversion Through Iterative Noising

Paper • 2403.14602 • Published Mar 21 • 19

upvoted 4 papers 2 months ago

upvoted 4 papers 3 months ago

Trajectory Consistency Distillation

Paper • 2402.19159 • Published Feb 29 • 13

DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Paper • 2402.19481 • Published Feb 29 • 16

Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation

Paper • 2402.10210 • Published Feb 15 • 28

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27 • 566

upvoted a collection 3 months ago

MobiLlama

Collection

Collection of MobiLlama Language Models. • 6 items • Updated 24 days ago • 14

upvoted 6 papers 3 months ago

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22 • 80

Genie: Generative Interactive Environments

Paper • 2402.15391 • Published Feb 23 • 67

Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem

Paper • 2205.01954 • Published May 4, 2022 • 1

Differential Diffusion: Giving Each Pixel Its Strength

Paper • 2306.00950 • Published Jun 1, 2023 • 2

SDXL-Lightning: Progressive Adversarial Diffusion Distillation

Paper • 2402.13929 • Published Feb 21 • 24

Aria Everyday Activities Dataset

Paper • 2402.13349 • Published Feb 20 • 28

upvoted 2 collections 3 months ago

Gemma release

Collection

Groups the Gemma models released by the Google team. • 40 items • Updated 6 days ago • 304

Zeroshot Classifiers

Collection

These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated Apr 3 • 77

upvoted a paper 3 months ago

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15 • 90

upvoted a collection 3 months ago

Text-to-Image Base Models

Collection

All text-to-image open source base models, with their respective license • 28 items • Updated 10 days ago • 17

upvoted 3 papers 3 months ago

L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects

Paper • 2402.09052 • Published Feb 14 • 16

Self-Discover: Large Language Models Self-Compose Reasoning Structures

Paper • 2402.03620 • Published Feb 6 • 102

Training-Free Consistent Text-to-Image Generation

Paper • 2402.03286 • Published Feb 5 • 62

upvoted a collection 3 months ago

OLMo Suite

Collection

Artifacts for the first set of OLMo models. • 12 items • Updated 5 days ago • 35

upvoted 4 papers 4 months ago

AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning

Paper • 2402.00769 • Published Feb 1 • 17

Diffuse to Choose: Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All

Paper • 2401.13795 • Published Jan 24 • 64

Tweets to Citations: Unveiling the Impact of Social Media Influencers on AI Research Visibility

Paper • 2401.13782 • Published Jan 24 • 2

Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

Paper • 2401.10891 • Published Jan 19 • 53

upvoted 2 collections 4 months ago

MAGNeT

Collection

Masked Audio Generation using a Single Non-Autoregressive Transformer • 9 items • Updated Apr 4 • 30

Diffusion DPO LoRA

Collection

How to train: https://github.com/huggingface/diffusers/tree/main/examples/research_projects/diffusion_dpo • 4 items • Updated Jan 12 • 4

upvoted a paper 4 months ago

PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models

Paper • 2401.05252 • Published Jan 10 • 43

upvoted a collection 4 months ago

Optimizing diffusion models

Collection

Provides a list of papers focusing on optimizing T2I diffusion models, targeting fewer timesteps, architecture optimization, and more. • 20 items • Updated 24 days ago • 12

upvoted 3 papers 5 months ago

TinyLlama: An Open-Source Small Language Model

Paper • 2401.02385 • Published Jan 4 • 81

Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization

Paper • 2308.14469 • Published Aug 28, 2023 • 6

aMUSEd: An Open MUSE Reproduction

Paper • 2401.01808 • Published Jan 3 • 26

Radamés Ajna

AI & ML interests

Articles

Hugging Face + Google Visual Blocks

Organizations

radames's activity

Multimodal Augmentation for Documents: Recovering “Comprehension” in “Reading and Comprehension” task

Synthetic dataset generation techniques: Self-Instruct

2024-04-22 - Hub Incident Post Mortem

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Welcome Llama 3 - Meta's new open LLM

AI Apps in a Flash with Gradio's Reload Mode

Outpainting II - Differential Diffusion