AdinaY (Adina Yakefu)

upvoted a paper about 23 hours ago

Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection

Paper • 2405.10300 • Published 1 day ago • 12

upvoted a paper about 24 hours ago

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published 2 days ago • 50

upvoted 2 papers 3 days ago

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Paper • 2405.08748 • Published 4 days ago • 13

What matters when building vision-language models?

Paper • 2405.02246 • Published 15 days ago • 73

upvoted an article 3 days ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

4 days ago

• 91

upvoted a collection 3 days ago

MAmmoTH2

Collection

Scaling up instruction data from the web for to build better LLMs • 10 items • Updated 7 days ago • 4

upvoted 2 papers 4 days ago

Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots

Paper • 2405.07990 • Published 5 days ago • 15

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published 5 days ago • 51

upvoted a collection 5 days ago

Yi-1.5 (2024/05)

Collection

6 items • Updated 6 days ago • 59

upvoted a collection 11 days ago

SPPO

Collection

Self-Play Preference Optimization • 4 items • Updated 14 days ago • 2

upvoted a paper 12 days ago

From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation

Paper • 2404.15267 • Published 25 days ago • 4

upvoted a collection 12 days ago

🎭 Avatars

Collection

The latest AI-powered technologies usher in a new era of realistic avatars! 🚀 • 33 items • Updated 4 days ago • 49

upvoted 3 papers 15 days ago

Octopus v4: Graph of language models

Paper • 2404.19296 • Published 18 days ago • 89

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

Paper • 2405.01434 • Published 16 days ago • 44

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published 16 days ago • 92

upvoted a collection 22 days ago

llama3-zh

Collection

Portfolio of LLAMA3 fine-tune models • 51 items • Updated 22 days ago • 7

upvoted a paper 24 days ago

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published 25 days ago • 120

upvoted a collection 25 days ago

Phi-3

Collection

Phi-3 family of models • 7 items • Updated 1 day ago • 199

upvoted 3 papers 25 days ago

upvoted an article 29 days ago

Article

Public Policy at Hugging Face

Apr 8

• 16

upvoted 2 papers 29 days ago

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Paper • 2404.12241 • Published 30 days ago • 10

Dynamic Typography: Bringing Words to Life

Paper • 2404.11614 • Published about 1 month ago • 40

upvoted a collection 29 days ago

WizardLM

Collection

0 items • Updated 10 days ago • 95

upvoted an article 29 days ago

Article

Welcome Llama 3 - Meta's new open LLM

about 1 month ago

• 238

upvoted a paper 30 days ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11 • 79

upvoted a collection about 2 months ago

MaLLaM 🌙

Collection

Pretrain from scratch 4096 context length on 90B tokens Malaysian text, https://huggingface.co/papers/2401.13565 • 10 items • Updated 24 days ago • 8

upvoted 9 papers about 2 months ago

LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 62

FlashFace: Human Image Personalization with High-fidelity Identity Preservation

Paper • 2403.17008 • Published Mar 25 • 18

DreamReward: Text-to-3D Generation with Human Preference

Paper • 2403.14613 • Published Mar 21 • 33

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

Paper • 2403.14624 • Published Mar 21 • 50

Mora: Enabling Generalist Video Generation via A Multi-Agent Framework

Paper • 2403.13248 • Published Mar 20 • 71

AnimateDiff-Lightning: Cross-Model Diffusion Distillation

Paper • 2403.12706 • Published Mar 19 • 17

mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding

Paper • 2403.12895 • Published Mar 19 • 27

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20 • 53

VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding

Paper • 2403.11481 • Published Mar 18 • 10

upvoted 5 papers 2 months ago

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

Paper • 2403.09029 • Published Mar 14 • 52

On the Societal Impact of Open Foundation Models

Paper • 2403.07918 • Published Feb 27 • 16

Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU

Paper • 2403.06504 • Published Mar 11 • 52

Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM

Paper • 2403.07487 • Published Mar 12 • 11

DragAnything: Motion Control for Anything using Entity Representation

Paper • 2403.07420 • Published Mar 12 • 11

upvoted a collection 2 months ago

Models Trained on Ultra Series

Collection

The collection of open-source models that adopt Ultra Series datasets for training • 22 items • Updated Mar 12 • 4

upvoted 2 papers 2 months ago

Yi: Open Foundation Models by 01.AI

Paper • 2403.04652 • Published Mar 7 • 58

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6 • 172

upvoted 2 collections 3 months ago

⚓️ Sailor Language Models

Collection

Sailor: Open Language Models tailored for South-East Asia (SEA) released by Sea AI Lab. • 18 items • Updated 2 days ago • 14

OpenCodeInterpreter

Collection

18 items • Updated Mar 3 • 73

upvoted 6 papers 3 months ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29 • 123

EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Paper • 2402.17485 • Published Feb 27 • 182

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27 • 566

Towards Open-ended Visual Quality Comparison

Paper • 2402.16641 • Published Feb 26 • 15

ChatMusician: Understanding and Generating Music Intrinsically with LLM

Paper • 2402.16153 • Published Feb 25 • 54

FuseChat: Knowledge Fusion of Chat Models

Paper • 2402.16107 • Published Feb 25 • 35

upvoted a collection 3 months ago

⛔️🔦 Provenance, Watermarking & Deepfake Detection

Collection

Technical tools for more control over non-consensual synthetic content • 14 items • Updated Apr 1 • 36

upvoted a paper 3 months ago

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

Paper • 2402.14658 • Published Feb 22 • 77

upvoted a collection 3 months ago

Gemma release

Collection

Groups the Gemma models released by the Google team. • 40 items • Updated 4 days ago • 302

upvoted 2 papers 3 months ago

Neural Network Diffusion

Paper • 2402.13144 • Published Feb 20 • 92

The FinBen: An Holistic Financial Benchmark for Large Language Models

Paper • 2402.12659 • Published Feb 20 • 13

upvoted 2 collections 3 months ago

Sora参考论文

Collection

OpenAI "Video generation models as world simulators"技术报告后面的参考论文，总共32篇。OpenAI的ImageGPT和Dalle3这两篇缺失，链接已补充到note中。 • 32 items • Updated Feb 18 • 53

Sora Reference Papers

Collection

A collection of all papers referenced in OpenAI's "Video generation models as world simulators" technical report • openai.com/sora • 30 items • Updated Feb 20 • 50

Adina Yakefu

AI & ML interests

Organizations

AdinaY's activity

PaliGemma – Google's Cutting-Edge Open Vision Language Model

Public Policy at Hugging Face

Welcome Llama 3 - Meta's new open LLM