BitsFusion: 1.99 bits Weight Quantization of Diffusion Model Paper • 2406.04333 • Published 4 days ago • 27
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions Paper • 2406.04325 • Published 4 days ago • 56
Jina CLIP: Your CLIP Model Is Also Your Text Retriever Paper • 2405.20204 • Published 11 days ago • 26
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality Paper • 2405.21060 • Published 10 days ago • 57
Article Introducing NPC-Playground, a 3D playground to interact with LLM-powered NPCs 5 days ago • 12
sentiment-analysis-advances Collection This collection lists studies aimed at advancing granular sentiment analysis in mass-media news • 6 items • Updated 8 days ago • 1
AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct Paper • 2405.14906 • Published 18 days ago • 21
C4AI Aya 23 Collection Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. • 3 items • Updated 18 days ago • 34
ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models Paper • 2405.15738 • Published 17 days ago • 42
FIFO-Diffusion: Generating Infinite Videos from Text without Training Paper • 2405.11473 • Published 22 days ago • 53
Chameleon: Mixed-Modal Early-Fusion Foundation Models Paper • 2405.09818 • Published 25 days ago • 101
A decoder-only foundation model for time-series forecasting Paper • 2310.10688 • Published Oct 14, 2023 • 4
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 11 items • Updated 24 days ago • 110
Compressed LLMs for nm-vllm Collection LLMs compressed using SparseGPT and GPTQ for optimized inference with nm-vllm https://github.com/neuralmagic/nm-vllm • 18 items • Updated 18 days ago • 9
Sparse Foundational Llama 2 Models Collection Sparse pre-trained and fine-tuned Llama models made by Neural Magic + Cerebras • 27 items • Updated 24 days ago • 7
Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment Paper • 2405.03594 • Published May 6 • 7
Article Multimodal Augmentation for Documents: Recovering “Comprehension” in “Reading and Comprehension” task By danaaubakirova • 25 days ago • 15
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper • 2405.00732 • Published Apr 29 • 115
Article Train custom AI models with the trainer API and adapt them to 🤗 By not-lain • 8 days ago • 24
🤖 LLM Spaces Collection A collection of applications demonstrating large language models (LLMs) 🚀 • 17 items • Updated 11 days ago • 6
🔊 Speech Enhancement Collection Unlocking a new era in Speech Enhancement, powered by the latest AI technologies, for superior audio quality improvements! 🚀 • 8 items • Updated May 1 • 7
🖼️ Image Enhancement Collection Embrace the future of Image Enhancement with the latest AI-powered technologies! 🚀 • 1 item • Updated May 1 • 5
🤔 Facial Expressions Recognition Collection Embrace the future of Facial Expressions Recognition with the latest AI-powered technologies! 🚀 • 4 items • Updated 29 days ago • 6
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context Paper • 1901.02860 • Published Jan 9, 2019 • 2
Chatbot is Not All You Need: Information-rich Prompting for More Realistic Responses Paper • 2312.16233 • Published Dec 25, 2023 • 2
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12 • 94
Article SeeMoE: Implementing a MoE Vision Language Model from Scratch By AviSoori1x • May 6 • 26
Granite Code Models Collection A series of code models trained by IBM and licensed under the Apache 2.0 license. We release both the base pretrained and instruct models. • 18 items • Updated 10 days ago • 142
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published May 2 • 104
EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars Paper • 2404.19110 • Published Apr 29 • 3
🎭 Avatars Collection The latest AI-powered technologies usher in a new era of realistic avatars! 🚀 • 41 items • Updated 6 days ago • 51
Article 🧑‍⚖️ "Replacing Judges with Juries" using distilabel By alvarobartt • May 3 • 14
Article A Guide to Designing New Functional Proteins and Improving Protein Function, Stability, and Diversity with Generative AI By AmelieSchreiber • 27 days ago • 21
Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints May 1 • 54
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation Paper • 2404.19427 • Published Apr 30 • 68