PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 11 items • Updated 2 days ago • 88
Compressed LLMs for nm-vllm Collection LLMs compressed using SparseGPT and GPTQ for optimized inference with nm-vllm https://github.com/neuralmagic/nm-vllm • 17 items • Updated 9 days ago • 7
Sparse Foundational Llama 2 Models Collection Sparse pre-trained and fine-tuned Llama models made by Neural Magic + Cerebras • 27 items • Updated 1 day ago • 5
Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment Paper • 2405.03594 • Published 13 days ago • 6
Article Multimodal Augmentation for Documents: Recovering “Comprehension” in “Reading and Comprehension” task By danaaubakirova • 3 days ago • 15
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper • 2405.00732 • Published 20 days ago • 107
Article Adapt custom AI models to the trainer API and to 🤗 By not-lain • 5 days ago • 14
🤖 LLM Spaces Collection A collection of applications demonstrating large language models (LLMs) 🚀 • 13 items • Updated 13 days ago • 6
🔊 Speech Enhancement Collection Unlocking a new era in Speech Enhancement, powered by the latest AI technologies, for superior audio quality improvements! 🚀 • 8 items • Updated 18 days ago • 7
🖼️ Image Enhancement Collection Embrace the future of Image Enhancement with the latest AI-powered technologies! 🚀 • 1 item • Updated 18 days ago • 5
🤔 Facial Expressions Recognition Collection Embrace the future of Facial Expressions Recognition with the latest AI-powered technologies! 🚀 • 4 items • Updated 7 days ago • 6
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context Paper • 1901.02860 • Published Jan 9, 2019 • 2
Chatbot is Not All You Need: Information-rich Prompting for More Realistic Responses Paper • 2312.16233 • Published Dec 25, 2023 • 2
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12 • 89
Article SeeMoE: Implementing a MoE Vision Language Model from Scratch By AviSoori1x • 13 days ago • 24
Granite Code Models Collection A series of code models trained by IBM and licensed under the Apache 2.0 license. We release both the base pretrained and instruct models. • 10 items • Updated 7 days ago • 118
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published 17 days ago • 92
EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars Paper • 2404.19110 • Published 19 days ago • 3
🎭 Avatars Collection The latest AI-powered technologies usher in a new era of realistic avatars! 🚀 • 33 items • Updated 5 days ago • 49
Article 🧑⚖️ "Replacing Judges with Juries" using distilabel By alvarobartt • 16 days ago • 14
Article A Guide to Designing New Functional Proteins and Improving Protein Function, Stability, and Diversity with Generative AI By AmelieSchreiber • 5 days ago • 15
Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints • 18 days ago • 50
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation Paper • 2404.19427 • Published 19 days ago • 64
GreenBitAI MLX LLM Collection GreenBitAI's Low-bit LLMs in MLX format • 69 items • Updated 12 days ago • 4
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper • 2404.18796 • Published 20 days ago • 62
AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation Paper • 2404.12753 • Published 30 days ago • 38
Article Expanding Model Context and Creating Chat Models with a Single Click By maywell • 21 days ago • 29
Article ⚗️ 🧑🏼🌾 Let's grow some Domain Specific Datasets together By burtenshaw • 20 days ago • 25
PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning Paper • 2404.16994 • Published 24 days ago • 30
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites Paper • 2404.16821 • Published 24 days ago • 48
PuLID: Pure and Lightning ID Customization via Contrastive Alignment Paper • 2404.16022 • Published 25 days ago • 16
CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data Paper • 2404.15653 • Published 25 days ago • 24
LLaVA++ (LLaMA-3 and Phi-3-Mini) Collection Extending Visual Capabilities of LLaVA with LLaMA-3 and Phi-3 • 11 items • Updated 19 days ago • 21
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated about 1 month ago • 523
How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study Paper • 2404.14047 • Published 27 days ago • 37
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework Paper • 2404.14619 • Published 26 days ago • 120
Article LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!) By wolfram • 25 days ago • 38
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published 27 days ago • 230
Article seemore: Implement a Vision Language Model from Scratch By AviSoori1x • 7 days ago • 41