LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper β’ 2405.00732 β’ Published 21 days ago β’ 109
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers β’ 49 items β’ Updated 5 days ago β’ 10
view article Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints 19 days ago β’ 51
Phi-3 Collection Phi-3 family of small language models. Language models are available in short- and long-context lengths. β’ 7 items β’ Updated about 20 hours ago β’ 203
view article Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent 28 days ago β’ 71
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper β’ 2404.14219 β’ Published 28 days ago β’ 230
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases β’ 5 items β’ Updated Apr 18 β’ 525
HF-curated models available on Workers AI Collection A collection of models curated with Hugging Face that can be run on Cloudflare's Workers AI serverless inference platform. β’ 15 items β’ Updated Apr 2 β’ 48
Chronos Models Collection Chronos: Pretrained (language) models for time series forecasting based on the T5 architecture. β’ 6 items β’ Updated Mar 18 β’ 25
Leaderboards and benchmarks β¨ Collection Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... β’ 61 items β’ Updated 6 days ago β’ 59
OLMo Suite Collection Artifacts for the first set of OLMo models. β’ 12 items β’ Updated 5 days ago β’ 35
Gemma release Collection Groups the Gemma models released by the Google team. β’ 40 items β’ Updated 6 days ago β’ 304
Code Models Collection Models for generating and analyzing code β’ 46 items β’ Updated 11 days ago β’ 1
PIXART-Ξ΄: Fast and Controllable Image Generation with Latent Consistency Models Paper β’ 2401.05252 β’ Published Jan 10 β’ 43
Nemotron 3 8B Collection The Nemotron 3 8B Family of models is optimized for building production-ready generative AI applications for the enterprise. β’ 5 items β’ Updated Feb 19 β’ 37
LLM Leaderboard best models β€οΈβπ₯ Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: β’ 70 items β’ Updated 4 days ago β’ 307
Geospatial Models Collection Geospatial Models on the Hub. If you want to submit more items to this collection, please request to join the geospatial organisation. β’ 4 items β’ Updated Sep 8, 2023 β’ 6
Llama 2: Open Foundation and Fine-Tuned Chat Models Paper β’ 2307.09288 β’ Published Jul 18, 2023 β’ 235