Collections

Discover the best community collections!

Collections trending this week

Code Llama Family

This collection hosts the transformers repos of the Code Llama release

Collection by

22 days ago

meta-llama/CodeLlama-7b-hf

Text Generation • Updated Mar 14 • 13k • 19
meta-llama/CodeLlama-13b-hf

Text Generation • Updated Mar 14 • 158 • 1
meta-llama/CodeLlama-34b-hf

Text Generation • Updated Mar 14 • 542 • 2
meta-llama/CodeLlama-70b-hf

Text Generation • Updated Mar 14 • 870 • 6

Mantis model family optimized for multi-image reasoning with interleaved text/image format

Collection by

3 days ago

👁 Mantis

Multimodal Language Model • Running on Zero • 13
TIGER-Lab/Mantis-8B-clip-llama3

Updated 6 days ago • 265 • 1
TIGER-Lab/Mantis-8B-siglip-llama3

Updated 6 days ago • 960 • 6
TIGER-Lab/Mantis-8B-Fuyu

Text Generation • Updated 6 days ago • 39 • 2

OpenELM Pretrained Models

Collection by

17 days ago

apple/OpenELM-270M

Text Generation • Updated 9 days ago • 23.8k • 53
apple/OpenELM-450M

Text Generation • Updated 9 days ago • 3.31k • 21
apple/OpenELM-1_1B

Text Generation • Updated 9 days ago • 3.83k • 21
apple/OpenELM-3B

Text Generation • Updated 9 days ago • 3.78k • 100

Portuguese LLM Leaderboard best models ❤️‍🔥

A list, uploaded daily, of the models with the best evaluations on the PT-LLM leaderboard.

Collection by

about 2 hours ago

nicholasKluge/TeenyTinyLlama-460m

Text Generation • Updated Apr 9 • 714 • 5
h2oai/h2o-danube2-1.8b-base

Text Generation • Updated Apr 5 • 5.21k • 35
stabilityai/stablelm-2-zephyr-1_6b

Text Generation • Updated 4 days ago • 63.5k • 164
Qwen/Qwen1.5-4B

Text Generation • Updated Apr 5 • 35.7k • 29

MoEs papers reading list

Collection by

about 7 hours ago

117

Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer

Paper • 1701.06538 • Published Jan 23, 2017 • 4
Sparse Networks from Scratch: Faster Training without Losing Performance

Paper • 1907.04840 • Published Jul 10, 2019 • 3
ZeRO: Memory Optimizations Toward Training Trillion Parameter Models

Paper • 1910.02054 • Published Oct 4, 2019 • 3
A Mixture of h-1 Heads is Better than h Heads

Paper • 2005.06537 • Published May 13, 2020 • 2

RealVisXL (SDXL)

Collection by

Feb 27

SG161222/RealVisXL_V4.0

Text-to-Image • Updated 28 days ago • 79.5k • 46
SG161222/RealVisXL_V4.0_Lightning

Text-to-Image • Updated 28 days ago • 10.4k • 12
SG161222/RealVisXL_V3.0

Text-to-Image • Updated 28 days ago • 91.5k • 60
SG161222/RealVisXL_V3.0_Turbo

Text-to-Image • Updated 28 days ago • 26.1k • 27

yentinglin/Taiwan-LLM-8x7B-DPO

Text Generation • Updated Feb 8 • 1.87k • 15
yentinglin/Taiwan-LLM-13B-v2.0-chat

Text Generation • Updated Mar 11 • 1.78k • 44
Taiwan LLM: Bridging the Linguistic Divide with a Culturally Aligned Language Model

Paper • 2311.17487 • Published Nov 29, 2023 • 2
yentinglin/Taiwan-LLM-13B-v2.0-chat-awq

Text Generation • Updated Jan 12 • 109 • 3

Transformers.js demos

A collection of my favorite WebML demos, built with Transformers.js!

Collection by

2 days ago

🎤 Whisper Web

Running • 696
🖼️ Remove Background Web

In-browser background removal • Running • 362
🖼️ Depth Anything Web

Running • 238
👀 Distil Whisper Web

Running • 203

Small LMs Text Embedding

Contrastively fine-tuned versions of language models with up to 2B parameters, trained using LoRA

Collection by

2 days ago

trapoom555/MiniCPM-2B-Text-Embedding-cft

Sentence Similarity • Updated 2 days ago • 3
trapoom555/Gemma-2B-Text-Embedding-cft

Sentence Similarity • Updated 2 days ago • 3
trapoom555/Phi-2-Text-Embedding-cft

Sentence Similarity • Updated 2 days ago • 3
