- Attention Is All You Need
  Paper • 1706.03762 • Published • 36
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
  Paper • 1810.04805 • Published • 11
- Universal Language Model Fine-tuning for Text Classification
  Paper • 1801.06146 • Published • 6
- Language Models are Few-Shot Learners
  Paper • 2005.14165 • Published • 10
Effi
itseffi
Collections
4
- meta-llama/Llama-2-70b-hf
  Text Generation • Updated • 393k • 807
- tiiuae/falcon-180B
  Text Generation • Updated • 8.27k • 1.1k
- meta-llama/Meta-Llama-3-8B-Instruct
  Text Generation • Updated • 2.55M • 2.44k
- meta-llama/Meta-Llama-3-70B-Instruct
  Text Generation • Updated • 454k • 1.08k