arxiv:2309.00071
Jeffrey Quesnelle PRO
emozilla
AI & ML interests
None yet
Organizations
Papers
1
models
60
emozilla/llama2-1.2b-init
Text Generation
•
Updated
•
85
emozilla/llama3-1.6b-init
Text Generation
•
Updated
•
26
emozilla/llama3-1.3b-gptneox-init
Text Generation
•
Updated
•
407
emozilla/8B_128K_bs_8M_rope_512K_step_1000_lr_2e-5
Text Generation
•
Updated
•
1
emozilla/llama-1.1b-init
Text Generation
•
Updated
•
84
emozilla/LWM-Text-1M-mpe64k
Text Generation
•
Updated
•
4
emozilla/LWM-Text-1M-mpe32k
Text Generation
•
Updated
•
56
emozilla/LWM-Text-1M-mpe4k
Text Generation
•
Updated
•
3
emozilla/LWM-Text-1M-GGUF
Updated
•
205
emozilla/bt3
Text Generation
•
Updated
•
5
•
1
datasets
34
emozilla/PaulGrahamEssays
Viewer
•
Updated
emozilla/c4-validation.00000-of-00008
Viewer
•
Updated
•
11
emozilla/hermes2-tokenized-llama-alpaca
Viewer
•
Updated
emozilla/yarn-train-tokenized-8k-mistral
Viewer
•
Updated
•
2
emozilla/story-summary-training-mistral-9k-1_4_24
Viewer
•
Updated
•
2
emozilla/yarn-train-tokenized-8k-llama
Viewer
•
Updated
•
570
emozilla/yarn-train-tokenized-32k-mistral
Viewer
•
Updated
•
1
emozilla/yarn-train-tokenized-16k-mistral
Viewer
•
Updated
•
52
•
13
emozilla/pg19
Viewer
•
Updated
•
253
•
9
emozilla/Long-Data-Collections-Fine-Tune
Viewer
•
Updated
•
2