This is a Llama 2 architecture model series trained on the FineWeb dataset. This is ~500 Million model and uses tiktoken cl100k_base model as tokenizer
- Downloads last month
- 0
This is a Llama 2 architecture model series trained on the FineWeb dataset. This is ~500 Million model and uses tiktoken cl100k_base model as tokenizer