arxiv:2404.00399
Jordan Clive
jordiclive
AI & ML interests
NLG, Multi-task learning, Parameter efficiency, Retrieval-enhanced Transformer
Organizations
models
18
jordiclive/flan-t5-3b-summarizer
Summarization
•
Updated
•
341
•
30
jordiclive/Llama-2-70b-oasst-1-200
Text Generation
•
Updated
•
3.18k
•
2
jordiclive/Llama-2-70b-hf-sp
Text Generation
•
Updated
•
4
jordiclive/scaled-llama-7b-lora-16k-rp2
Text Generation
•
Updated
•
6
jordiclive/falcon-40b-lora-sft-stage2-1.1k
Text Generation
•
Updated
•
3
jordiclive/falcon-40b-lora-sft-1.1k
Updated
jordiclive/falcon_lora_40b_ckpt_18000_pretrain
Updated
jordiclive/Lora-llama-65B-pre-train-oasst
Text Generation
•
Updated
jordiclive/falcon_lora_40b_ckpt_500_oasst_1
Text Generation
•
Updated
•
1
jordiclive/lora-llama-33B-alpaca_gpt4-dolly_15k-vicuna-r64
Text Generation
•
Updated