Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models Mar 20 • 17
TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Paper • 2305.07759 • Published May 12, 2023 • 28 • 7
Becoming self-instruct: introducing early stopping criteria for minimal instruct tuning Paper • 2307.03692 • Published Jul 5, 2023 • 24 • 3
Generative AI for Synthetic Data Generation: Methods, Challenges and the Future Paper • 2403.04190 • Published Mar 7 • 2
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published 15 days ago • 92 • 11
ALoRA: Allocating Low-Rank Adaptation for Fine-tuning Large Language Models Paper • 2403.16187 • Published Mar 24 • 2
CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model Paper • 2403.08350 • Published Mar 13 • 2
Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model Paper • 2404.10306 • Published Apr 16 • 2
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation Paper • 2403.11808 • Published Mar 18 • 3
PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation Paper • 2403.09192 • Published Mar 14 • 2
Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey Paper • 2403.14608 • Published Mar 21 • 2
MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA based Mixture of Experts Paper • 2404.15159 • Published 26 days ago • 2
BAdam: A Memory Efficient Full Parameter Training Method for Large Language Models Paper • 2404.02827 • Published Apr 3 • 2
Enhancing Pre-Trained Generative Language Models with Question Attended Span Extraction on Machine Reading Comprehension Paper • 2404.17991 • Published 20 days ago • 2
GeMQuAD : Generating Multilingual Question Answering Datasets from Large Language Models using Few Shot Learning Paper • 2404.09163 • Published Apr 14 • 2
IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages Paper • 2404.16816 • Published 22 days ago • 1 • 2
Optimizing Language Model's Reasoning Abilities with Weak Supervision Paper • 2405.04086 • Published 11 days ago • 1 • 3