GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection • Paper • 2403.03507 • Published Mar 6 • 176
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning • Paper • 2405.12130 • Published 13 days ago • 41
Chameleon: Mixed-Modal Early-Fusion Foundation Models • Paper • 2405.09818 • Published 18 days ago • 96