Infrastructure - a zzfive Collection

Infrastructure

updated 2 days ago

CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data
Paper • 2404.15653 • Published Apr 24 • 24

MoDE: CLIP Data Experts via Clustering
Paper • 2404.16030 • Published Apr 24 • 11

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning
Paper • 2405.12130 • Published 14 days ago • 42

Reducing Transformer Key-Value Cache Size with Cross-Layer Attention
Paper • 2405.12981 • Published 13 days ago • 23

LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models
Paper • 2405.14477 • Published 11 days ago • 15

Thermodynamic Natural Gradient Descent
Paper • 2405.13817 • Published 12 days ago • 13

Tele-Aloha: A Low-budget and High-authenticity Telepresence System Using Sparse RGB Cameras
Paper • 2405.14866 • Published 11 days ago • 5

Transformers Can Do Arithmetic with the Right Embeddings
Paper • 2405.17399 • Published 7 days ago • 47

2BP: 2-Stage Backpropagation
Paper • 2405.18047 • Published 6 days ago • 19

VeLoRA: Memory Efficient Training using Rank-1 Sub-Token Projections
Paper • 2405.17991 • Published 6 days ago • 9

Jina CLIP: Your CLIP Model Is Also Your Text Retriever
Paper • 2405.20204 • Published 4 days ago • 19