makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch By AviSoori1x • 1 day ago • 11
Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia? By davanstrien • 1 day ago • 4
Evalverse: Revolutionizing Large Language Model Evaluation with a Unified, User-Friendly Framework By Yescia • 1 day ago
A Guide to Designing New Functional Proteins and Improving Protein Function, Stability, and Diversity with Generative AI By AmelieSchreiber • 7 days ago • 12
Token Merging for fast LLM inference : Background and first trials with Mistral By samchain • 8 days ago • 1
Estimating Memory Consumption of LLMs for Inference and Fine-Tuning for Cohere Command-R+ By Andyrasika • 12 days ago • 4
Post-OCR-Correction: 1 billion words dataset of automated OCR correction by LLM By Pclanglais • 13 days ago • 9