Nathan Lambert
natolambert
AI & ML interests
Reinforcement learning, Ethics, Robotics, Dynamics Models
natolambert's activity
allenai/OLMo-1.7-7B • Text Generation • 368 • 37
Qwen/CodeQwen1.5-7B • Text Generation • 8.44k • 54
young-geng/koala • 75
HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1 • Text Generation • 3.04k • 230
mrfakename/mixtral-8x22b • 9
mightbe/Better-PairRM • 266 • 10
google/recurrentgemma-2b • Text Generation • 9.23k • 87
mistral-community/Mistral-7B-v0.2 • Text Generation • 51.1k • 221
Nexusflow/Starling-RM-34B • 2.72k • 69
weqweasdas/RM-Gemma-7B • Text Classification • 130 • 6
Ray2333/reward-model-Mistral-7B-instruct-Unified-Feedback • Text Classification • 1.59k • 10
HuggingFaceH4/starchat2-15b-v0.1 • Text Generation • 5.28k • 88
Nexusflow/Starling-LM-7B-beta • Text Generation • 20.8k • 314