allenai's Collections
Reward Bench
Updated Mar 20
Datasets, spaces, and models for the reward model benchmark!
allenai/reward-bench • Updated 1 day ago • 2.79k • 38
Reward Bench Leaderboard 📐 • Running • 101
allenai/preference-test-sets • Updated Mar 14 • 517 • 16
allenai/reward-bench-results • Updated 6 days ago • 2 • 2