Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard)
Open LLM Leaderboard
community
AI & ML interests
Evaluating open LLMs
Organization Card
Open LLM Leaderboard
This is the hub organisation maintaining the Open LLM Leaderboard.
In this space you will find the datasets with detailed results and queries for the models on the leaderboard.
Score results are in open-llm-leaderboard/results, and the current state of requests is in open-llm-leaderboard/requests. For detailed predictions, look for your model name in the datasets below!
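The per-model dataset names listed on this page appear to follow the pattern `details_<owner>__<model>` under the `open-llm-leaderboard` organisation. The sketch below builds that name from a model ID so you can search for it. The pattern is inferred from the names shown here and is not a documented guarantee; the helper name `details_repo_id` is hypothetical.

```python
# Assumption: details datasets are named "details_<owner>__<model>",
# as inferred from the dataset names listed on this page.
ORG = "open-llm-leaderboard"


def details_repo_id(model_id: str) -> str:
    """Map a model ID of the form 'owner/name' to its presumed details dataset ID."""
    owner, sep, name = model_id.partition("/")
    if not sep or not owner or not name:
        raise ValueError(f"expected 'owner/name', got {model_id!r}")
    # The owner and model name are joined with a double underscore.
    return f"{ORG}/details_{owner}__{name}"


if __name__ == "__main__":
    print(details_repo_id("gradientai/Llama-3-8B-Instruct-262k"))
    # prints: open-llm-leaderboard/details_gradientai__Llama-3-8B-Instruct-262k
```

The resulting string can be pasted into the Hub search box or passed to a dataset loader. Whether a given model actually has a details dataset depends on whether it has been evaluated.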
Collections (3)
A daily updated list of the models with the best evaluations on the LLM leaderboard:
- chargoddard/Yi-34B-Llama • Text Generation • 3.4k • 56
- yunconglong/Truthful_DPO_TomGrc_FusionNet_7Bx2_MoE_13B • Text Generation • 3.43k • 51
- fblgit/UNA-SimpleSmaug-34b-v1beta • Text Generation • 2.18k • 19
- cloudyu/TomGrc_FusionNet_34Bx2_MoE_v0.1_DPO_f16 • Text Generation • 2.43k • 13
models
None public yet
datasets (6399)
open-llm-leaderboard/requests • 3 • 20
open-llm-leaderboard/dynamic_model_information • 4 • 5
open-llm-leaderboard/details_shyamieee__JARVIS-v3.0
open-llm-leaderboard/results • 88 • 48
open-llm-leaderboard/details_gradientai__Llama-3-8B-Instruct-Gradient-1048k • 3
open-llm-leaderboard/details_gradientai__Llama-3-8B-Instruct-262k • 4
open-llm-leaderboard/details_freewheelin__free-evo-qwen72b-v0.8
open-llm-leaderboard/details_NeverSleep__Llama-3-Lumimaid-70B-v0.1
open-llm-leaderboard/details_wannaphong__numfalm-3b
open-llm-leaderboard/details_abhishek__autotrain-llama3-70b-orpo-v1