Open-LLM-Leaderboard:
From Multi-choice to Openstyle Questions for LLMs Evaluation, Benchmark, and Arena
Aidar Myrzakhan
*
,
Sondos Mahmoud Bsharat
*
,
Zhiqiang Shen
*
*
joint first author & equal contribution
VILA Lab
,
Mohamed bin Zayed University of AI (MBZUAI)
Paper
Github
Hugging Face
Home
Open-LLM-Leaderboard
OSQ-Benchmark
Open-LLM-Leaderboard
Small Scale
Large Scale
Model
Average
MMLU
WinoGrande
PIQA
CommonsenseQA
Race
MedMCQA
OpenkookQA