Adding Evaluation Results

#8
Files changed (1): README.md (+14 −0)

README.md CHANGED
@@ -274,3 +274,17 @@ Our code and checkpoints are open to research purpose, and they are allowed for
 
  If you are interested to leave a message to either our research team or product team, join our Discord or WeChat groups! Also, feel free to send an email to [email protected].
 
+
+# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Qwen__Qwen-14B)
+
+| Metric                | Value |
+|-----------------------|-------|
+| Avg.                  | 60.07 |
+| ARC (25-shot)         | 58.28 |
+| HellaSwag (10-shot)   | 83.99 |
+| MMLU (5-shot)         | 67.7  |
+| TruthfulQA (0-shot)   | 49.43 |
+| Winogrande (5-shot)   | 76.8  |
+| GSM8K (5-shot)        | 58.98 |
+| DROP (3-shot)         | 25.31 |