You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
dataset version metric mode opencompass.models.huggingface.HuggingFace_models_Yi-1.5-9B
---
math 5f997e accuracy gen 28.3
openai_humaneval 8e312c humaneval_pass@1 gen 25.61
humaneval_plus 8e312c humaneval_plus_pass@1 gen 21.34
mbpp 3ede66 score gen 58.6
mbpp 3ede66 pass gen 293
mbpp 3ede66 timeout gen 4
mbpp 3ede66 failed gen 24
mbpp 3ede66 wrong_answer gen 179
The text was updated successfully, but these errors were encountered:
我使用opencompass对Yi-1.5-9B在MATH(4 shot),HumanEval/HumanEval plus(0 shot),MBPP(3 shot)的测试集上进行评估。评估的结果和官方提供的指标有一定差距,能否提供一下官方的评测脚本或者详细参数以便复现指标?
下面是我的评测脚本和结果
The text was updated successfully, but these errors were encountered: