-
Notifications
You must be signed in to change notification settings - Fork 9.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
使用lm-eval工具对开源数据的10%进行测试对比,这个性能结果正常嘛 #281
Comments
有木有类似测试结果的 |
|
when I use https://tiger-ai-lab.github.io/CritiqueFineTuning/
However, there is still a big gap with the results shown in the paper. What should I do, thank you. |
set the max length to 16384, will be okay. math-500 to 86.8%, time 24 to 50%, and gpqa to 47.47%. |
The text was updated successfully, but these errors were encountered: