We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
我用的是微调后的llama模型,请问,当开启block_attn后,模型的输出总会打满到最大长度,导致额外的输出或者乱码,这个应该如何解决。
The text was updated successfully, but these errors were encountered:
This issue is stale because it has been open for 60 days with no activity. 当前issue 60天内无活动,被标记为stale。
Sorry, something went wrong.
This issue was closed because it has been inactive for 14 days since being marked as stale. 当前issue 被标记为stale已有14天,即将关闭。
DesmonDay
No branches or pull requests
请提出你的问题
我用的是微调后的llama模型,请问,当开启block_attn后,模型的输出总会打满到最大长度,导致额外的输出或者乱码,这个应该如何解决。
The text was updated successfully, but these errors were encountered: