assert all((~torch.isinf(scores.view(-1))) & (~torch.isnan(scores.view(-1)))) [rank0]: AssertionError #287
Comments
If I change it to
there is no problem.
When selection_value is 0, next_state_value is NaN. Is that a bug?
Thanks, man, for helping me solve this problem. I thought Qwen simply couldn't be run, which was a real headache.
It's a little bit weird. Could you add print(all((~torch.isinf(current_logits.view(-1))) & (~torch.isnan(current_logits.view(-1))))) after line 61? If the output is True, you have probably set some values in the logits to 'inf' or '-inf'.
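To narrow down where the bad values sit, it can help to list the offending positions instead of only asserting that none exist. The sketch below uses plain Python floats and a hypothetical helper named `report_non_finite`; the values in `scores` are made up for illustration and are not taken from this issue. On tensors, `torch.isfinite(scores).all()` expresses the same condition as the assertion in the title.

```python
import math

def report_non_finite(values):
    """Return (index, value) pairs for entries that are NaN or +/-inf."""
    return [(i, v) for i, v in enumerate(values) if not math.isfinite(v)]

# Hypothetical scores, chosen only to illustrate the check.
scores = [0.5, float("-inf"), 1.25, float("nan")]

bad = report_non_finite(scores)
all_finite = not bad                 # same condition the assertion tests
bad_indices = [i for i, _ in bad]    # positions worth inspecting

print(all_finite, bad_indices)
```

Running the same report right after the logits are produced and again just before the assertion shows whether the non-finite values come from the model or from later processing.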
@liuchen6667 This is wrong. My solution only limits the output to a single bit, which is not the correct approach. As @t1101675 said, it may be that the Qwen output is empty or that the terminator has been changed to inf. It may be a model problem, or the code may be incompatible. If more people run into this problem, the code is most likely incompatible.
Indeed, the original paper doesn't mention qwen2.5 either. It feels like the code simply hasn't been adapted for it, which is tough.
@t1101675 Hi, can you adapt the code for the Qwen model?
I tried printing mask, selection_value, and next_state_value. Then
the selection_value is
the next_state_value is
and the next_state_value was masked by mask.
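One thing worth checking here is how the mask is applied. This is only a sketch with plain Python floats and hypothetical helper names (`mask_by_multiply`, `mask_by_select`); it is not the repository's code, and the values are invented. Under IEEE 754 arithmetic, `0 * nan` and `0 * inf` are both NaN, so multiplying by a 0/1 mask does not clear a non-finite entry, whereas selecting a fill value does.

```python
import math

def mask_by_multiply(values, mask):
    # 0 * nan and 0 * inf are both nan, so masked slots stay poisoned.
    return [v * m for v, m in zip(values, mask)]

def mask_by_select(values, mask, fill=0.0):
    # Pick the fill value outright; the masked entry never enters arithmetic.
    return [v if m else fill for v, m in zip(values, mask)]

# Hypothetical values, chosen only to illustrate the difference.
next_state_value = [1.5, float("nan"), float("-inf")]
mask = [1, 0, 0]

multiplied = mask_by_multiply(next_state_value, mask)
selected = mask_by_select(next_state_value, mask)

print(multiplied)
print(selected)
```

On tensors, `torch.where(mask.bool(), next_state_value, fill)` is the selection-style counterpart, assuming `mask` is a 0/1 tensor of the same shape.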
I added this, and it solved the problem. @liuchen6667
Is this really reliable, man? I think I'd rather start over on my own; this code is awfully complicated.
I'm not certain if this solution works as intended. I suspect that current_logits may contain NaN, Inf, or extremely large values, which could cause next_state_value to become NaN after applying torch.logsumexp. I'd be happy to take a closer look if you could provide more details about the configurations you're using, such as the model, tokenization method, etc.
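The suspicion above can be illustrated with a small stand-in for log-sum-exp over plain Python floats. This is a sketch, not `torch.logsumexp` itself, and its handling of infinities is an assumption made for the example; the inputs are invented. It shows that a single NaN logit makes the result NaN, and that a `+inf` logit gives `+inf`, which turns into NaN as soon as it is subtracted from another `+inf`.

```python
import math

def logsumexp(xs):
    """Max-shifted log-sum-exp over plain floats (illustrative stand-in)."""
    if any(math.isnan(x) for x in xs):
        return float("nan")          # NaN in any input propagates to the result
    m = max(xs)
    if math.isinf(m):
        return m                     # +inf dominates; all -inf stays -inf
    # m is finite here, so every exponent is <= 0 and exp cannot overflow.
    total = sum(math.exp(x - m) for x in xs)
    return m + math.log(total)

clean = logsumexp([0.0, 1.0, 2.0])            # finite logits, finite result
with_nan = logsumexp([0.0, float("nan")])     # one NaN logit
with_inf = logsumexp([0.0, float("inf")])     # one +inf logit
downstream = with_inf - with_inf              # inf - inf in later arithmetic

print(clean, with_nan, with_inf, downstream)
```

Printing the minimum and maximum of current_logits before the reduction is a cheap way to tell these cases apart.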
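@liuchen6667 How is it going? Honestly, I really don't know whether it works well or not, but the code is quite hard to get running. I don't know whether anyone has managed to run it; it has been a bit of a waste of time.
The original paper doesn't mention Qwen either. My guess is that it's a compatibility issue; I'd suggest not dwelling on it.
Does my result match expectations? tot_loss is unstable and is sometimes even negative.
That doesn't look as expected; tot_loss is unlikely to be negative.
@t1101675 Could I add you on WeChat to discuss this? Mine is: junge1300780479
@liuchen6667 After switching to their new code, this problem seems to be gone. I didn't add lm data, only prompt data.
@liuchen6667 We could also connect on WeChat to talk. Mine is: junge1300780479
Error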