Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

训练不起来(24G*8张卡) #106

Open
EddieEduardo opened this issue Aug 16, 2024 · 4 comments
Open

训练不起来(24G*8张卡) #106

EddieEduardo opened this issue Aug 16, 2024 · 4 comments

Comments

@EddieEduardo
Copy link

你好,感谢分享代码!

尝试跑一下,每次都是在这就退出了,这个是内存不够的原因吗?
f1db02b2f3a0f2f9c888acd39214041

有什么解决办法吗?

感谢!!!

@niuniuBUAA
Copy link

我猜是计算机内存的问题,你不要8张卡一起用,因为源码是先加载在cpu再搬到gpu,弄两张内存应该够

@EddieEduardo
Copy link
Author

感谢,work了!!!

@EddieEduardo
Copy link
Author

我猜是计算机内存的问题,你不要8张卡一起用,因为源码是先加载在cpu再搬到gpu,弄两张内存应该够

感谢回复,请问这个epoch数在哪里修改呢?

@ice025atline
Copy link

請問是什麼原理??8張不要一起用反而能行?在哪修改的?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants