Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

paddlepaddle分布式docker部署命令行错误 #5622

Open
Yiheng-Liu opened this issue Feb 22, 2023 · 0 comments
Open

paddlepaddle分布式docker部署命令行错误 #5622

Yiheng-Liu opened this issue Feb 22, 2023 · 0 comments

Comments

@Yiheng-Liu
Copy link

Yiheng-Liu commented Feb 22, 2023

文档链接&描述 Document Links & Description

文档链接https://www.paddlepaddle.org.cn/documentation/docs/zh/guides/06_distributed_training/deployment_cn.html
错误位置:裸机及Docker化部署-paddle环境安装-docker安装命令
错误原因:docker --host命令部分版本不支持,本机20.10.12版本测试不通过,报错

unknown flag: --host

同时,由于没有未挂载/dev/shm会导致多卡无法正常运行,只能正常运行一卡。
更多建议:nvidia-docker2貌似已经弃用nvidia-docker命令,改为--runtime方式,可以在命令行中进行体现,同时,在启动docker时将/dev/shm进行挂载。

请提出你的建议/提个PR修复这个问题 Please give your suggestion/update a PR to fix it

命令行代码可以修改为:

docker run --runtime=nvidia --gpus all --name paddle -it --network host -v $PWD:/paddle -v /dev/shm:/dev/shm registry.baidubce.com/paddlepaddle/paddle:2.2.2-gpu-cuda11.2-cudnn8 /bin/bash
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant