We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
我在模拟4机32卡运行GPT7B的情况,使用相同命令,在intel平台(CPU型号 8358)最多只能占用不到40个核心,而在amd平台(CPU型号7543 )可以完全占满,请问这种情况正常吗?
The text was updated successfully, but these errors were encountered:
可以提供一下你运行命令参数?CPU的利用率一般和仿真时指定的线程数量相关
Sorry, something went wrong.
hi ,通过nproc指定的线程数,两个平台命令完全相同
nproc
AS_SEND_LAT=3 AS_NVLS_ENABLE=1 ./bin/SimAI_simulator -t nproc -w ./gpt_7B-world_size32-tp4-pp1-ep1-gbs8192-mbs1-seq4096-MOE-False-GEMM-False-flash_attn-True.txt -n ./HPN_7_0_32_gpus_8_in_one_server_with_single_plane_100Gbps_A100 -c astra-sim-alibabacloud/inputs/config/SimAI.conf | tee gpt7b.log
No branches or pull requests
我在模拟4机32卡运行GPT7B的情况,使用相同命令,在intel平台(CPU型号 8358)最多只能占用不到40个核心,而在amd平台(CPU型号7543 )可以完全占满,请问这种情况正常吗?
The text was updated successfully, but these errors were encountered: