-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
推理速度不科学啊 #4
Comments
使用的是GPU版本吗?目前NCNN的GPU推理效率并不很高 |
cpu的,没有开vulkan的。另外就是精度问题,用我的相同的模型推理出的句子长度比onnx要短30%,发音也有问题,我看是在dp后计算帧数就明显少了 |
是不是模型是fp16的原因? |
有可能,试试int8的 |
支持中文的代码呀 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
我集成到自己的工程中,同样的模型单线程linux下速度竟然只有onnx推理的1/8,不知为何
The text was updated successfully, but these errors were encountered: