-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use perf_analyzer without GPU/CUDA #1
Comments
Hi, can you clarify your setup? |
I tried perf_analyzer in both
The triton server and perf_analyzer are run on separate machines. The perf_analyzer one doesn't have GPU/CUDA installed. |
@matthewkotila i thought we removed this requirement? |
* first pass. Hardcoded values * Working for concurrency (hardcoded whenever count windows is used for now) * working for req rate as well * Add CLI. Add/fix unit tests * Remove hack. Restore all normal functionality * Refactor thread config into one class. Add more testing * Rename arg to request-count * Fix request rate bug * Update info print * fix corner case * move fixme to a story tag * add assert to avoid corner case * rename variables * self review #1 * copyright changes * add doxygen to functions * Don't allow sweeping over multiple concurrency or request rate with request-count
* first pass. Hardcoded values * Working for concurrency (hardcoded whenever count windows is used for now) * working for req rate as well * Add CLI. Add/fix unit tests * Remove hack. Restore all normal functionality * Refactor thread config into one class. Add more testing * Rename arg to request-count * Fix request rate bug * Update info print * fix corner case * move fixme to a story tag * add assert to avoid corner case * rename variables * self review #1 * copyright changes * add doxygen to functions * Don't allow sweeping over multiple concurrency or request rate with request-count
Is your feature request related to a problem? Please describe.
We are building a perf analyzer workflow on CI, and we plan to use perf_analyzer as client. The client worker won't have GPU/CUDA. But I tried both
pip install tritonclient
and the NGC image, both failed by complaining there is no cuda/gpu.Describe the solution you'd like
I don't get why perf_analyzer requires GPU. It should be able to run perfectly on machines that has no GPU.
The text was updated successfully, but these errors were encountered: