Replies: 1 comment
Sorry, it doesn't.
Hi,
Does the lmdeploy/turbomind engine support any way to profile the memory usage of each part of the model?
For example, can I see how much memory the model weights occupy compared with the activations and the KV cache?
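Without a built-in profiler, a rough split can still be estimated by hand. The snippet below is only a back-of-the-envelope sketch, assuming a standard decoder-only transformer with 2-byte (fp16/bf16) weights and KV cache. The function names and the example model dimensions are hypothetical and are not part of lmdeploy. Activations are not modeled; they can be approximated as whatever remains after subtracting these two numbers from the GPU usage you actually observe.

```python
def weight_bytes(num_params: int, bytes_per_param: int = 2) -> int:
    """Memory taken by the model weights (2 bytes per parameter for fp16/bf16)."""
    return num_params * bytes_per_param


def kv_cache_bytes(
    num_layers: int,
    num_kv_heads: int,
    head_dim: int,
    num_tokens: int,
    bytes_per_elem: int = 2,
) -> int:
    """Memory taken by the KV cache for `num_tokens` cached tokens in total.

    The leading factor 2 accounts for one key and one value tensor per layer.
    """
    return 2 * num_layers * num_kv_heads * head_dim * num_tokens * bytes_per_elem


def to_gib(num_bytes: int) -> float:
    """Convert bytes to GiB."""
    return num_bytes / 2**30


if __name__ == "__main__":
    # Hypothetical 7B-class model: 32 layers, 32 KV heads, head_dim 128.
    weights = weight_bytes(7_000_000_000)
    kv = kv_cache_bytes(num_layers=32, num_kv_heads=32, head_dim=128, num_tokens=4096)
    print(f"weights:  {to_gib(weights):.2f} GiB")
    print(f"KV cache: {to_gib(kv):.2f} GiB for 4096 cached tokens")
```

Note that an inference engine may preallocate the KV cache as a fixed pool up front, so the observed KV cache footprint can be larger than what the tokens currently in flight would need.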