diff --git a/Popular_Models_Guide/Hermes-2-Pro-Llama-3-8B/README.md b/Popular_Models_Guide/Hermes-2-Pro-Llama-3-8B/README.md
index 1c3625b4..6e8ef4ec 100644
--- a/Popular_Models_Guide/Hermes-2-Pro-Llama-3-8B/README.md
+++ b/Popular_Models_Guide/Hermes-2-Pro-Llama-3-8B/README.md
@@ -202,7 +202,7 @@ First, let's start Triton SDK container:
 docker run --rm -it --net host --shm-size=2g \
     --ulimit memlock=-1 --ulimit stack=67108864 --gpus all \
     -v /path/to/tensorrtllm_backend/inflight_batcher_llm/client:/tensorrtllm_client \
-    -v /path/to/Hermes-2-Pro-Llama-3-8B/repo:/Llama-2-7b-hf \
+    -v /path/to/Hermes-2-Pro-Llama-3-8B/repo:/Hermes-2-Pro-Llama-3-8B \
     nvcr.io/nvidia/tritonserver:-py3-sdk
 ```