Add support for nvclip endpoint #147

dyastremsky · 2024-10-22T17:57:47Z

Add support for NVClip NIM endpoint.

Benchmarking NVCLip:

dyastremsky · 2024-10-22T18:03:49Z

genai-perf/genai_perf/inputs/converters/base_converter.py

@@ -64,3 +64,9 @@ def _select_model_name(self, config: InputsConfig, index: int) -> str:
            raise GenAIPerfException(
                f"Model selection strategy '{config.model_selection_strategy}' is unsupported"
            )
+
+    def _add_request_params(


This gets reused among converters, so I moved it to the base converter class. Other converts can reimplement it for custom logic.

dyastremsky · 2024-10-22T18:07:16Z

genai-perf/genai_perf/inputs/input_constants.py

+    ################################################################
+    # Triton backends
+    ################################################################
+    TENSORRTLLM = auto()


It looks like the logic added here expected the options to only list the Triton backends. However, because that was unclear when adding OutputFormats here, it got out of sync.

We might need a better approach long-term as its own ticket, but I did clean-up here to fix the issue for now. See help prompt with the correct backends displayed below.

dyastremsky · 2024-10-22T18:07:50Z

genai-perf/genai_perf/parser.py

@@ -634,7 +638,7 @@ def _add_endpoint_args(parser):
    endpoint_group.add_argument(
        "--backend",
        type=str,
-        choices=utils.get_enum_names(ic.OutputFormat)[2:],
+        choices=utils.get_enum_names(ic.OutputFormat)[0:2],


We might need a better approach long-term as its own ticket, but I did clean-up here to fix the issue for now. See help prompt with the correct backends displayed below.

Add support for nvclip converter Add support for nvclip converter Add support for nvclip converter Add support for nvclip converter

nicolasnoble

This looks good to me, I haven't seen anything wrong with this PR.

One general comment however, which isn't related to this PR in particular, but I am starting to wish there was a bit more docstrings throughout the codebase, to better describe and document the APIs that we create, not only for the external users, but also for ourselves, when we have to maintain code in the future.

Don't take this as a request to document this one PR thoroughly, but this may be something to consider for future work / refactoring.

As an example, I feel I haven't documented enough my own python script that I submitted in third_party, tho I'd defend against my own argument saying this is a temporary fix until abseil's fix trickled down to us:

https://github.com/triton-inference-server/third_party/blob/main/tools/patch.py

dyastremsky · 2024-10-22T23:40:34Z

This looks good to me, I haven't seen anything wrong with this PR.

One general comment however, which isn't related to this PR in particular, but I am starting to wish there was a bit more docstrings throughout the codebase, to better describe and document the APIs that we create, not only for the external users, but also for ourselves, when we have to maintain code in the future.

Don't take this as a request to document this one PR thoroughly, but this may be something to consider for future work / refactoring.

As an example, I feel I haven't documented enough my own python script that I submitted in third_party, tho I'd defend against my own argument saying this is a temporary fix until abseil's fix trickled down to us:

https://github.com/triton-inference-server/third_party/blob/main/tools/patch.py

Thanks for reviewing and providing feedback! I'm happy to talk about how to best do this for GenAI-Perf and our codebase in general sometime. The base converter class has some docstrings to document converter functions, though we can probably do more. Finding clearer ways to document is always good.

dyastremsky self-assigned this Oct 22, 2024

dyastremsky temporarily deployed to GITLAB October 22, 2024 17:57 — with GitHub Actions Inactive

dyastremsky temporarily deployed to GITLAB October 22, 2024 17:58 — with GitHub Actions Inactive

dyastremsky force-pushed the dyas-nvclip branch from baad3cd to 9472697 Compare October 22, 2024 17:59

dyastremsky temporarily deployed to GITLAB October 22, 2024 17:59 — with GitHub Actions Inactive

dyastremsky force-pushed the dyas-nvclip branch from 9472697 to 3e72239 Compare October 22, 2024 18:02

dyastremsky temporarily deployed to GITLAB October 22, 2024 18:03 — with GitHub Actions Inactive

dyastremsky commented Oct 22, 2024

View reviewed changes

dyastremsky force-pushed the dyas-nvclip branch from 3e72239 to 0353bb9 Compare October 22, 2024 18:04

dyastremsky temporarily deployed to GITLAB October 22, 2024 18:04 — with GitHub Actions Inactive

dyastremsky commented Oct 22, 2024

View reviewed changes

Add support for nvclip converter

986fe50

Add support for nvclip converter Add support for nvclip converter Add support for nvclip converter Add support for nvclip converter

dyastremsky force-pushed the dyas-nvclip branch from 0353bb9 to 986fe50 Compare October 22, 2024 18:08

dyastremsky temporarily deployed to GITLAB October 22, 2024 18:08 — with GitHub Actions Inactive

dyastremsky requested review from nicolasnoble, matthewkotila and ganeshku1 October 22, 2024 18:09

dyastremsky marked this pull request as ready for review October 22, 2024 18:09

nicolasnoble approved these changes Oct 22, 2024

View reviewed changes

dyastremsky merged commit 55f1f34 into main Oct 22, 2024
6 checks passed

dyastremsky deleted the dyas-nvclip branch October 22, 2024 23:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for nvclip endpoint #147

Add support for nvclip endpoint #147

dyastremsky commented Oct 22, 2024 •

edited

Loading

dyastremsky Oct 22, 2024

dyastremsky Oct 22, 2024 •

edited

Loading

dyastremsky Oct 22, 2024

nicolasnoble left a comment

dyastremsky commented Oct 22, 2024

Add support for nvclip endpoint #147

Add support for nvclip endpoint #147

Conversation

dyastremsky commented Oct 22, 2024 • edited Loading

dyastremsky Oct 22, 2024

Choose a reason for hiding this comment

dyastremsky Oct 22, 2024 • edited Loading

Choose a reason for hiding this comment

dyastremsky Oct 22, 2024

Choose a reason for hiding this comment

nicolasnoble left a comment

Choose a reason for hiding this comment

dyastremsky commented Oct 22, 2024

dyastremsky commented Oct 22, 2024 •

edited

Loading

dyastremsky Oct 22, 2024 •

edited

Loading