The returned output starts with the original text of the input. This wastes network bandwidth, especially when the input is very long. Could a flag be provided so that only the generated text is returned? Thanks.
@BasicCoder Your approach also works, but it would be better if the tensorrt_llm backend could cut the input tokens out of the result by itself. I'm using the tensorrt_llm backend only (not the ensemble), because I moved my tokenizer to a separate server for business-logic reasons. If the tensorrt_llm backend supported this feature itself, we wouldn't need an additional Python backend.
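Until the backend supports this natively, one client-side workaround is to slice the echoed prefix off the returned `output_ids` using the known input length. A minimal sketch, assuming `output_ids` has shape `[beam_width, seq_len]` and that every beam is prefixed with the original input tokens (the behavior reported in this issue, not a documented contract):

```python
import numpy as np

def strip_input_tokens(output_ids: np.ndarray, input_length: int) -> np.ndarray:
    """Drop the echoed input prefix from a response.

    Assumes output_ids has shape [beam_width, seq_len] and that the backend
    prepends the original input tokens to every beam.
    """
    return output_ids[:, input_length:]

# Example: 5 input tokens were sent; keep only the newly generated tokens.
output_ids = np.array([[101, 2023, 2003, 1996, 7953, 345, 678, 910]])
generated = strip_input_tokens(output_ids, input_length=5)
print(generated)  # [[345 678 910]]
```

This keeps the tokenizer-free client untouched, but it still pays the cost of sending the input tokens back over the network, which is exactly what the requested flag would avoid.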