Update on the development branch #892
kaiyux
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
The TensorRT-LLM team is pleased to announce that we are pushing an update to the development branch (and the Triton backend) this January 16th, 2024.
This update includes:
temperature
parameter of sampling configuration should be 0.enable_trt_overlap
argument for GPT manager by defaultdocs/source/new_workflow.md
documentationThanks,
The TensorRT-LLM Engineering Team
Beta Was this translation helpful? Give feedback.
All reactions