Skip to content

Release 2.49.0 corresponding to NGC container 24.08

Latest
Compare
Choose a tag to compare
@nvda-mesharma nvda-mesharma released this 30 Aug 18:37
· 4 commits to main since this release
98947a7

What's Changed

  • refactor: Remove explicit callings to garbage collect by @kthui in #55
  • perf: Check for cancellation on response thread by @kthui in #54
  • feat: Add vLLM counter metrics access through Triton by @yinggeh in #53
  • feat: Report histogram metrics to Triton metrics server by @yinggeh in #58
  • feat: Report more histogram metrics by @yinggeh in #61

Full Changelog: v24.07...v24.08