Skip to content

DJLServing v0.24.0 release

Compare
Choose a tag to compare
@zachgk zachgk released this 17 Oct 22:10
· 1007 commits to master since this release

Key Features

  • Updates Components
    • Updates Neuron to 2.14.1
    • Updates DeepSpeed to 0.10.0
  • Improved Python logging
  • Improved SeqScheduler
  • Adds DeepSpeed dynamic int8 quantization with SmoothQuant
  • Supports for llama 2
  • Supports Safetensors
  • Adds Neuron dynamic batching and rolling batch
  • Adds Adapter API Preview
  • Supports HuggingFace Stopwords

Enhancement

Bug fixes

Documentation and Examples

CI

New Contributors

Full Changelog: v0.23.0...v0.24.0