v1.1.0
Stable Release
Notice: This release is not backwards compatible - see notes below on upgrading
-
Breaking Changes
- Triton engine size supports variable request size (-1)
-
Features & Bug fixes
- Add version number of serving session task
- Triton engine support for variable request (matrix) sizes
- Triton support, fix --aux-config to support more configurations elements
- Huggingface Transformer support
Preprocess
class as module (see note below)
Note: To add a Preprocess
class from a module (the entire module folder will be packaged)
preprocess_folder
├── __init__.py # from .sub.some_file import Preprocess
└── sub
└── some_file.py
Pass the top folder as a path for --preprocess
, for example:
clearml-serving --id <serving_session_id> model add --preprocess /path/to/preprocess_folder ...
Upgrading from v1.0
- Take down the serving containers (docker-compose or k8s)
- Update the clearml-serving CLI
pip3 install -U clearml-serving
- Re-add a single existing endpoint with
clearml-serving model add ...
(press yes when asked)
(it will upgrade the clearml-serving session definitions) - Pull latest serving containers (
docker-compose pull ...
or k8s) - Re-spin serving containers (docker-compose or k8s)