Currently, MLServer does not deal well with models that are loaded on the same inference pool as other, heavily used models. In this case there is a risk of starvation, so we want to allow the user to create models on separate processes (a different inference pool).
MLServer already supports this, but only if the model uses a specific custom environment tarball.
Proposed solution: Introduce `inference_pool_gid` on the `ModelParameters` class. This gives the user the option to add the model to a dedicated inference pool group. Note that this allows adding either a single model or multiple models to the same group. I also included an `autogenerate_inference_pool_gid` boolean flag: when it is set to `True` and no `inference_pool_gid` is provided, a gid is generated using `uuid4`, placing the model in its own dedicated group so the user does not have to specify the gid themselves.
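For illustration, a minimal sketch of how the new parameters could be used, assuming the proposed fields land on `ModelParameters` as described above (the `uri` values are placeholders):

```python
from mlserver.settings import ModelParameters

# Explicit group: models sharing this gid run on the same dedicated
# inference pool, isolated from the default pool.
shared = ModelParameters(
    uri="./model-a.joblib",
    inference_pool_gid="pool-group-1",  # proposed field
)

# Auto-generated group: with the proposed flag set and no gid provided,
# MLServer would generate one via uuid4, giving the model its own pool.
isolated = ModelParameters(
    uri="./model-b.joblib",
    autogenerate_inference_pool_gid=True,  # proposed flag
)
```

The same settings would typically be supplied through a model's `model-settings.json`, so no code changes are needed on the user's side beyond configuration.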