Hot Reload #58

Closed
HRashidi opened this issue Feb 26, 2024 · 2 comments
Labels
enhancement New feature or request

Comments

@HRashidi
Contributor

Enhancement Description

  • Right now, if any part of the SDK changes, the whole SDK has to be restarted. This is not ideal because a restart takes a long time, especially when some of the models are slow to load.

Advantages

  • With this improvement, we would no longer need to restart the entire SDK to add or remove models or features. This is especially useful when updating model weights.

Possible Implementation

  • Split the SDK into separate parts so that each deployment is launched on its own. This would also allow us to change the number of replicas, or to autoscale, based on load.
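
To make the idea concrete, here is a minimal pure-Python sketch of what "launching each deployment separately" could look like. It is not the SDK's actual code: `Deployment`, `DeploymentRegistry`, and all of their methods are hypothetical names invented for illustration, and replicas are modelled as plain in-process objects rather than real workers.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Deployment:
    """One independently managed unit (hypothetical, not the SDK's real class)."""

    name: str
    loader: Callable[[], object]  # builds one replica, e.g. loads model weights
    num_replicas: int = 1
    replicas: List[object] = field(default_factory=list)

    def start(self) -> None:
        # Load every replica from scratch; only this deployment pays the cost.
        self.replicas = [self.loader() for _ in range(self.num_replicas)]


class DeploymentRegistry:
    """Tracks deployments so each can be added, replaced, or removed alone."""

    def __init__(self) -> None:
        self._deployments: Dict[str, Deployment] = {}

    def deploy(self, deployment: Deployment) -> None:
        # Adding or replacing one entry leaves all other deployments untouched.
        deployment.start()
        self._deployments[deployment.name] = deployment

    def remove(self, name: str) -> None:
        self._deployments.pop(name, None)

    def scale(self, name: str, num_replicas: int) -> None:
        # Grow by loading extra replicas, shrink by dropping surplus ones.
        dep = self._deployments[name]
        while len(dep.replicas) < num_replicas:
            dep.replicas.append(dep.loader())
        del dep.replicas[num_replicas:]
        dep.num_replicas = num_replicas

    def get(self, name: str) -> Deployment:
        return self._deployments[name]

    def names(self) -> List[str]:
        return sorted(self._deployments)
```

With something like this, calling `deploy` again with a new loader for one model would swap in updated weights without reloading any other model, and `scale` would change the replica count for a single deployment.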
@HRashidi HRashidi added the enhancement New feature or request label Feb 26, 2024
@movchan74
Contributor

We already have an issue for that: #29
If you have a suggestion on how to implement it, please add it as a comment there. Please check if an issue already exists before opening a new one.

@HRashidi
Contributor Author

HRashidi commented Mar 5, 2024

Duplicate of #29.

@HRashidi HRashidi closed this as completed Mar 5, 2024