DeepSpeed #88

Open
ccmaymay opened this issue Mar 9, 2023 · 3 comments
Labels
new-framework: New software framework to consider for improving backend
performance: Performance issues or improvements

Comments

@ccmaymay
Collaborator

ccmaymay commented Mar 9, 2023

w/ bf16

ccmaymay added the new-framework label Mar 9, 2023
@ccmaymay
Collaborator Author

ccmaymay commented Mar 10, 2023

https://github.com/microsoft/DeepSpeed-MII#deploying-mii-public

Has a gRPC server that seems to resemble our own. See https://github.com/microsoft/DeepSpeed-MII/blob/main/mii/server.py, for example:

        # XXX: fetch model provider based on model name in a more general way
        if model_name == "gpt-neox":
            provider = mii.constants.MODEL_PROVIDER_NAME_EA
        elif ("bigscience/bloom" == model_name) or ("microsoft/bloom" in model_name):
            provider = mii.constants.MODEL_PROVIDER_NAME_HF_LLM
        elif self.task == mii.Tasks.TEXT2IMG:
            provider = mii.constants.MODEL_PROVIDER_NAME_DIFFUSERS
        else:
            provider = mii.constants.MODEL_PROVIDER_NAME_HF
        server_args_str += f" --provider {provider}"
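
For comparison with our own dispatch code, the if/elif chain above could be written as an ordered rule table, which is roughly what the XXX comment asks for. This is only a sketch: the provider and task strings below are hypothetical placeholders for the `mii.constants` and `mii.Tasks` values (the real values are not shown in the excerpt), and `ProviderRule`, `PROVIDER_RULES`, and `select_provider` are names invented here, not part of MII.

```python
from typing import Callable, NamedTuple, Optional

# Hypothetical placeholders for mii.constants.MODEL_PROVIDER_NAME_* and
# mii.Tasks.TEXT2IMG; the real values in DeepSpeed-MII may differ.
PROVIDER_EA = "ea"
PROVIDER_HF_LLM = "hf-llm"
PROVIDER_DIFFUSERS = "diffusers"
PROVIDER_HF = "hf"
TASK_TEXT2IMG = "text-to-image"


class ProviderRule(NamedTuple):
    """A predicate over (model_name, task) and the provider it selects."""
    matches: Callable[[str, Optional[str]], bool]
    provider: str


# Rules are checked in order and the first match wins, mirroring the
# if/elif chain in the excerpt.
PROVIDER_RULES = [
    ProviderRule(lambda name, task: name == "gpt-neox", PROVIDER_EA),
    ProviderRule(
        lambda name, task: name == "bigscience/bloom" or "microsoft/bloom" in name,
        PROVIDER_HF_LLM,
    ),
    ProviderRule(lambda name, task: task == TASK_TEXT2IMG, PROVIDER_DIFFUSERS),
]


def select_provider(model_name: str, task: Optional[str] = None) -> str:
    """Return the provider of the first matching rule, else the default."""
    for rule in PROVIDER_RULES:
        if rule.matches(model_name, task):
            return rule.provider
    return PROVIDER_HF
```

Adding a model family then means adding one entry to `PROVIDER_RULES` rather than another `elif` branch.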

ccmaymay self-assigned this Mar 13, 2023
@ccmaymay
Collaborator Author

Branch status: event loop not found at query time
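
No traceback is recorded here, so the cause is a guess. If the failure is asyncio's "There is no current event loop" `RuntimeError`, which `asyncio.get_event_loop()` raises when called from a thread that has no loop set, one possible workaround is to give the querying thread its own loop. The sketch below assumes that situation; `fake_query` is a hypothetical stand-in for an async client call and is not an MII function.

```python
import asyncio
import threading


async def fake_query(prompt: str) -> str:
    # Hypothetical stand-in for an async client call.
    await asyncio.sleep(0)
    return f"echo: {prompt}"


def query_from_worker_thread(prompt: str) -> str:
    """Run fake_query in a worker thread that creates and owns its event loop."""
    result = {}

    def worker() -> None:
        # A non-main thread has no event loop by default, so create one
        # explicitly and register it as this thread's current loop.
        loop = asyncio.new_event_loop()
        asyncio.set_event_loop(loop)
        try:
            result["value"] = loop.run_until_complete(fake_query(prompt))
        finally:
            asyncio.set_event_loop(None)
            loop.close()

    thread = threading.Thread(target=worker)
    thread.start()
    thread.join()
    return result["value"]
```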

ccmaymay added the performance label Jul 5, 2023
@ccmaymay
Collaborator Author

Looks like DeepSpeed-MII recently removed the Bloom-specific LLM functionality and replaced it with a generalized "meta tensor" functionality: microsoft/DeepSpeed-MII@60a85dc

May be worth trying out again.

ccmaymay removed their assignment Jul 29, 2023