Any plans on supporting Llama3.2 text and multimodal on Qualcomm AI 100? #152

Open
alew3 opened this issue Oct 10, 2024 · 2 comments

alew3 commented Oct 10, 2024

Do you plan on supporting Llama3.2 (text/multimodal) on Qualcomm AI 100?
I saw this post (https://www.qualcomm.com/news/onq/2024/09/qualcomm-partners-with-meta-to-support-llama-3-point-2-big-deal-for-on-device-ai), but it seems to cover only Snapdragon chips.

alew3 (Author) commented Oct 10, 2024

BTW, the "models coming soon" link in your README is broken (https://github.com/quic/efficient-transformers/blob/main/models-coming-soon)

quic-rishinr (Contributor) commented Oct 14, 2024

Hi Aless, the Llama 3.2 1B and 3B models work out of the box in the current repository, provided you use one of the latest product software releases. Could you share details on the Qualcomm Cloud AI 100 instances and the software SDK you are using?

The changes for the Llama 3.2 text models (11B and 90B) are currently under review. If you would like to run these models, you can cherry-pick the changes from #134 onto mainline and proceed with validation.

Regarding the Llama 3.2 multimodal model, it is still under evaluation. I will keep you updated on any progress.

Additionally, I will address the issue with the broken README link.

anujgupt-github changed the title from "Any plans on supporting Llama3.2 text and multimodal on QualcommA100?" to "Any plans on supporting Llama3.2 text and multimodal on Qualcomm AI 100?" on Oct 15, 2024