[Feature]: DeepSeek-R1-Distill-Qwen or similar distilled DeepSeek gguf support #1059

Open
zsogitbe opened this issue Jan 28, 2025 · 5 comments
Labels
enhancement New feature or request

Comments

@zsogitbe
Contributor

zsogitbe commented Jan 28, 2025

Martin, do we have support for DeepSeek-R1-Distill-Qwen gguf or similar already (distilled version of deepseek)?
Thank you.

@martindevans
Member

No, I don't think so, not yet. We're currently about 5 weeks out of date.

Glancing through the PRs, I can see these two which look relevant:

The next version update is already in the works. I think there's just one major issue left to resolve, which I hope to get to this weekend.

@martindevans martindevans added the enhancement New feature or request label Jan 28, 2025
@zsogitbe
Contributor Author

OK! The latest llama.cpp version should add support. Thank you!

@phil-scott-78
Contributor

Pretty excited to see the new binaries added. It's not just DeepSeek: it'll also be an opportunity to try to incorporate the changes they've made to support Jinja templates and, as of a few days ago, the tooling support.

@martindevans
Member

Is the new and improved Jinja support in the core llama.cpp, or just in the examples (e.g. common.cpp)? A lot of big changes like that land in examples (which LLamaSharp does not include) before they make their way into the main project.

@phil-scott-78
Contributor

It's in core. A pretty big one too; I think some of the features related to Deepthink might require it. I know the tooling stuff does: ggerganov/llama.cpp#11016
