Adds new local models and updates documentation #97

vishnuravi · 2025-02-02T02:33:41Z

Adds new local models and updates documentation

⚙️ Release Notes

Adds support for running the following additional models, including those used in the SpeziLLM on-device benchmarking study:

Llama3.1-Aloe-Beta-8B
Llama3-Med42-8B
Qwen2-7B-4bit
DeepSeek-R1-Distill-Qwen-1.5B-8bit
DeepSeek-R1-Distill-Qwen-7B-4bit
DeepSeek-R1-Distill-Llama-8B-4bit-mlx

Updates documentation and README to reflect changes in the API.

📝 Code of Conduct & Contributing Guidelines

By submitting creating this pull request, you agree to follow our Code of Conduct and Contributing Guidelines:

I agree to follow the Code of Conduct and Contributing Guidelines.

philippzagar

Thanks @vishnuravi, let's see what these models can do! 🚀

Sources/SpeziLLMLocal/Configuration/LLMLocalModel.swift

PSchmiedmayer

Thank you @vishnuravi 🚀

codecov · 2025-02-03T16:25:31Z

Codecov Report

Attention: Patch coverage is 0% with 13 lines in your changes missing coverage. Please review.

Project coverage is 39.12%. Comparing base (fe15019) to head (2535c21).
Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
...es/SpeziLLMLocal/Configuration/LLMLocalModel.swift	0.00%	13 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #97      +/-   ##
==========================================
+ Coverage   38.34%   39.12%   +0.79%     
==========================================
  Files          64       64              
  Lines        2345     2357      +12     
==========================================
+ Hits          899      922      +23     
+ Misses       1446     1435      -11

Files with missing lines	Coverage Δ
...s/SpeziLLMLocalDownload/LLMLocalDownloadView.swift	`0.00% <ø> (ø)`
...es/SpeziLLMLocal/Configuration/LLMLocalModel.swift	`0.00% <0.00%> (ø)`

... and 2 files with indirect coverage changes

Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update fe15019...2535c21. Read the comment docs.

This reverts commit 7bb83b6.

Co-authored-by: Paul Schmiedmayer <[email protected]>

This reverts commit ad15bdb.

…-4bit-mlx models

…into deepseek

PSchmiedmayer

Thank you for improving the documentation @vishnuravi!

I had some minor comments that should be all easily resolvable here or we create follow-up issues (e.g. Onboarding dependency).

README.md

Sources/SpeziLLMLocalDownload/SpeziLLMLocalDownload.docc/SpeziLLMLocalDownload.md

README.md

Co-authored-by: Paul Schmiedmayer <[email protected]>

Sources/SpeziLLMLocal/Configuration/LLMLocalModel.swift

README.md

vishnuravi changed the base branch from main to feature/bump-mlx-2.21.2 February 2, 2025 02:34

philippzagar approved these changes Feb 2, 2025

View reviewed changes

Sources/SpeziLLMLocal/Configuration/LLMLocalModel.swift Show resolved Hide resolved

vishnuravi mentioned this pull request Feb 2, 2025

Bump mlx to version 2.21.2 #94

Merged

1 task

vishnuravi marked this pull request as ready for review February 3, 2025 02:02

vishnuravi changed the title ~~Add support for running DeepSeek r1 locally~~ Add support for running DeepSeek r1 distilled models locally Feb 3, 2025

vishnuravi changed the title ~~Add support for running DeepSeek r1 distilled models locally~~ Add support for running DeepSeek R1 distilled models locally Feb 3, 2025

vishnuravi force-pushed the feature/bump-mlx-2.21.2 branch from 13becbc to 7a1e836 Compare February 3, 2025 14:06

Base automatically changed from feature/bump-mlx-2.21.2 to main February 3, 2025 14:41

vishnuravi force-pushed the deepseek branch from 90af7c2 to 77f584a Compare February 3, 2025 15:10

PSchmiedmayer approved these changes Feb 3, 2025

View reviewed changes

Leon Nissen and others added 19 commits February 3, 2025 13:50

bump mlx to version 2.21.2

b912e74

Fixes warnings

62b0cf2

Revert "Fixes warnings"

7f13d57

This reverts commit 7bb83b6.

Autofix warnings

f3d54c0

Suppress cyclomatic complexity

1915b64

Update SpeziChat

127a3d2

Update Sources/SpeziLLMLocal/LLMLocalSession+Generate.swift

6e55c08

Co-authored-by: Paul Schmiedmayer <[email protected]>

Revert "Update Sources/SpeziLLMLocal/LLMLocalSession+Generate.swift"

11fcf00

This reverts commit ad15bdb.

Fix return from void function warning

03d596a

Refactor into private functions

ecf7a77

Refactor processing of remaining tokens into private function

2a2ea86

Move error handling and parameter generation to private functions

b453e7a

Remove broken links to Apple documentation

66af686

Remove broken link from documentation

aceabc3

Add license header

68f5362

Custom LinkSpector config

71980dc

Add deepseek r1 distill qwen 1.5B model

1790751

Add DeepSeek-R1-Distill-Qwen-7B-4bit and DeepSeek-R1-Distill-Llama-8B…

4a78e5f

…-4bit-mlx models

Adds LLMLocalPlatform to Configuration

14f6a49

Update development team

4f88780

vishnuravi force-pushed the deepseek branch from 2e77114 to 4f88780 Compare February 3, 2025 18:50

vishnuravi and others added 6 commits February 3, 2025 13:51

Merge branch 'main' into deepseek

b73c685

Update example in docs

970e3f8

Merge branch 'deepseek' of https://github.com/StanfordSpezi/SpeziLLM …

a28e4e4

…into deepseek

Update docs for LLMLocalDownloadView

f27067e

Update docs

0eea68b

Fix formatting in README

d3cc941

PSchmiedmayer approved these changes Feb 5, 2025

View reviewed changes

vishnuravi and others added 2 commits February 8, 2025 10:51

Update README.md

eaf1420

Co-authored-by: Paul Schmiedmayer <[email protected]>

Update README per comments

791fbfa

vishnuravi changed the title ~~Add support for running DeepSeek R1 distilled models locally~~ Adds new models and updates documentation Feb 9, 2025

vishnuravi changed the title ~~Adds new models and updates documentation~~ Adds new local Hugging Face models and updates documentation Feb 9, 2025

vishnuravi changed the title ~~Adds new local Hugging Face models and updates documentation~~ Adds new local models and updates documentation Feb 9, 2025

vishnuravi added 4 commits February 11, 2025 12:25

Add Aloe Beta 8B and Med42 8B

824e4ba

Add Qwen2 7B 4bit

96b6e47

Update README

15154cf

Fix qwen 1.5 case name

fec3680

vishnuravi commented Feb 12, 2025

View reviewed changes

Sources/SpeziLLMLocal/Configuration/LLMLocalModel.swift Show resolved Hide resolved

LeonNissen reviewed Feb 12, 2025

View reviewed changes

README.md Outdated Show resolved Hide resolved

Switch to GPT-4o

2535c21

vishnuravi merged commit 4a86cbf into main Feb 12, 2025
19 of 20 checks passed

vishnuravi deleted the deepseek branch February 12, 2025 00:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adds new local models and updates documentation #97

Adds new local models and updates documentation #97

vishnuravi commented Feb 2, 2025 •

edited

Loading

philippzagar left a comment

PSchmiedmayer left a comment

codecov bot commented Feb 3, 2025 •

edited

Loading

PSchmiedmayer left a comment

Adds new local models and updates documentation #97

Adds new local models and updates documentation #97

Conversation

vishnuravi commented Feb 2, 2025 • edited Loading

Adds new local models and updates documentation

⚙️ Release Notes

📝 Code of Conduct & Contributing Guidelines

philippzagar left a comment

Choose a reason for hiding this comment

PSchmiedmayer left a comment

Choose a reason for hiding this comment

codecov bot commented Feb 3, 2025 • edited Loading

Codecov Report

PSchmiedmayer left a comment

Choose a reason for hiding this comment

vishnuravi commented Feb 2, 2025 •

edited

Loading

codecov bot commented Feb 3, 2025 •

edited

Loading