MPT 30B inference on Mac M1 #353
RonanKMcGovern started this conversation in Ideas
Replies: 1 comment
-
This is now supported in llama.cpp: ggerganov/llama.cpp#3417
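For anyone landing here later, a run might look roughly like the following. This is a sketch, not a tested recipe: the conversion script name, quantization type, and binary names have changed across llama.cpp versions, and the `mpt-30b` paths are placeholders.

```shell
# Build llama.cpp; Metal acceleration is enabled by default on Apple Silicon
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
make

# Convert the Hugging Face MPT checkpoint to GGUF, then quantize so 30B
# fits in M1 unified memory (script name and quant type may differ by
# version; check the repo's current docs)
python3 convert-hf-to-gguf.py /path/to/mpt-30b --outfile mpt-30b-f16.gguf
./quantize mpt-30b-f16.gguf mpt-30b-q4_k_m.gguf q4_k_m

# Run inference, offloading layers to the GPU with -ngl
./main -m mpt-30b-q4_k_m.gguf -ngl 99 -c 4096 -p "Write a haiku about M1 Macs."
```

At 4-bit quantization a 30B model needs on the order of 20 GB of memory, so this is only realistic on higher-memory M1 configurations.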
-
Is it realistic to try to get inference running on a Mac M1 with results quality similar to a GPU?
I find 7B and 13B models aren't good enough to get working well with functions. I also like that MPT has extendable context compared to Falcon and llama.
If I were to try to get MPT 30B running, can I bootstrap using work from llama? Thanks