Skip to content

Importing Exllamav2 taking so much time #342

Answered by turboderp
luisfrentzen-cc asked this question in Q&A
Discussion options

You must be logged in to vote

2+ minutes sounds excessive, but I guess it's possible for a slow CPU?

I don't know if copying the cache works, but if you can't use any of the prebuilt wheels you can also build your own wheel with something like:

pip wheel --no-deps -w dist .

This should create a .whl file in the dist directory, containing both the exllamav2 and exllamav2_ext packages. Then install it as part of the docker image build with pip install whatever.whl

Replies: 1 comment 4 replies

Comment options

You must be logged in to vote
4 replies
@luisfrentzen-cc
Comment options

@turboderp
Comment options

Answer selected by luisfrentzen-cc
@luisfrentzen-cc
Comment options

@luisfrentzen-cc
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants