
[Model Request] please add "pkshatech/GLuCoSE-base-ja" #294

Open
xrd opened this issue Jul 10, 2024 · 3 comments

Comments


xrd commented Jul 10, 2024

What happened?

I would like to use "pkshatech/GLuCoSE-base-ja"

What Python version are you on? e.g. python --version

Python 3.10

Version

0.2.7 (Latest)

What os are you seeing the problem on?

Linux

Relevant stack traces and/or logs

No response

@xrd xrd changed the title Model Request "pkshatech/GLuCoSE-base-ja" [Model Request] please add "pkshatech/GLuCoSE-base-ja" Jul 14, 2024

quanduongduc commented Jul 17, 2024

You can manually add a custom model to the list of supported ONNX models in fastembed.text.onnx_embedding like this:

from fastembed.text.onnx_embedding import supported_onnx_models

supported_onnx_models.append({
    "model": "Alibaba-NLP/gte-large-en-v1.5",
    "dim": 1024,
    "description": "Custom Model",
    "size_in_GB": 1.7,
    "sources": {
        # from huggingface
        "hf": "Alibaba-NLP/gte-large-en-v1.5",
    },
    # This makes fastembed download this model file:
    # https://huggingface.co/Alibaba-NLP/gte-large-en-v1.5/blob/main/onnx/model.onnx
    "model_file": "onnx/model.onnx",
})

# qdrant_client must be imported after supported_onnx_models has been updated
from qdrant_client import QdrantClient, models

Note that your model must be in ONNX format, because fastembed uses the ONNX Runtime. Also, make sure to include accurate values for dim, sources, and model_file.

I'm not sure about the accuracy of the embedding results, but the model runs without any exceptions.
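Since a typo in the entry only surfaces once fastembed tries to download and load the model, a small pre-flight check can help. The `validate_entry` helper below is entirely hypothetical (not part of fastembed); the required keys mirror the entry shown above, and the `dim` and `size_in_GB` values are illustrative guesses to be confirmed against the model card:

```python
# Minimal sketch: sanity-check a custom model entry before appending it to
# fastembed's supported list. validate_entry is a hypothetical helper, not
# part of fastembed; the required keys mirror the entry shown above.

REQUIRED_KEYS = {"model", "dim", "sources", "model_file"}

def validate_entry(entry: dict) -> list[str]:
    """Return a list of problems found in a custom model entry."""
    problems = [f"missing key: {k}" for k in sorted(REQUIRED_KEYS - entry.keys())]
    if not isinstance(entry.get("dim"), int) or entry.get("dim", 0) <= 0:
        problems.append("dim must be a positive int")
    if "hf" not in entry.get("sources", {}):
        problems.append("sources must include an 'hf' repo id")
    if not str(entry.get("model_file", "")).endswith(".onnx"):
        problems.append("model_file should point to an .onnx file")
    return problems

entry = {
    "model": "pkshatech/GLuCoSE-base-ja",
    "dim": 768,          # illustrative; confirm against the model card
    "description": "Custom Model",
    "size_in_GB": 0.5,   # illustrative; confirm against the model card
    "sources": {"hf": "pkshatech/GLuCoSE-base-ja"},
    "model_file": "onnx/model.onnx",
}
assert validate_entry(entry) == []  # no problems found
```

Running the check before the `append` keeps a malformed entry from silently breaking model loading later.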

Anush008 (Member) commented Jul 17, 2024

You can also PR the model to be added.

It could look like #270.

joein (Member) commented Jul 17, 2024

@quanduongduc with this approach you would need to be careful with the postprocessing required by particular models: some models require pooling the ONNX output, and some don't. You'd need to choose the proper implementation, such as onnx_embedding or pooled_embedding.

We also don't guarantee that these implementations won't be refactored without warning, since we treat them as internal classes.

No one restricts you from doing it, though.
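The pooling distinction mentioned above can be sketched as follows: a model whose ONNX graph emits token-level embeddings needs mean pooling over the attention mask before the vectors are usable, while a model that already outputs a sentence vector does not. This is a minimal NumPy sketch of mean pooling; the function name and array shapes are illustrative assumptions, not fastembed's internals:

```python
import numpy as np

# Illustrative mean pooling over token-level ONNX output; shapes are
# assumptions, not fastembed internals.
# token_embeddings: (batch, seq, dim)
# attention_mask:   (batch, seq), 1 for real tokens and 0 for padding
def mean_pool(token_embeddings: np.ndarray, attention_mask: np.ndarray) -> np.ndarray:
    mask = attention_mask[..., None].astype(token_embeddings.dtype)  # (batch, seq, 1)
    summed = (token_embeddings * mask).sum(axis=1)                   # (batch, dim)
    counts = np.clip(mask.sum(axis=1), 1e-9, None)                   # avoid div by zero
    return summed / counts

# Toy example: one sequence of length 3 whose last token is padding.
emb = np.array([[[1.0, 2.0], [3.0, 4.0], [100.0, 100.0]]])
mask = np.array([[1, 1, 0]])
print(mean_pool(emb, mask))  # padding token is ignored -> [[2. 3.]]
```

Appending an entry for a token-output model to the plain onnx_embedding list would skip this step entirely and yield unusable vectors, which is why choosing the right implementation class matters.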
