You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Yeah, unfortunately, Vietnamese models might be underrepresented in fastembed and Vietnamese is supported only by the multilingual models
It seems, that these particular models do not have as much of attention (~1-10k download per month), and we are trying to keep fastembed slim
Also, the authors of the models have not converted it to onnx
If you are willing to convert the models to onnx, you can use these models with custom model functionality we added in fastembed 0.6.0
If a model follows a typical preprocessing / postprocessing (just pooling / normalization) steps, it can be added to fastembed in runtime via .add_custom_model (example from readme)
Once you convert the model, you can upload it either from huggingface, or, if you want to keep it private, via specific_model_path param (which is a path to the directory where you model is stored on disk)
Thanks for the clarification! I understand the focus on keeping fastembed slim. I'll look into converting the models to ONNX and testing the custom model functionality in fastembed 0.6.0. Appreciate the insights!
Which model would you like to support?
https://huggingface.co/keepitreal/vietnamese-sbert
https://huggingface.co/itdainb/PhoRanker
What are the main advantages of this model?
I notice that fastembed does not support Vietnamese model, so I am so grateful if you add them to fastembed
The text was updated successfully, but these errors were encountered: