
feat: Add option to select ONNXRuntime Execution Providers for fastembed #1119

Open
Fannovel16 wants to merge 2 commits into main

Conversation

@Fannovel16 commented Oct 1, 2024

Related Issues

  • In some cases, fastembed does not use the GPU even though onnxruntime-gpu and fastembed-gpu are installed correctly

Proposed Changes:

  • Add an onnx_providers option to set ONNXRuntime Execution Providers manually (see the sketch below).
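
For illustration, here is a minimal sketch of how the option could be used with the FastembedTextEmbedder component; the onnx_providers parameter is the one proposed in this PR, so the exact name and behaviour may change in the final implementation:

from haystack_integrations.components.embedders.fastembed import FastembedTextEmbedder

# Proposed usage: pass ONNX Runtime execution providers explicitly so the GPU is
# used when available, instead of relying on the default provider selection.
embedder = FastembedTextEmbedder(
    model="BAAI/bge-small-en-v1.5",
    onnx_providers=["CUDAExecutionProvider", "CPUExecutionProvider"],
)
embedder.warm_up()
result = embedder.run(text="Sample sentence to embed.")
print(len(result["embedding"]))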

How did you test it?

  • WIP

Notes for the reviewer

Checklist

@Fannovel16 Fannovel16 requested a review from a team as a code owner October 1, 2024 12:16
@Fannovel16 Fannovel16 requested review from shadeMe and removed request for a team October 1, 2024 12:16
@CLAassistant commented Oct 1, 2024

CLA assistant check
All committers have signed the CLA.

github-actions bot added the integration:fastembed and type:documentation labels Oct 1, 2024
@anakin87 (Member) commented Oct 3, 2024

Just a quick thought: I don't think this would work, because this integration explicitly depends on fastembed.
GPU support requires fastembed-gpu, which cannot be installed in the same environment as fastembed (FastEmbed docs).

Maybe I'm wrong, but I think the whole integration needs to be reworked to support GPU. There may be several ways to do that...

Related: #905

@Fannovel16 (Author) commented Oct 4, 2024

> Just a quick thought: I don't think this would work, because this integration explicitly depends on fastembed. GPU support requires fastembed-gpu, which cannot be installed in the same environment as fastembed (FastEmbed docs).
>
> Maybe I'm wrong, but I think the whole integration needs to be reworked to support GPU. There may be several ways to do that...
>
> Related: #905

@anakin87 On Google Colab, I installed fastembed-gpu and the correct onnxruntime-gpu build, but it does not pick up the GPU by default, even when "CUDAExecutionProvider" is available. Explicitly stating the providers fixes that. I'm currently using this fork in my own project.

#@title Install library
!pip install git+https://github.com/deepset-ai/haystack "datasets>=2.6.1" "sentence-transformers>=3.0.0" markdown-it-py mdit_plain
!pip install langchain-text-splitters instructor

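# Install the fastembed and qdrant integrations from the fork that adds onnx_providers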
%cd /content
!git clone https://github.com/Fannovel16/haystack-core-integrations/
%cd /content/haystack-core-integrations/integrations/fastembed
!pip install .
%cd /content/haystack-core-integrations/integrations/qdrant
!pip install .
%cd /content

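# Install onnxruntime-gpu from the ONNX Runtime CUDA 12 package feed, then the GPU build of fastembed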
!pip install "onnxruntime-gpu<=1.17" -i https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/onnxruntime-cuda-12/pypi/simple/
!pip install fastembed-gpu
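
As a quick sanity check (standard onnxruntime and fastembed APIs, not code from this PR), you can confirm that the CUDA provider is visible and then pass it to fastembed explicitly:

import onnxruntime as ort
from fastembed import TextEmbedding

# "CUDAExecutionProvider" should appear here when onnxruntime-gpu is installed correctly.
print(ort.get_available_providers())

# fastembed may still not select it by default; passing providers explicitly forces it.
model = TextEmbedding(
    model_name="BAAI/bge-small-en-v1.5",
    providers=["CUDAExecutionProvider"],
)
print(len(next(model.embed(["test sentence"]))))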
