Add notebook for semantic reranking with HF Eland model #314

leemthompo · 2024-08-13T08:33:59Z

Notebook for semantic reranking using retriever + model uploaded via Eland

Test it in Colab

gitnotebooks · 2024-08-13T08:34:02Z

Found 1 changed notebook. Review the changes at https://gitnotebooks.com/elastic/elasticsearch-labs/pull/314

jeffvestal · 2024-08-13T14:14:23Z

Enabling telemetry step throws an error, referencing es_client before it is created. Should you be passing client ?

[<ipython-input-5-738307b21c1d>](https://localhost:8080/#) in <cell line: 4>()
      2 from telemetry import enable_telemetry
      3 
----> 4 es_client = enable_telemetry(es_client, "11-semantic-reranking-hugging-face")

NameError: name 'es_client' is not defined```

jeffvestal · 2024-08-13T14:26:39Z

eland_import_hub_model: error: argument --cloud-id: expected one argument
also an error with ES_API_KEY

should be

!eland_import_hub_model \
  --cloud-id $ELASTIC_CLOUD_ID \
  --es-api-key $ELASTIC_API_KEY \
  --hub-model-id cross-encoder/ms-marco-MiniLM-L-6-v2 \
  --task-type text_similarity \
  --clear-previous \
  --start

jeffvestal · 2024-08-13T14:34:51Z

I think you need to create an Inference endpoint after uploading the reranking model.
The last query in the notebook throws

  "error": {
    "root_cause": [],
    "type": "search_phase_execution_exception",
    "reason": "Computing updated ranks for results failed",
    "phase": "rank-feature",
    "grouped": true,
    "failed_shards": [],
    "caused_by": {
      "type": "resource_not_found_exception",
      "reason": "Inference endpoint not found [my-msmarco-minilm-model]"
    }
  },
  "status": 404
}```

demjened

Please add the missing step and check the other comments.

demjened · 2024-08-13T14:43:44Z

notebooks/search/11-semantic-reranking-hugging-face.ipynb

+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "\u001b[?25l   \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m0.0/480.2 kB\u001b[0m \u001b[31m?\u001b[0m eta \u001b[36m-:--:--\u001b[0m\r\u001b[2K   \u001b[91m━━━━━━━\u001b[0m\u001b[91m╸\u001b[0m\u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m92.2/480.2 kB\u001b[0m \u001b[31m4.0 MB/s\u001b[0m eta \u001b[36m0:00:01\u001b[0m\r\u001b[2K   \u001b[91m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m\u001b[90m╺\u001b[0m \u001b[32m471.0/480.2 kB\u001b[0m \u001b[31m7.5 MB/s\u001b[0m eta \u001b[36m0:00:01\u001b[0m\r\u001b[2K   \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m480.2/480.2 kB\u001b[0m \u001b[31m5.1 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",


From what I've seen in other notebooks, in general we omit unnecessary output and only display the output of stages that showcase the current feature (in this case the before/after search results). This makes the notebook cleaner and easier to follow.
You can do this by clearing all cell outputs in your notebook tool (in VSCode there is a button on top), and replaying only the steps you want to see outputs of.

100% I deleted what I think is noise, bu it can be helpful to have some outputs (like the eland step) for error prone steps so users can see what 'success' looks like

notebooks/search/11-semantic-reranking-hugging-face.ipynb

demjened · 2024-08-13T14:53:36Z

notebooks/search/11-semantic-reranking-hugging-face.ipynb

+    "                }\n",
+    "            },\n",
+    "            \"field\": \"plot\",\n",
+    "            \"inference_id\": \"my-msmarco-minilm-model\",\n",


I think we're missing a step in the notebook, the one that creates the rerank inference endpoint pointing to the uploaded MSMarco model.

demjened

LGTM. Not sure why the CI is failing, looks unrelated to this notebook.

Add notebook for semantic reranking with HF Eland model

b9419bc

leemthompo requested a review from demjened August 13, 2024 08:33

leemthompo self-assigned this Aug 13, 2024

Use env variables

17c2803

demjened requested changes Aug 13, 2024

View reviewed changes

Updates per review, add missing step

ab0fa4e

demjened self-requested a review August 13, 2024 21:12

demjened approved these changes Aug 13, 2024

View reviewed changes

leemthompo merged commit 2b6aba8 into elastic:main Aug 14, 2024
2 of 5 checks passed

leemthompo deleted the rerank-hugging-face-eland branch August 14, 2024 13:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add notebook for semantic reranking with HF Eland model #314

Add notebook for semantic reranking with HF Eland model #314

leemthompo commented Aug 13, 2024 •

edited

Loading

gitnotebooks bot commented Aug 13, 2024

jeffvestal commented Aug 13, 2024

jeffvestal commented Aug 13, 2024

jeffvestal commented Aug 13, 2024

demjened left a comment

demjened Aug 13, 2024

leemthompo Aug 13, 2024

demjened Aug 13, 2024

demjened left a comment

Add notebook for semantic reranking with HF Eland model #314

Add notebook for semantic reranking with HF Eland model #314

Conversation

leemthompo commented Aug 13, 2024 • edited Loading

Test it in Colab

gitnotebooks bot commented Aug 13, 2024

jeffvestal commented Aug 13, 2024

jeffvestal commented Aug 13, 2024

jeffvestal commented Aug 13, 2024

demjened left a comment

Choose a reason for hiding this comment

demjened Aug 13, 2024

Choose a reason for hiding this comment

leemthompo Aug 13, 2024

Choose a reason for hiding this comment

demjened Aug 13, 2024

Choose a reason for hiding this comment

demjened left a comment

Choose a reason for hiding this comment

leemthompo commented Aug 13, 2024 •

edited

Loading