Improve notebook #44

PhilippeMoussalli · 2023-12-18T14:56:28Z

No description provided.

PhilippeMoussalli · 2023-12-18T14:56:47Z

README.md

@@ -70,5 +70,5 @@ fondant --help

 There are two options to run the pipeline:

- [Via python files and the Fondant CLI](./src/README.md): how you should run Fondant in production


This did not point to anything

Seems like I deleted it in this PR.

It might make sense to re-add it just for the indexing pipeline. WDYT?
If not, I would still add a link for the CLI to the documentation, and keep the link to the notebook.

Might be best to go for the second approach since we don't have a file ready to launch the pipeline from (currently organized in a function that creates the pipeline). Updated

RobbeSneyders

Thanks @PhilippeMoussalli

Could you remove the notebook outputs from git by running this command?

git config filter.strip-notebook-output.clean 'jupyter nbconvert --ClearOutputPreprocessor.enabled=True --to=notebook --stdin --stdout --log-level=ERROR'

Github won't even show me the diffs because they are too large 😅

RobbeSneyders · 2024-01-03T12:26:57Z

README.md

@@ -70,5 +70,5 @@ fondant --help

 There are two options to run the pipeline:

- [Via python files and the Fondant CLI](./src/README.md): how you should run Fondant in production


Seems like I deleted it in this PR.

It might make sense to re-add it just for the indexing pipeline. WDYT?
If not, I would still add a link for the CLI to the documentation, and keep the link to the notebook.

RobbeSneyders · 2024-01-03T12:28:07Z

src/weaviate/docker-compose.yaml

@@ -3,7 +3,7 @@ services:
  weaviate:
    image: semitechnologies/weaviate:1.20.5
    ports:
-      - 8080:8080
+      - 8081:8080


Is there a reason for this change?

Port 8080 is occupied when using jupyter on vertex workbench

RobbeSneyders · 2024-01-03T13:48:19Z

src/evaluation.ipynb

Can't see the diff on Github, but from inspecting it locally, I think you inserted some images inline which doesn't work well. Can you add them as separate images to the repo and reference them by link in the notebook like we do for the other images?

I think I embedded them because for some reason they are not rendered properly neither in the IDE visualizer nor when you run them with the local jupyter notebook

This is an example from the parameter search notebook, I launched the notebook command from the src directory

I see. It only works when starting from the root directory indeed.

mrchtr · 2024-01-04T15:14:39Z

src/evaluation.ipynb

Can't add the comment on the line.
I think this line:
"evaluation_llm_kwargs": {"openai_api_key": os.environ["OPENAI_KEY"], model_name : "gpt-3.5-turbo"} should be changed into:
"evaluation_llm_kwargs": {"openai_api_key": os.environ["OPENAI_KEY"], "model_name" : "gpt-3.5-turbo"}.

Nice catch, updated

PhilippeMoussalli added 9 commits December 18, 2023 14:47

add installation link

8ed0840

remove non-existing links

c498e96

Utilize optional GPU resources

fcdfad0

Remove obsolete argument

95e531a

Add missing docs

df3b7bb

Enable image visualization in edit mode

34be11b

Reduce weaviate logs when pulling images

9b240a6

Reduce weaviate logs when pulling images

bc469a7

resolve weaviate host for default docker context on linux

39996df

PhilippeMoussalli commented Dec 18, 2023

View reviewed changes

add missing images to evaluation pipeline

603868d

PhilippeMoussalli force-pushed the improve-notebook branch from 7c17345 to 603868d Compare December 18, 2023 14:58

PhilippeMoussalli requested review from RobbeSneyders, mrchtr and Hakimovich99 December 18, 2023 14:58

PhilippeMoussalli added 3 commits December 19, 2023 10:40

Merge branch 'main' into improve-notebook

e2170fd

formatting

79824aa

add cluster type local to emebdding component

190cc15

PhilippeMoussalli mentioned this pull request Dec 21, 2023

Fix illegal memory access embedding ml6team/fondant#735

Closed

Merge branch 'main' into improve-notebook

f104c8d

RobbeSneyders reviewed Jan 3, 2024

View reviewed changes

PhilippeMoussalli added 4 commits January 3, 2024 13:45

address PR feedback

f008eed

clear cell output

e201ad3

actually clear cell output

00d3c19

precommit

5385435

RobbeSneyders reviewed Jan 3, 2024

View reviewed changes

RobbeSneyders approved these changes Jan 3, 2024

View reviewed changes

mrchtr reviewed Jan 4, 2024

View reviewed changes

update evaluation llm kwargs arg

9386311

PhilippeMoussalli merged commit 02259e5 into main Jan 8, 2024
1 check passed

PhilippeMoussalli deleted the improve-notebook branch January 8, 2024 09:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve notebook #44

Improve notebook #44

PhilippeMoussalli commented Dec 18, 2023

PhilippeMoussalli Dec 18, 2023

RobbeSneyders Jan 3, 2024

PhilippeMoussalli Jan 3, 2024

RobbeSneyders left a comment

RobbeSneyders Jan 3, 2024

RobbeSneyders Jan 3, 2024

PhilippeMoussalli Jan 3, 2024

RobbeSneyders Jan 3, 2024

PhilippeMoussalli Jan 3, 2024 •

edited

Loading

RobbeSneyders Jan 3, 2024

mrchtr Jan 4, 2024

PhilippeMoussalli Jan 8, 2024

		@@ -70,5 +70,5 @@ fondant --help

		There are two options to run the pipeline:

		- [Via python files and the Fondant CLI](./src/README.md): how you should run Fondant in production

Improve notebook #44

Improve notebook #44

Conversation

PhilippeMoussalli commented Dec 18, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

RobbeSneyders left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

PhilippeMoussalli Jan 3, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

PhilippeMoussalli Jan 3, 2024 •

edited

Loading