Automate running benchmarks for all engines #134

tellet-q · 2024-04-18T10:24:49Z

Solves #123

Run *-default experiment for each engine using random-100 or glove-25-angular dataset against single-node deployment. Note that OS and ES have dedicated single-node deployments with reduced memory to fit into default github runner.

The workflow triggers:

on manual run
on push to master
on pull-request creation

tellet-q · 2024-04-18T10:53:43Z

We can adjust the triggering on a job level by adding conditionals like this:

      !(
        startsWith(github.event.head_commit.modified, 'tests/') || 
        startsWith(github.event.head_commit.modified, 'scripts/') || 
        startsWith(github.event.head_commit.modified, 'monitoring/') ||
        contains(github.event.head_commit.modified, '.dockerignore') ||
        contains(github.event.head_commit.modified, '.gitignore') ||
        contains(github.event.head_commit.modified, '.pre-commit-config.yaml') ||
        contains(github.event.head_commit.modified, 'Dockerfile') ||
        contains(github.event.head_commit.modified, 'LICENSE') ||
        contains(github.event.head_commit.modified, 'README.md')
        contains(github.event.head_commit.modified, 'run_all_engines.sh')
        contains(github.event.head_commit.modified, 'sync_results.sh')
      )

This will NOT trigger the jobs if the changes ONLY include changes in the specified folders and files. For any other case the jobs will run. Unfortunately I'll have to configure each job like this, so it'll look a bit cumbersome.

KShivendu

Great work! 🙌

I hope this will also expose UI (in /actions) to manually pick only one particular engine/dataset, right?

.github/workflows/actions/run-engine-benchmark/action.yaml

tools/wait_for_green_status.sh

KShivendu · 2024-04-18T11:11:22Z

This will NOT trigger the jobs if the changes ONLY include changes in the specified folders and files. For any other case the jobs will run. Unfortunately I'll have to configure each job like this, so it'll look a bit cumbersome.

Interesting that we can do this.

@tellet-q Can we do something like this instead?

      (
        startsWith(github.event.head_commit.modified, 'engine/{clients,server}/*<engine-name>*') || 
        startsWith(github.event.head_commit.modified, 'engine/base_client/')
      )

Where <engine-name> will vary for each job (engine)

tellet-q · 2024-04-18T11:43:36Z

I hope this will also expose UI (in /actions) to manually pick only one particular engine/dataset, right?

Unfortunately, no. There are no changes in the UI.

tellet-q · 2024-04-18T11:58:58Z

@tellet-q Can we do something like this instead?
      (
        startsWith(github.event.head_commit.modified, 'engine/{clients,server}/*<engine-name>*') || 
        startsWith(github.event.head_commit.modified, 'engine/base_client/')
      )
Where <engine-name> will vary for each job (engine)

Not exactly like this, but similar, yes.

    if: >
      (
      startsWith(github.event.head_commit.modified, 'engine/clients/pgvector') ||
      startsWith(github.event.head_commit.modified, 'engine/servers/pgvector') ||
      startsWith(github.event.head_commit.modified, 'engine/base_client/')
      )

In this case the job will run ONLY if changes were made in specified folders.

KShivendu

If you want to try this, then please give it a shot. Otherwise, it lgtm anyways.

Let's see how this works when merged.

tellet-q added 11 commits April 18, 2024 12:18

ci: Run *-default benchmarks for all engines

571b10e

Update poetry.lock

6f5a087

Use random-100 dataset

3ff2be0

Introduce waits

f7a4739

Reduce mem in OS and ES

c734044

Use glove-25 for OS

cdf802c

Use 4Gb for OS

a2dfcb2

Avoid curl std output, use glove-25 for all engines

9eb3350

Revert glove-25 for all engines

7fca270

Add action.yaml

139b12b

Configure triggers

7a94bb3

tellet-q requested a review from KShivendu April 18, 2024 10:24

KShivendu requested changes Apr 18, 2024

View reviewed changes

.github/workflows/actions/run-engine-benchmark/action.yaml Outdated Show resolved Hide resolved

tools/wait_for_green_status.sh Show resolved Hide resolved

Address review

cd4432a

tellet-q requested a review from KShivendu April 18, 2024 12:01

KShivendu approved these changes Apr 18, 2024

View reviewed changes

tellet-q merged commit 455b590 into master Apr 18, 2024

tellet-q deleted the ci/benchmark-all-engines branch April 18, 2024 14:19

tellet-q mentioned this pull request Apr 18, 2024

Automate testing of PRs across different engines #123

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Automate running benchmarks for all engines #134

Automate running benchmarks for all engines #134

Uh oh!

tellet-q commented Apr 18, 2024

Uh oh!

tellet-q commented Apr 18, 2024

Uh oh!

KShivendu left a comment

Uh oh!

Uh oh!

Uh oh!

KShivendu commented Apr 18, 2024

Uh oh!

tellet-q commented Apr 18, 2024

Uh oh!

tellet-q commented Apr 18, 2024

Uh oh!

KShivendu left a comment •

edited

Loading

Uh oh!

Uh oh!

Automate running benchmarks for all engines #134

Automate running benchmarks for all engines #134

Uh oh!

Conversation

tellet-q commented Apr 18, 2024

Uh oh!

tellet-q commented Apr 18, 2024

Uh oh!

KShivendu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

KShivendu commented Apr 18, 2024

Uh oh!

tellet-q commented Apr 18, 2024

Uh oh!

tellet-q commented Apr 18, 2024

Uh oh!

KShivendu left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

KShivendu left a comment •

edited

Loading