
Temporarily update Dockerfile to run python/comps.py #237

Closed

jeancochrane wants to merge 8 commits
Conversation


@jeancochrane commented Apr 30, 2024

This PR is a companion to #236, intended to benchmark the current performance of the comps algorithm using numba. I don't plan to merge it; I'll close it once benchmarking is complete.

Findings

  • CUDA doesn't seem to make much of a difference, and is counterproductive if anything. This makes me wonder whether the algorithm needs to be redesigned to make better use of the GPU, but I'm considering that question out of scope for now.
  • There are big performance gains to be had by simply bumping the instance type with the existing numba code. If the numbers below hold, we could speed up the comps code by 2x by switching to c5.24xlarge instances. Those are about twice as expensive as the m4.10xlarge instances we use now, so we'd probably break even on the change.
  • At small scales (20k observations/10k comparisons), taichi appears to outperform numba, but this improvement disappears if we scale up the size of the data. At a large scale (100k observations/50k comparisons), they perform about the same.

20k observations, 10k comparisons

| framework | instance type | arch | time   | logs |
|-----------|---------------|------|--------|------|
| taichi    | g5.12xlarge   | x86  | 2.36s  | link |
| taichi    | g5.12xlarge   | CUDA | 4.33s  | link |
| taichi    | m4.10xlarge   | x86  | 4.44s  | link |
| numba     | g5.12xlarge   | x86  | 6.07s  | link |
| numba     | m4.10xlarge   | x86  | 10.52s | link |

100k observations, 50k comparisons

| framework | instance type | arch | time   | logs |
|-----------|---------------|------|--------|------|
| numba     | g5.12xlarge   | x86  | 31.87s | link |
| taichi    | c5.24xlarge   | x86  | 31.93s | link |
| taichi    | m4.10xlarge   | x86  | 34.09s | link |
| numba     | c5.24xlarge   | x86  | 37.31s | link |
| taichi    | g5.12xlarge   | x86  | 37.75s | link |
| taichi    | g5.12xlarge   | CUDA | 43.58s | link |
| numba     | m4.10xlarge   | x86  | 64.19s | link |

@jeancochrane

Closing, see #236 for full results.
