[Docker] Update DGL version to 2.3 and torch to 2.3 #883

jalencato · 2024-06-18T01:01:32Z

Issue #, if available:

Description of changes:

Fix the dependency versions in the local Docker container.

Torch 2.0+ does not support numpy >= 2.0 as we are using numpy.int64 in infer type.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

thvasilo · 2024-06-18T18:51:48Z

What's the error we're getting here? Was numpy 2.x just released?

jalencato · 2024-06-18T19:03:06Z

> What's the error we're getting here? Was numpy 2.x just released?

Currently our Docker container installs the wrong versions of some sub-dependencies, such as pyarrow and numpy. The default numpy version in the container is now 2.0, which raises this warning:

A module that was compiled using NumPy 1.x cannot be run in NumPy 2.0.0 as it may crash. To support both 1.x and 2.x versions of NumPy, modules must be compiled with NumPy 2.0. Some module may need to rebuild instead e.g. with 'pybind11>=2.12'. If you are a user of the module, the easiest solution will be to downgrade to 'numpy<2' or try to upgrade the affected module. We expect that some modules will need time to support NumPy 2.
This happens when importing dgl.

Also see issue #884: starting from 2.2, dgl stores its dependencies in a new location, https://data.dgl.ai/wheels/torch-${TORCH_MAJOR_MINOR}/cu${DGL_CUDA_VERSION}/repo.html; previously it was https://data.dgl.ai/wheels/cu${DGL_CUDA_VERSION}/repo.html. That is a separate fix.
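To make the URL change concrete, here is a minimal sketch that picks the wheel index URL from the DGL version. The function name `dgl_wheel_index_url` and the example CUDA strings (`121`, `118`) are assumptions for illustration; only the two URL patterns and the 2.2 cutoff come from the comment above.

```python
def dgl_wheel_index_url(dgl_version: str, torch_major_minor: str, cuda_version: str) -> str:
    """Return the DGL wheel index URL for a given DGL version (illustrative helper)."""
    # Compare only the major and minor components, e.g. "2.3.0" -> (2, 3).
    major, minor = (int(part) for part in dgl_version.split(".")[:2])
    if (major, minor) >= (2, 2):
        # Newer layout: the index is keyed by torch version as well as CUDA version.
        return f"https://data.dgl.ai/wheels/torch-{torch_major_minor}/cu{cuda_version}/repo.html"
    # Older layout: the index is keyed by CUDA version only.
    return f"https://data.dgl.ai/wheels/cu{cuda_version}/repo.html"


if __name__ == "__main__":
    print(dgl_wheel_index_url("2.3.0", "2.3", "121"))
    print(dgl_wheel_index_url("1.1.3", "2.1", "118"))
```

In a Dockerfile the same branching would typically be driven by build arguments; the Python form is only meant to show the two layouts side by side.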

This PR is currently on hold while I wait for the regression performance results. I want to make sure performance is good before asking for review.

jalencato

Update:

The bug only happens with torch versions < 2.3, so we may not need to change the Dockerfile now and can just leave a comment about it.

thvasilo · 2024-06-29T10:37:20Z

We should try to pin numpy to version 1.26.4 then, to avoid such issues. Generally, we want all the direct dependencies of GSF to be pinned (and perhaps all direct dependencies of DGL), and then pin others too if we run into issues.
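One way to verify pins inside a built image is a small check against the installed package metadata. This is a minimal sketch: the `PINS` mapping and the function name are assumptions for illustration, and only the numpy 1.26.4 pin comes from this comment.

```python
from importlib import metadata

# Hypothetical pin set; only the numpy pin is taken from this thread.
PINS = {"numpy": "1.26.4"}


def find_pin_mismatches(pins, get_version=metadata.version):
    """Return {package: (expected, installed)} for every package that deviates from its pin."""
    mismatches = {}
    for name, expected in pins.items():
        try:
            installed = get_version(name)
        except metadata.PackageNotFoundError:
            # Package is not installed at all.
            installed = None
        if installed != expected:
            mismatches[name] = (expected, installed)
    return mismatches


if __name__ == "__main__":
    for name, (expected, installed) in find_pin_mismatches(PINS).items():
        print(f"{name}: expected {expected}, found {installed}")
```

The `get_version` parameter exists only so the check can be exercised without the packages installed; the default reads the real environment through `importlib.metadata.version`.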

In the future we can look into generating requirements files from a pyproject.toml, either using poetry as we do in GSProcessing, or by taking a look at https://github.com/astral-sh/uv

jalencato added 2 commits June 17, 2024 17:59

Update Dockerfile.local

0939b41

Update Dockerfile.local

e943c7d

jalencato commented Jun 19, 2024

Update Dockerfile.local

e8dbe65

classicsong approved these changes Jun 23, 2024

Update Dockerfile.local

47a03e2

jalencato marked this pull request as ready for review June 25, 2024 19:59

jalencato added 3 commits June 25, 2024 17:23

Merge branch 'main' into bug_fix_docker_container

c1cead2

Merge branch 'main' into bug_fix_docker_container

cd2424e

Update Dockerfile.local

1ea9804

jalencato changed the title ~~[WIP] [Docker Bug] Update local dockerfile~~ [Docker] Update DGL version to 2.3 and torch to 2.3 Jun 28, 2024

Merge branch 'main' into bug_fix_docker_container

d22a4cc

jalencato merged commit 224bf1b into awslabs:main Jul 10, 2024
7 checks passed
