GatewayCluster with kubernetes backend fails to start after update to daskhub v4.5.4 #348

Closed
@bolliger32

What happened:
I received the following error: GatewayClusterError: Cluster 'adrastea.b4286778ea9b49f4b4264f982f5b278d' failed to start, see logs for more information. The logs suggest that the dask-scheduler command is missing an argument after --host, which looks intentional based on this code. Here are the logs:
[image: screenshot of the logs]
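One possible reading of "missing an argument after --host": when a command line is printed in logs, an empty-string argument looks the same as no argument at all. The sketch below uses Python's argparse as a stand-in to show the difference between the two cases. It is not the real dask-scheduler parser, and the names `_RaisingParser`, `parse_host`, and `scheduler-standin` are hypothetical. Whether this distinction applies to the failure reported here is an assumption.

```python
import argparse


class _RaisingParser(argparse.ArgumentParser):
    """Hypothetical helper: raise ValueError instead of exiting on a parse error."""

    def error(self, message):
        raise ValueError(message)


def parse_host(argv):
    """Parse a stand-in subset of scheduler-style options and return --host."""
    parser = _RaisingParser(prog="scheduler-standin")
    parser.add_argument("--host")
    parser.add_argument("--port", type=int, default=8786)
    return parser.parse_args(argv).host


# An explicit empty string is a valid value: --host is set to "".
host = parse_host(["--host", "", "--port", "8786"])
print(repr(host))

# A bare --host followed directly by another option is a parse error.
try:
    parse_host(["--host", "--port", "8786"])
except ValueError as exc:
    print("parse error:", exc)
```

If the empty value is lost somewhere between the code that builds the command and the process that runs it (for example through shell re-quoting), the first case turns into the second. That would be one thing to check when comparing the logged command with the code that generates it.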

This occurred after updating the daskhub chart from v4.5.3 to v4.5.4. Several other issues related to jupyterhub (primarily authentication) also came up during the update. I eventually worked my way through those and ultimately deleted and recreated our GKE cluster, which fixed them, but this one remains.

What you expected to happen:
A working GatewayCluster object to be returned from the gateway.new_cluster() call

Minimal Complete Verifiable Example:
I'd imagine a lot of the reproducibility depends on our specific GKE infrastructure and chart config, but the actual code that raises this bug is just

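import dask_gateway

# Gateway is the client for the gateway server; new_cluster() is a Gateway method, not a GatewayCluster one
gateway = dask_gateway.Gateway()
cluster = gateway.new_cluster()  # raises GatewayClusterError in this deployment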

Anything else we need to know?:

Environment:

  • GKE cluster
  • daskhub chart version: 4.5.4
  • image: custom image running the following
    • Dask version: 2.30.0
    • Python version: 3.8.6
    • Operating System: ubuntu
    • Install method (conda, pip, source): conda
