fix emr serverless python deps sample dockerfile #255

Merged
merged 10 commits into from
Oct 6, 2024


HarshCasper
Member

No description provided.

@cloutierMat
Contributor

@HarshCasper I am not sure where the issue came from, but it appears that poetry was no longer creating a new virtual env at the designated path, which led to the first error you encountered, "/app/dist/bundle": not found:

ERROR: failed to solve: failed to compute cache key: failed to calculate checksum of ref d5f73407-3555-412c-8048-1533588ab65c::feh1wta9so3q7b54u9f8exfxd: "/app/dist/bundle": not found

The first fix you proposed created a virtual env properly but didn't include the jobs/ module, which is configured in the pyproject.toml as packages = [{include = "jobs"}]. The excerpt below from the logs shows that the module was not found.

    from jobs.spark_run import SparkRun\nModuleNotFoundError: No module named \'jobs\'\n24/09/05
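A missing package like this can be caught before the job is submitted. The snippet below is a minimal sketch, not part of this PR: it assumes it is run with the Python interpreter of the bundled virtual env, and the helper name missing_modules is hypothetical. It reports which of the given module names cannot be found on the interpreter's import path.

```python
import importlib.util


def missing_modules(names):
    """Return the module names from `names` that cannot be found on sys.path.

    Hypothetical helper for illustration; it only locates modules and does not
    execute the target module itself (parent packages of dotted names are
    imported by find_spec).
    """
    missing = []
    for name in names:
        try:
            spec = importlib.util.find_spec(name)
        except (ImportError, ValueError):
            # find_spec raises if a parent package of a dotted name is missing
            # or if an already-imported module has no usable __spec__.
            spec = None
        if spec is None:
            missing.append(name)
    return missing


# Example call, using the package name from the pyproject.toml line above:
#     missing_modules(["jobs"])
```

Running such a check as a late build step (an assumption about where it would fit, not something the sample does) would surface the problem at image build time instead of in the job logs.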

Changes

  • I added --clear to the bundle command, as per the bundle documentation
  • Pinned bundle to 1.4.1. I am still not sure the version was the issue, but pinning it seems safer!

I kept your extra logs, as they seem like a good debugging tool for the future 😄

@HarshCasper HarshCasper merged commit 636c092 into master Oct 6, 2024
1 check passed
@HarshCasper HarshCasper deleted the fix-emr-serverless-python-deps branch October 6, 2024 07:11