Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Distributed execution PoC #571

Open
12 tasks
Tracked by #549
RobbeSneyders opened this issue Oct 30, 2023 · 0 comments
Open
12 tasks
Tracked by #549

Distributed execution PoC #571

RobbeSneyders opened this issue Oct 30, 2023 · 0 comments
Assignees
Labels
Core Core framework enhancement New feature or request Infrastructure Infrastructure and deployment

Comments

@RobbeSneyders
Copy link
Member

RobbeSneyders commented Oct 30, 2023

In order to allow distributed execution (run one component on multiple machines) with fondant we need to test and evaluate the current options.

A general solution that works for all orchestrators (current and future) seems not possible so we will need to tackle them one by one.

There seem to be dask examples (see epic) for all of them which might just be the cleanest solution where we create dask clusters dynamically where we can run the component code in parallel.

Vertex:

CustomContainerTrainingJob (1,2)

Tasks

Preview Give feedback

Docker-compose:

Docker swarm with multiple hosts and as dask cluster

Sagemaker:

BLOCKED NEEDS RUNNER (see issue), we have no sagemaker integration yet so the following is subject to change

Tasks

Preview Give feedback
@RobbeSneyders RobbeSneyders moved this from Backlog to Breakdown in Fondant development Oct 30, 2023
@GeorgesLorre GeorgesLorre added enhancement New feature or request Core Core framework Infrastructure Infrastructure and deployment labels Oct 30, 2023
@GeorgesLorre GeorgesLorre self-assigned this Oct 30, 2023
@RobbeSneyders RobbeSneyders moved this from Breakdown to Backlog in Fondant development Jan 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Core Core framework enhancement New feature or request Infrastructure Infrastructure and deployment
Projects
Status: Backlog
Development

No branches or pull requests

2 participants