Separate executor configuration from computation #389

Closed
tomwhite opened this issue Feb 20, 2024 · 2 comments · Fixed by #395
@tomwhite
Member

As the examples are currently written, executor configuration is mixed up with the computation itself. For example, notice how the executor and its parameters (runtime, runtime_memory) appear at both the beginning and end of this example:

    import sys

    import cubed
    import cubed.array_api as xp
    from cubed.runtime.executors.lithops import LithopsDagExecutor

    tmp_path = sys.argv[1]
    runtime = sys.argv[2]
    spec = cubed.Spec(tmp_path, allowed_mem=100000)
    executor = LithopsDagExecutor()
    a = xp.asarray(
        [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]],
        chunks=(2, 2),
        spec=spec,
    )
    b = xp.asarray(
        [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]],
        chunks=(2, 2),
        spec=spec,
    )
    c = xp.add(a, b)
    res = c.compute(executor=executor, runtime=runtime, runtime_memory=2000)
    print(res)

We could improve the separation by setting the executor on the spec, so everything is set up in one go, like this:

    tmp_path = sys.argv[1]
    runtime = sys.argv[2]
    executor = LithopsDagExecutor(runtime=runtime, runtime_memory=2000)
    spec = cubed.Spec(tmp_path, allowed_mem=100000, executor=executor)
    a = xp.asarray(
        [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]],
        chunks=(2, 2),
        spec=spec,
    )
    b = xp.asarray(
        [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]],
        chunks=(2, 2),
        spec=spec,
    )
    c = xp.add(a, b)
    res = c.compute()
    print(res)

This is better, but it still means that every example is duplicated for every executor (Lithops AWS Lambda, Lithops GCF, Modal AWS, Modal GCP, Coiled, Dataflow, etc). While some duplication is OK for examples, it does feel excessive.

To improve this further, we could use something like donfig to allow the spec to be read from a config file, such as this one:

    spec:
      work_dir: "s3://cubed-$USER-temp"
      allowed_mem: "2GB"
      executor_name: "lithops"
      executor_options:
        runtime: "cubed-runtime"
        runtime_memory: 2000

Then the example would look like this (note that the spec object disappears, and is automatically picked up from config instead):

    a = xp.asarray(
        [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]],
        chunks=(2, 2),
    )
    b = xp.asarray(
        [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]],
        chunks=(2, 2),
    )
    c = xp.add(a, b)
    res = c.compute()
    print(res)

It is run as follows, assuming the config file is in the lithops/aws-lambda directory:

    CUBED_CONFIG=$(pwd)/lithops/aws-lambda python add-asarray.py

Note that the existing way of using a spec object programmatically would still work; this is just another way to configure things.

(It would also make it possible to implement #310 by using a config context manager to set the executor to one that raises if compute is called.)

Any thoughts @TomNicholas?

@TomNicholas
Member

Interesting. I'm not convinced we should completely fold the executor into the spec when they represent different choices (runtime vs storage layer + config)... But then again I guess if the "spec" means just "all configuration needed to run any given workload" then it would make sense.

> This is better, but it still means that every example is duplicated for every executor (Lithops AWS Lambda, Lithops GCF, Modal AWS, Modal GCP, Coiled, Dataflow, etc). While some duplication is OK for examples, it does feel excessive.

You're talking about a documentation issue here now right? I agree it feels silly to have the same workload examples documented twice for two executors. It also makes it much more likely that something in the docs gets out of date without us noticing.

> To improve this further we could use something like donfig to allow the spec to be read from a config file

I like the idea of a config file to set the defaults - I expect users will in practice set this up once and then never touch it again. Which is exactly what we want - them never to have to worry about configuration once they start doing science work.

We should just think about reproducibility / clarity if the options are being read from another file rather than from within the notebook. Maybe all options should be printed once the computation begins?
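The echo-the-options idea could be as simple as the following sketch; the values and the compute_with_echo name are hypothetical, mirroring the config file shown earlier in the thread:

```python
# Illustrative sketch, not cubed's actual behaviour: print the resolved
# options at the start of a computation, so a notebook run records which
# config was in effect even when it came from an external file.
config = {
    "work_dir": "s3://cubed-temp",
    "allowed_mem": "2GB",
    "executor_name": "lithops",
    "executor_options": {"runtime": "cubed-runtime", "runtime_memory": 2000},
}

def compute_with_echo(plan, config):
    # Print every option in a stable, sorted order before running,
    # so runs are easy to diff for reproducibility.
    for key in sorted(config):
        print(f"{key}: {config[key]}")
    return f"computed {plan}"

result = compute_with_echo("add", config)
```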

@tomwhite
Member Author

> Interesting. I'm not convinced we should completely fold the executor into the spec when they represent different choices (runtime vs storage layer + config)... But then again I guess if the "spec" means just "all configuration needed to run any given workload" then it would make sense.

That's how I think about "spec". Having the ability to pass runtime parameters at the time compute is called may give extra flexibility, but it's not normally needed.

> You're talking about a documentation issue here now right? I agree it feels silly to have the same workload examples documented twice for two executors. It also makes it much more likely that something in the docs gets out of date without us noticing.

Yes, quite.

> I like the idea of a config file to set the defaults - I expect users will in practice set this up once and then never touch it again. Which is exactly what we want - them never to have to worry about configuration once they start doing science work.

💯

> We should just think about reproducibility / clarity if the options are being read from another file rather than from within the notebook. Maybe all options should be printed once the computation begins?

That's a good idea.
