This might be a trivial one, but it's important from a usability perspective.
PjRt executables expect a slice of buffers or literals as input to run on. However, if you have a model with dozens of parameter tensors, organizing them all into a slice manually becomes tedious (this is already visible in the mnist_xla example). So I think our abstract model API should basically be callable on any user-defined structure implementing Into<Vec<Literal>> or Into<Vec<PjRtBuffer>>. The gradient engine should also return gradients in that same structure if the parameter struct implements From<Vec<T>>.
This is basically the equivalent of JAX tree flattening/unflattening.
I don't think we have to do anything very thoughtful for this one: just write the abstract type signatures and call into and from appropriately.
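A minimal sketch of what this could look like. Everything here is hypothetical: `Literal` is a local stand-in for the real `xla::Literal` type, and `MlpParams` / `run_model` are made-up names illustrating the flatten/unflatten round trip, not an actual API.

```rust
// Stand-in for the real xla::Literal type (assumption, not the crate's API).
#[derive(Debug, Clone, PartialEq)]
struct Literal(Vec<f32>);

// Hypothetical user-defined parameter struct for a small two-layer model.
struct MlpParams {
    w1: Literal,
    b1: Literal,
    w2: Literal,
    b2: Literal,
}

// Flattening: the model API can now accept MlpParams directly,
// since Into<Vec<Literal>> is derived from this From impl.
impl From<MlpParams> for Vec<Literal> {
    fn from(p: MlpParams) -> Self {
        vec![p.w1, p.b1, p.w2, p.b2]
    }
}

// Unflattening: the gradient engine can hand gradients back
// in the same structure as the parameters.
impl From<Vec<Literal>> for MlpParams {
    fn from(mut v: Vec<Literal>) -> Self {
        assert_eq!(v.len(), 4, "expected exactly 4 parameter tensors");
        let b2 = v.pop().unwrap();
        let w2 = v.pop().unwrap();
        let b1 = v.pop().unwrap();
        let w1 = v.pop().unwrap();
        MlpParams { w1, b1, w2, b2 }
    }
}

// Sketch of a model API generic over any flattenable/unflattenable
// parameter structure; the real version would call the PjRt executable
// on the flattened slice instead of passing it through.
fn run_model<P>(params: P) -> P
where
    P: Into<Vec<Literal>> + From<Vec<Literal>>,
{
    let flat: Vec<Literal> = params.into(); // what the executable sees
    // ... executable.run(&flat) would go here ...
    P::from(flat) // e.g. gradients, returned in the original structure
}

fn main() {
    let params = MlpParams {
        w1: Literal(vec![1.0]),
        b1: Literal(vec![2.0]),
        w2: Literal(vec![3.0]),
        b2: Literal(vec![4.0]),
    };
    let round_trip = run_model(params);
    assert_eq!(round_trip.w2, Literal(vec![3.0]));
    println!("round-trip ok");
}
```

The two `From` impls are the whole contract: users never touch the flat `Vec`, and the ordering is defined in exactly one place per struct.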