We should have utility functions for constructing dense and convolutional layers (and eventually more complex layers such as LSTM or multi-head attention). Each should take a context, input node identifiers, and initialization instructions, and return node identifiers for the layer output and its parameters.
A basic example of this is visible in the mnist_xla example.
Initializers should use XLA RNGs.
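To make the proposed shape concrete, here is a minimal sketch of what such a utility might look like. Everything here is hypothetical: the `Context` class, the `dense` helper, and the op names are illustrative stand-ins, not this project's actual API, and the initializer uses a host RNG where a real implementation would emit XLA RNG ops.

```python
# Hypothetical sketch of a layer-construction utility.
# All names (Context, dense, op strings) are illustrative, not the real API.
import math
import random

class Context:
    """Toy graph context: each added node gets an integer identifier."""
    def __init__(self):
        self._next = 0
        self.nodes = {}

    def add(self, op, *inputs, **attrs):
        nid = self._next
        self._next += 1
        self.nodes[nid] = (op, inputs, attrs)
        return nid

def dense(ctx, x, in_dim, out_dim, rng_seed=0):
    """Build a dense layer: returns (output_id, [param_ids]).

    Uses a Glorot-uniform bound for the weight initializer; a real
    implementation would emit XLA RNG ops instead of a host RNG.
    """
    bound = math.sqrt(6.0 / (in_dim + out_dim))
    rng = random.Random(rng_seed)
    w = ctx.add("parameter", shape=(in_dim, out_dim),
                init=[rng.uniform(-bound, bound)
                      for _ in range(in_dim * out_dim)])
    b = ctx.add("parameter", shape=(out_dim,), init=[0.0] * out_dim)
    y = ctx.add("add", ctx.add("matmul", x, w), b)
    return y, [w, b]

ctx = Context()
x = ctx.add("input", shape=(1, 784))
out, params = dense(ctx, x, 784, 128)
```

The key design point from the issue is the return contract: the caller gets the output node identifier plus the parameter node identifiers, so training code can locate the parameters without the helper owning any state.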
As for construction, the main thing missing is convolutions; this should be easy to finish up. The other piece is incorporating XLA RNGs, which is a bit more math-heavy but still doable.
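The "math-heavy" part of using XLA RNGs is that XLA's bit generators (e.g. ThreeFry/Philox) produce raw uint32 streams, so an initializer has to map those bits onto the target distribution itself. A minimal sketch of that mapping, under the assumption of a uint32 bit stream (the function name and the host-side arithmetic are illustrative, not the actual implementation):

```python
# Hedged sketch: turning raw RNG bits, as XLA's bit generators emit them,
# into uniform floats for a Glorot-style initializer. Names are illustrative.
import math

def bits_to_uniform(u32, lo, hi):
    """Map a uint32 to a float uniform in [lo, hi).

    Standard trick: keep the top 23 bits as a mantissa, scale into
    [0, 1), then shift and stretch into [lo, hi).
    """
    mantissa = u32 >> 9                      # top 23 bits of the word
    unit = mantissa / float(1 << 23)         # uniform in [0, 1)
    return lo + unit * (hi - lo)

# Glorot-uniform bound for a 784 -> 128 dense layer
bound = math.sqrt(6.0 / (784 + 128))
sample = bits_to_uniform(0xDEADBEEF, -bound, bound)
```

In a real graph these operations would be emitted as XLA nodes downstream of the bit-generator op rather than computed on the host, but the arithmetic is the same.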