This is feedback from trying to implement gru/lstm on CoreML, prompted by #689.
The biases and weights are stacked together for the forward and backward directions when the direction is bidirectional; similarly, the activations are passed as an array instead of as distinct parameters.
I think it's more explicit and cleaner to follow CoreML's design, which:
- passes the weights and biases for each direction separately when it's bidirectional, and
- passes the activations separately as recurrent_activation, cell_activation, and activation.

What do you think? (A rough sketch comparing the two shapes follows below.)
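For concreteness, here is a sketch of the two shapes, not proposed IDL. The first call follows the current bidirectional layout; the option names in the second call (backwardWeight, forwardBias, recurrentActivation, etc.) are made up purely for illustration of the CoreML-style alternative.

```js
// Today (bidirectional lstm): forward and backward are stacked along a leading
// numDirections dimension, and the three activations arrive as one array.
// weight:               [2, 4 * hiddenSize, inputSize]
// recurrentWeight:      [2, 4 * hiddenSize, hiddenSize]
// bias / recurrentBias: [2, 4 * hiddenSize]
const outputs = builder.lstm(input, weight, recurrentWeight, steps, hiddenSize, {
  bias, recurrentBias,
  direction: 'both',
  activations: [builder.sigmoid(), builder.tanh(), builder.tanh()],
});

// Hypothetical CoreML-style alternative: one operand per direction (no leading
// numDirections dimension) and one named option per activation.
const outputs2 = builder.lstm(input, forwardWeight, forwardRecurrentWeight, steps, hiddenSize, {
  backwardWeight, backwardRecurrentWeight,
  forwardBias, backwardBias, forwardRecurrentBias, backwardRecurrentBias,
  direction: 'both',
  recurrentActivation: builder.sigmoid(),
  cellActivation: builder.tanh(),
  activation: builder.tanh(),
});
```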
This also helps unblock the lstm/gru implementation on CoreML from depending on the outcome of the MLConstantOperand discussion.
Thank you for the implementation findings. Let me digest those fields again (ironically, I have a short-term memory when it comes to LSTM's details 😅).
No, it would actually do the opposite for the DML backend. If the weights/biases are passed separately for each direction, then the DML backend has to do another concatenation to combine the forward and backward weights/biases. (The previous concatenation combines the bias and the recurrent bias.) It looks like CoreML prefers separate operands for each direction while DML prefers them as a whole...
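To illustrate the DML-side concern, here is a sketch only (the real work happens in the backend's C++, and the per-direction operand names are hypothetical): with per-direction operands, the backend would have to rebuild the stacked layout itself, on top of the bias + recurrent-bias concatenation it already does.

```js
// Assumed per-direction shapes (illustrative):
//   forwardWeight, backwardWeight: [4 * hiddenSize, inputSize]
// The DML path consumes them stacked as [2, 4 * hiddenSize, inputSize], so it
// would have to reshape each to [1, 4 * hiddenSize, inputSize] and concatenate
// along axis 0 -- in addition to the existing bias/recurrentBias concatenation.
const stackedWeight = builder.concat(
  [builder.reshape(forwardWeight, [1, 4 * hiddenSize, inputSize]),
   builder.reshape(backwardWeight, [1, 4 * hiddenSize, inputSize])],
  0);
```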
Separate weights and biases might also help simplify the emulation code, e.g. the Chromium TFLite backend, which currently needs to slice each tensor out of the combined one before using it.
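Conversely, this is the kind of per-direction slicing the stacked layout forces on an emulation path today. Again just a sketch of the shape arithmetic, assuming the usual [numDirections, 4 * hiddenSize, ...] layout; the Chromium TFLite backend does the equivalent in C++.

```js
// Stacked input: weight is [2, 4 * hiddenSize, inputSize].
// Pull out one [4 * hiddenSize, inputSize] tensor per direction.
const forwardWeight = builder.reshape(
  builder.slice(weight, [0, 0, 0], [1, 4 * hiddenSize, inputSize]),
  [4 * hiddenSize, inputSize]);
const backwardWeight = builder.reshape(
  builder.slice(weight, [1, 0, 0], [1, 4 * hiddenSize, inputSize]),
  [4 * hiddenSize, inputSize]);
// With per-direction operands in the API, these slices and reshapes go away.
```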
Hi @huningxin @fdwr, what do you think about this now? It would unblock the CoreML implementation of lstm and gru by avoiding the need for constant folding or decomposition.