Perform a reduce sum operation on tokens from different experts and then reorder the tokens to their original positions
Number of selected experts for each token.
Input feature before invert permutation and reduce sum.
Shape:
Expert weights of each token.
Shape:
The indices of invert permutation: mapping from permuted token index to origin token index.
Shape:
Output feature after invert permutation and reduce sum.
Shape: