[WebGPU EP] SoftMax Implementation #23538

vraspar · 2025-01-30T00:35:42Z

Increase coverage for WebGPU Op

…n provider

vraspar · 2025-01-30T00:39:03Z

onnxruntime/core/providers/webgpu/shader_variable.h

@@ -176,16 +176,18 @@ class ShaderVariableHelper : public ShaderIndicesHelper {
  template <typename TOffset>
  inline std::string GetByOffset(TOffset&& offset) const;

+  std::string_view StorageType() const;


I found them making these methods public easiest way to get tensor data types info when generating shader code. I am not sure if this is the best way to do this

github-actions

You can commit the suggested changes from lintrunner.

onnxruntime/core/providers/webgpu/shader_variable.h

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

fs-eire · 2025-01-30T20:09:04Z

onnxruntime/test/providers/cpu/math/softmax_test.cc

-           kNnapiExecutionProvider,  // NNAPI softmax does not support empty input
-           kQnnExecutionProvider}    // QNN doesn't support dim 0
+           kNnapiExecutionProvider,   // NNAPI softmax does not support empty input
+           kWebGpuExecutionProvider,  // WebGPU does not dim 0


Suggested change

kWebGpuExecutionProvider, // WebGPU does not dim 0

kWebGpuExecutionProvider, // WebGPU does not support dim 0

fs-eire · 2025-01-30T20:14:20Z

onnxruntime/core/providers/webgpu/tensor/transpose.cc

+             : program.SetDispatchGroupSize((output_size + WORKGROUP_SIZE - 1) / WORKGROUP_SIZE);
+  return context.RunProgram(program);
+}
+
 Status Transpose::ComputeInternal(ComputeContext& context) const {


This function should call DoTranspose() instead of duplicating the code.

fs-eire · 2025-01-30T20:15:41Z

onnxruntime/core/providers/webgpu/tensor/transpose.cc

@@ -97,6 +97,59 @@ Status TransposeProgram::GenerateShaderCode(ShaderHelper& shader) const {
  return Status::OK();
 }

+Status Transpose::DoTranspose(onnxruntime::webgpu::ComputeContext& context, const gsl::span<const size_t>& permutations, const Tensor& input, Tensor& output) {


always pass span as value instead of const reference.

fs-eire · 2025-01-30T20:38:29Z

onnxruntime/core/providers/webgpu/math/softmax.cc

+  shader.AddOutput("result", ShaderUsage::UseUniform | ShaderUsage::UseIndicesTypeAlias);
+  int components = input.NumComponents();
+
+  std::string threadMaxDecl = input.ElementType() == "f32" ? "var threadMax = x_value_t(-3.402823e+38f);\n" : "var threadMax = x_value_t(-65504.0h);\n";


It is not a good idea to rely on the return value of ShaderVariableHelper::ElementType.

The design of the shader helper classes (ShaderHelper, ShaderVariableHelper and ShaderIndicesHelper) uses an internal variable usage_ to track whether one or more certain flags are activated for a variable/indices to determine the final generated shader code. Functions like ShaderVariableHelper::ElementType are designed as internal methods that are only for the usage of generating shader code. Making them public will break the design assumption and is error-prone.

If you want to get the data type of a specific input, you can simply check Inputs()[0].var_type.

use const instead of var

fs-eire · 2025-01-30T20:53:33Z

onnxruntime/core/providers/webgpu/math/softmax.cc

+
+  // Define shared memory for row max and row sum
+  shader.AdditionalImplementation()
+      << "var<workgroup> rowMaxShared : x_value_t;\n"


use snake_case for variables and user defined functions in WGSL shader code

vraspar and others added 3 commits January 20, 2025 17:26

Add Softmax kernel and transpose utility function for WebGPU executio…

e7e3737

…n provider

Refactor Softmax implementation for WebGPU

f9b61db

Refactor Softmax and remove debug logs

87de607

vraspar commented Jan 30, 2025

View reviewed changes

vraspar added the ep:WebGPU ort-web webgpu provider label Jan 30, 2025

github-actions bot reviewed Jan 30, 2025

View reviewed changes

onnxruntime/core/providers/webgpu/shader_variable.h Outdated Show resolved Hide resolved

fix linting error

2d8b47d

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

vraspar requested review from skottmckay and fs-eire January 30, 2025 19:50

fs-eire reviewed Jan 30, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WebGPU EP] SoftMax Implementation #23538

[WebGPU EP] SoftMax Implementation #23538

vraspar commented Jan 30, 2025 •

edited

Loading

vraspar Jan 30, 2025

github-actions bot left a comment

fs-eire Jan 30, 2025

fs-eire Jan 30, 2025

fs-eire Jan 30, 2025

fs-eire Jan 30, 2025

fs-eire Jan 30, 2025

fs-eire Jan 30, 2025

	kWebGpuExecutionProvider, // WebGPU does not dim 0
	kWebGpuExecutionProvider, // WebGPU does not support dim 0

[WebGPU EP] SoftMax Implementation #23538

Are you sure you want to change the base?

[WebGPU EP] SoftMax Implementation #23538

Conversation

vraspar commented Jan 30, 2025 • edited Loading

vraspar Jan 30, 2025

Choose a reason for hiding this comment

github-actions bot left a comment

Choose a reason for hiding this comment

fs-eire Jan 30, 2025

Choose a reason for hiding this comment

fs-eire Jan 30, 2025

Choose a reason for hiding this comment

fs-eire Jan 30, 2025

Choose a reason for hiding this comment

fs-eire Jan 30, 2025

Choose a reason for hiding this comment

fs-eire Jan 30, 2025

Choose a reason for hiding this comment

fs-eire Jan 30, 2025

Choose a reason for hiding this comment

vraspar commented Jan 30, 2025 •

edited

Loading