unexpected explicit tensor data for input tensor 'attention_mask' for model 'pipeline-poc-inference__isvc-211152d1e7' of type 'INT32', expected datatype 'INT64' #5407
-
Hi deployed an ONNX model on ModelMesh serving on Triton Runtime. Now trying to create python grpc client. But getting the following issues: Got the model metadata:
But now struggling to figure out the input data type issue, although tried in many different ways:
Here are the final codes I tried to run:
What could be the issue? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
You should be able to resolve this issue by changing |
Beta Was this translation helpful? Give feedback.
You should be able to resolve this issue by changing
list(map(int, b_input_mask.ravel().tolist()))
tolist(map(int64, b_input_mask.ravel().tolist()))
. In the future, if you run into an issue like this, checking the type of your input at each step viatype()
can be helpful. Triton is expecting an int64 due to the model config and input datatype, but the data is int32.