You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
inputs (List[Union(torch_tensorrt.Input, torch.Tensor)]): **Required** List of specifications of input shape, dtype and memory layout for inputs to the module. This argument is required. Input Sizes can be specified as torch sizes, tuples or lists. dtypes can be specified using
469
+
inputs (Optional[Sequence[torch_tensorrt.Input | torch.Tensor]]): **Required** List of specifications of input shape, dtype and memory layout for inputs to the module. This argument is required. Input Sizes can be specified as torch sizes, tuples or lists. dtypes can be specified using
459
470
torch datatypes or torch_tensorrt datatypes and you can use either torch devices or the torch_tensorrt device type enum
torch.randn((1, 3, 224, 244)) # Use an example tensor and let torch_tensorrt infer settings
472
483
]
473
-
474
-
method_name (str): Name of method to convert
475
-
input_signature Union(List, Tuple, torch_tensorrt.Input, torch.Tensor): A formatted collection of input specifications for the module. Input Sizes can be specified as torch sizes, tuples or lists. dtypes can be specified using
476
-
torch datatypes or torch_tensorrt datatypes and you can use either torch devices or the torch_tensorrt device type enum to select device type. **This API should be considered beta-level stable and may change in the future** ::
dla_sram_size (int): Fast software managed RAM used by DLA to communicate within a layer.
518
510
dla_local_dram_size (int): Host RAM used by DLA to share intermediate tensor data across operations
519
511
dla_global_dram_size (int): Host RAM used by DLA to store weights and metadata for execution
520
-
calibrator (Union(torch_tensorrt._C.IInt8Calibrator, tensorrt.IInt8Calibrator)): Calibrator object which will provide data to the PTQ system for INT8 Calibration
521
-
allow_shape_tensors: (Experimental) Allow aten::size to output shape tensors using IShapeLayer in TensorRT
522
512
523
513
Returns:
524
514
bytes: Serialized TensorRT engine, can either be saved to a file or deserialized via TensorRT APIs
0 commit comments