Skip to content

Triton Model Navigator v0.13.0

Latest
Compare
Choose a tag to compare
@knowicki-nvidia knowicki-nvidia released this 06 Dec 14:35
  • Updates:
    • new: Introducing custom_args in TensorConfig for custom runners to use which
      allows dynamic shapes setup for TorchTensorRT compilation
    • new: autocast_dtype added Torch runner configuration to set the dtype for autocast
    • new: New version of Onnx Runtime 1.20 for python version >= 3.10
    • new: Use torch.compile path in heuristic search for max batch size
    • change: Removed TensorFlow dependencies for nav.jax.optimize
    • change: Removed PyTorch dependencies from nav.profile
    • change: Collect all Python packages in status instead of filtered list
    • change: Use default throughput cutoff threshold for max batch size heuristic when None provided in configuration
    • change: Updated default ONNX opset to 20 for Torch >= 2.5
    • fix: Exception is raised with Python >=3.11 due to wrong dataclass initialization
    • fix: Removed option from ExportOption removed from Torch 2.5
    • fix: Improved preprocessing stage in Torch based runners
    • fix: Warn when using autocast with bfloat16 in Torch
    • fix: Pass runner configuration to runners in nav.profile