Skip to content

Triton Model Navigator v0.9.0

Compare
Choose a tag to compare
@kacper-kleczewski kacper-kleczewski released this 07 May 18:21
· 142 commits to main since this release
  • Updates:

    • new: TensorRT Timing Tactics Cache Management - using timing tactics cache files for optimization performance improvements
    • new: Added throughput saturation verification in nav.profile() (enabled by default)
    • new: Allow to override Inplace cache dir through MODEL_NAVIGATOR_DEFAULT_CACHE_DIR env variable
    • new: inplace nav.Module can now receive a function name to be used instead of call in modules/submodules, allows customizing modules with non-standard calls
    • fix: torch dynamo export and torch dynamo onnx export
    • fix: measurement stabilization in nav.profile()
    • fix: inplace inference through Torch
    • fix: trt_profiles argument handling in ONNX to TRT conversion
    • fix: optimal shape configuration for batch size in Inplace API
    • change: Disable TensorRT profile builder
    • change: nav.optimize() does not override module configuration
  • Known issues and limitations

    • DistillERT ONNX dynamo export does not support dynamic shapes
  • Version of external components used during testing: