Skip to content

Triton Model Navigator v0.12.0

Latest
Compare
Choose a tag to compare
@kacper-kleczewski kacper-kleczewski released this 10 Sep 12:27
  • Updates:

    • new: simple and detailed reporting of the optimization process
    • new: adjusted exporting TensorFlow SavedModel for Keras 3.x
    • new: inform user when wrapped a module which is not called during optimize
    • new: inform user when module use a custom forward function
    • new: support for dynamic shapes in Torch ExportedProgram
    • new: use ExportedProgram for Torch-TensorRT conversion
    • new: support back-off policy during profiling to avoid reporting local minimum
    • new: automatically scale conversion batch size when modules have different batch sizes in scope of a single pipeline
    • change: TensorRT conversion max batch size search rely on saturating throughput for base formats
    • change: adjusted profiling configuration for throughput cutoff search
    • change: include optimized pipeline to list of examined variants during nav.profile
    • change: performance is not executed when correctness failed for format and runtime
    • change: verify command is not executed when verify function is not provided
    • change: do not create a model copy before executing torch.compile
    • fix: pipelines sometimes obtain model and tensors on different devices during nav.profile
    • fix: extract graph from ExportedProgram for running inference
    • fix: runner configuration not propagated to pre-processing steps
  • Version of external components used during testing: