rfc: support for torch-tensorrt #908

hietalajulius · 2024-11-15T00:22:13Z

RFC: Add Torch-TensorRT Support

Summary

This PR adds support for NVIDIA's Torch-TensorRT in tch-rs, enabling GPU inference optimization through TensorRT while maintaining the PyTorch ergonomics.

Motivation

Torch-TensorRT is NVIDIA's official PyTorch-TensorRT integration that can provide:

Up to 5x faster inference compared to eager execution
Automatic optimization of PyTorch models
Seamless integration with existing PyTorch workflows
Support for both dynamic and static shapes

Implementation Details

Build System Changes

Added support through:

New torch-tensorrt feature flag in Cargo.toml
Build script modifications to link TensorRT libraries
C++ preprocessor flag USE_TORCH_TENSORRT

Important Prerequisites & Caveats

Python Environment Requirement
- Torch-TensorRT must be installed via pip in your Python environment:
  pip install torch-tensorrt
- The build script will detect TensorRT through this installation
Rust Nightly Requirement
- Requires Rust nightly toolchain due to the use of the -as-needed linker option
- Add to your project:
```
[toolchain]
channel = "nightly"
```
- Run with RUSTFLAGS="-Zunstable-options"
Runtime Environment

LD_LIBRARY_PATH must include TensorRT library paths
E.g.: export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/path/to/python3.12/site-packages/torch_tensorrt/lib:/path/to/python3.12/site-packages/tensorrt_libs

Usage

Enable in your project's Cargo.toml e.g.:

[features]
torch-tensorrt = ["tch/torch-tensorrt"]

Questions

Overall is this within the scope of tch-rs?
Only having support on nightly is not great, this can be avoided by using e.g. LD_PRELOAD (https://pytorch.org/TensorRT/user_guide/runtime.html) to overcome the -as-needed issue which is not amazing either
Installation via python is convenient in dev, but an actual production env would probably want to install the libs without the python dependency
Further tweaking of the linker flags probably needed to avoid setting LD_LIBRARY_PATH

rfc: support for torch-tensorrt

cc71c52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rfc: support for torch-tensorrt #908

rfc: support for torch-tensorrt #908

hietalajulius commented Nov 15, 2024

rfc: support for torch-tensorrt #908

Are you sure you want to change the base?

rfc: support for torch-tensorrt #908

Conversation

hietalajulius commented Nov 15, 2024

RFC: Add Torch-TensorRT Support

Summary

Motivation

Implementation Details

Build System Changes

Important Prerequisites & Caveats

Usage

Questions