Triton Example with AI Gateway #13587
FernandoDorado started this conversation in Ideas and feature requests
-
@fffonion I believe that this warrants support for a new LLM type in our system (TensorRT-LLM)?
-
Hello,
We are currently evaluating open-source deployment tools and are particularly interested in integrating TensorRT-LLM with Triton. We understand from the documentation that this can be achieved using a Custom Python Server, but the implementation details are not entirely clear to us.
Could we possibly collaborate on preparing a detailed demo or example showcasing this integration? Additionally, it might be beneficial to explore the creation of a step-by-step guide, which could serve as a valuable resource for both our team and the broader community.