Generative AI Multi-Node w/ Triton and TensorRT-LLM Guide/Tutorial #97

Merged
merged 2 commits into from
Jul 3, 2024

whoisj
Contributor

@whoisj whoisj commented Jun 11, 2024

This change creates the TensorRT-LLM Multi-Node tutorial and guide.

Includes:

  • Instructions w/ explanations (README)
  • Helm chart
  • Container definitions
  • Server-side Python script
  • Various helpful YAML files.

@whoisj whoisj added the documentation (Improvements or additions to documentation) and enhancement (New feature or request) labels Jun 11, 2024
@whoisj whoisj changed the title from "TRT-LLM Multi-Node Tutorial: initial check-in" to "Generative AI Multi-Node w/ Triton and TensorRT-LLM Guide/Tutorial" Jun 11, 2024
@whoisj whoisj force-pushed the jwyman/trtllm-mndm branch 4 times, most recently from c85cfae to 2a3a2ed June 12, 2024 19:36
This change adds an exclusion for Deployment/Kubernetes/TensorRT-LLM_Multi-Node_Distributed_Models/ to the pre-commit-config.
@whoisj whoisj self-assigned this Jun 13, 2024
@whoisj whoisj force-pushed the jwyman/trtllm-mndm branch 2 times, most recently from 70f8410 to 5dd4725 June 26, 2024 21:00

@harryskim harryskim left a comment

LGTM but will defer to Neelay on the final approval.

@nnshah1 nnshah1 requested a review from harryskim July 1, 2024 17:03
@whoisj whoisj force-pushed the jwyman/trtllm-mndm branch 2 times, most recently from 4aa59b6 to 281e4ef July 3, 2024 21:42
This change creates the TensorRT-LLM Multi-Node tutorial and guide.

Includes:
- Instructions w/ explanations (README)
- Helm chart
- Container definitions
- Server-side Python script
- Various helpful YAML files.
Contributor

@nnshah1 nnshah1 left a comment

looks good to me - can follow up with any changes in subsequent updates

@whoisj whoisj merged commit 81c510d into triton-inference-server:main Jul 3, 2024
3 checks passed
Labels
documentation (Improvements or additions to documentation), enhancement (New feature or request)
3 participants