Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Llama tutorial for TRTLLM #62

Merged
merged 13 commits into from
Oct 27, 2023
Merged

Llama tutorial for TRTLLM #62

merged 13 commits into from
Oct 27, 2023

Conversation

jbkyang-nvi
Copy link
Collaborator

No description provided.

@jbkyang-nvi jbkyang-nvi changed the title Kyang add llama tutorials [DO NOT MERGE] Llama tutorial for TRTLLM Oct 26, 2023
Popular_Models_Guide/Llama2/trtllm_guide.md Outdated Show resolved Hide resolved
Popular_Models_Guide/Llama2/trtllm_guide.md Outdated Show resolved Hide resolved
Popular_Models_Guide/Llama2/trtllm_guide.md Show resolved Hide resolved
Popular_Models_Guide/Llama2/trtllm_guide.md Outdated Show resolved Hide resolved
Popular_Models_Guide/Llama2/trtllm_guide.md Show resolved Hide resolved
Popular_Models_Guide/Llama2/trtllm_guide.md Outdated Show resolved Hide resolved
Popular_Models_Guide/Llama2/trtllm_guide.md Outdated Show resolved Hide resolved
Popular_Models_Guide/Llama2/trtllm_guide.md Outdated Show resolved Hide resolved
Popular_Models_Guide/Llama2/trtllm_guide.md Outdated Show resolved Hide resolved
README.md Outdated Show resolved Hide resolved
README.md Outdated Show resolved Hide resolved
README.md Outdated Show resolved Hide resolved
nnshah1
nnshah1 previously approved these changes Oct 27, 2023
Copy link
Contributor

@nnshah1 nnshah1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great to see this!

nnshah1
nnshah1 previously approved these changes Oct 27, 2023
README.md Outdated Show resolved Hide resolved
nnshah1
nnshah1 previously approved these changes Oct 27, 2023
Copy link
Contributor

@nnshah1 nnshah1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

> located in the same llama examples folder.
>
> ```bash
> python3 /run.py --engine_dir=<path to your engine>/1-gpu/ --max_output_len 100 --tokenizer_dir <path to your llama repo>/Llama-2-7b-hf --input_text "How do I count to ten in French?"
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
> python3 /run.py --engine_dir=<path to your engine>/1-gpu/ --max_output_len 100 --tokenizer_dir <path to your llama repo>/Llama-2-7b-hf --input_text "How do I count to ten in French?"
> python3 run.py --engine_dir=<path to your engine>/1-gpu/ --max_output_len 100 --tokenizer_dir <path to your llama repo>/Llama-2-7b-hf --input_text "How do I count to ten in French?"

```bash
tritonserver --model-repository=/opt/tritonserver/inflight_batcher_llm
```
Note if you built the engine with `--world-size X` where `X` is greater than 1, you will need to use the [launch_triton_server.py](https://github.com/triton-inference-server/tensorrtllm_backend/blob/release/0.5.0/scripts/launch_triton_server.py) script.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Note if you built the engine with `--world-size X` where `X` is greater than 1, you will need to use the [launch_triton_server.py](https://github.com/triton-inference-server/tensorrtllm_backend/blob/release/0.5.0/scripts/launch_triton_server.py) script.
Note if you built the engine with `--world_size X` where `X` is greater than 1, you will need to use the [launch_triton_server.py](https://github.com/triton-inference-server/tensorrtllm_backend/blob/release/0.5.0/scripts/launch_triton_server.py) script.

@fpetrini15
Copy link
Collaborator

Beyond the small nits, LGTM! Great work!

@matthewkotila
Copy link
Contributor

@fpetrini15 Beyond the small nits, LGTM! Great work!

gif

nnshah1
nnshah1 previously approved these changes Oct 27, 2023
Copy link
Contributor

@krishung5 krishung5 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great work, LGTM!

@jbkyang-nvi jbkyang-nvi merged commit 5283ae5 into main Oct 27, 2023
2 checks passed
@jbkyang-nvi jbkyang-nvi deleted the kyang-add-llama-tutorials branch October 27, 2023 21:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants