Llama tutorial for TRTLLM #62

jbkyang-nvi · 2023-10-13T03:44:34Z

No description provided.

Popular_Models_Guide/Llama2/trtllm_guide.md

README.md

Popular_Models_Guide/Llama2/trtllm_guide.md

README.md

nnshah1

Great to see this!

README.md

nnshah1

LGTM

fpetrini15 · 2023-10-27T20:09:24Z

Popular_Models_Guide/Llama2/trtllm_guide.md

+    > located in the same llama examples folder.
+    >
+    >   ```bash
+    >    python3 /run.py --engine_dir=<path to your engine>/1-gpu/ --max_output_len 100 --tokenizer_dir <path to your llama repo>/Llama-2-7b-hf --input_text "How do I count to ten in French?"


Suggested change

> python3 /run.py --engine_dir=<path to your engine>/1-gpu/ --max_output_len 100 --tokenizer_dir <path to your llama repo>/Llama-2-7b-hf --input_text "How do I count to ten in French?"

> python3 run.py --engine_dir=<path to your engine>/1-gpu/ --max_output_len 100 --tokenizer_dir <path to your llama repo>/Llama-2-7b-hf --input_text "How do I count to ten in French?"

fpetrini15 · 2023-10-27T20:11:36Z

Popular_Models_Guide/Llama2/trtllm_guide.md

+    ```bash
+    tritonserver --model-repository=/opt/tritonserver/inflight_batcher_llm
+    ```
+    Note if you built the engine with `--world-size X` where `X` is greater than 1, you will need to use the [launch_triton_server.py](https://github.com/triton-inference-server/tensorrtllm_backend/blob/release/0.5.0/scripts/launch_triton_server.py) script.


Suggested change

Note if you built the engine with `--world-size X` where `X` is greater than 1, you will need to use the [launch_triton_server.py](https://github.com/triton-inference-server/tensorrtllm_backend/blob/release/0.5.0/scripts/launch_triton_server.py) script.

Note if you built the engine with `--world_size X` where `X` is greater than 1, you will need to use the [launch_triton_server.py](https://github.com/triton-inference-server/tensorrtllm_backend/blob/release/0.5.0/scripts/launch_triton_server.py) script.

fpetrini15 · 2023-10-27T20:13:23Z

Beyond the small nits, LGTM! Great work!

matthewkotila · 2023-10-27T21:04:02Z

@fpetrini15 Beyond the small nits, LGTM! Great work!

krishung5

Great work, LGTM!

jbkyang-nvi added 2 commits October 13, 2023 14:38

initial commit

64fb97a

add details

f21a8e9

jbkyang-nvi force-pushed the kyang-add-llama-tutorials branch from cd530f5 to f21a8e9 Compare October 13, 2023 21:39

fix pre-commit

017bb8a

jbkyang-nvi changed the title ~~Kyang add llama tutorials [DO NOT MERGE]~~ Llama tutorial for TRTLLM Oct 26, 2023

krishung5 reviewed Oct 26, 2023

View reviewed changes

fpetrini15 reviewed Oct 26, 2023

View reviewed changes

Popular_Models_Guide/Llama2/trtllm_guide.md Outdated Show resolved Hide resolved

fpetrini15 reviewed Oct 26, 2023

View reviewed changes

README.md Outdated Show resolved Hide resolved

matthewkotila reviewed Oct 26, 2023

View reviewed changes

Popular_Models_Guide/Llama2/trtllm_guide.md Outdated Show resolved Hide resolved

matthewkotila reviewed Oct 26, 2023

View reviewed changes

Popular_Models_Guide/Llama2/trtllm_guide.md Outdated Show resolved Hide resolved

jbkyang-nvi added 3 commits October 26, 2023 16:28

addressed comments

f0ad6ca

update table to include notes

a8044ce

fixed typos

924e36d

jbkyang-nvi requested review from matthewkotila, nnshah1, fpetrini15, rmccorm4 and krishung5 October 26, 2023 23:43

jbkyang-nvi added 2 commits October 26, 2023 17:03

addressed more comments

dce8540

fixed table

045e719

nnshah1 reviewed Oct 27, 2023

View reviewed changes

README.md Outdated Show resolved Hide resolved

fpetrini15 reviewed Oct 27, 2023

View reviewed changes

README.md Outdated Show resolved Hide resolved

nnshah1 previously approved these changes Oct 27, 2023

View reviewed changes

nnshah1 dismissed their stale review via f7fdfc4 October 27, 2023 19:12

nnshah1 previously approved these changes Oct 27, 2023

View reviewed changes

addressed comments

fb30384

jbkyang-nvi dismissed nnshah1’s stale review via fb30384 October 27, 2023 19:21

jbkyang-nvi force-pushed the kyang-add-llama-tutorials branch from f7fdfc4 to fb30384 Compare October 27, 2023 19:21

address unseen comment

086b182

nnshah1 reviewed Oct 27, 2023

View reviewed changes

README.md Outdated Show resolved Hide resolved

update title leveling

0d27224

nnshah1 previously approved these changes Oct 27, 2023

View reviewed changes

fpetrini15 reviewed Oct 27, 2023

View reviewed changes

address nits

94309a7

jbkyang-nvi dismissed nnshah1’s stale review via 94309a7 October 27, 2023 21:15

nnshah1 previously approved these changes Oct 27, 2023

View reviewed changes

other unresolved nits

d7be3b2

jbkyang-nvi dismissed nnshah1’s stale review via d7be3b2 October 27, 2023 21:31

krishung5 approved these changes Oct 27, 2023

View reviewed changes

jbkyang-nvi merged commit 5283ae5 into main Oct 27, 2023
2 checks passed

jbkyang-nvi deleted the kyang-add-llama-tutorials branch October 27, 2023 21:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Llama tutorial for TRTLLM #62

Llama tutorial for TRTLLM #62

jbkyang-nvi commented Oct 13, 2023

nnshah1 left a comment

nnshah1 left a comment

fpetrini15 Oct 27, 2023

fpetrini15 Oct 27, 2023

fpetrini15 commented Oct 27, 2023

matthewkotila commented Oct 27, 2023

krishung5 left a comment

	> python3 /run.py --engine_dir=<path to your engine>/1-gpu/ --max_output_len 100 --tokenizer_dir <path to your llama repo>/Llama-2-7b-hf --input_text "How do I count to ten in French?"
	> python3 run.py --engine_dir=<path to your engine>/1-gpu/ --max_output_len 100 --tokenizer_dir <path to your llama repo>/Llama-2-7b-hf --input_text "How do I count to ten in French?"

	Note if you built the engine with `--world-size X` where `X` is greater than 1, you will need to use the [launch_triton_server.py](https://github.com/triton-inference-server/tensorrtllm_backend/blob/release/0.5.0/scripts/launch_triton_server.py) script.
	Note if you built the engine with `--world_size X` where `X` is greater than 1, you will need to use the [launch_triton_server.py](https://github.com/triton-inference-server/tensorrtllm_backend/blob/release/0.5.0/scripts/launch_triton_server.py) script.

Llama tutorial for TRTLLM #62

Llama tutorial for TRTLLM #62

Conversation

jbkyang-nvi commented Oct 13, 2023

nnshah1 left a comment

Choose a reason for hiding this comment

nnshah1 left a comment

Choose a reason for hiding this comment

fpetrini15 Oct 27, 2023

Choose a reason for hiding this comment

fpetrini15 Oct 27, 2023

Choose a reason for hiding this comment

fpetrini15 commented Oct 27, 2023

matthewkotila commented Oct 27, 2023

krishung5 left a comment

Choose a reason for hiding this comment