Commit

Update for new image version
Maxusmusti committed Jan 22, 2024
1 parent 9117ac8 commit 181e6e5
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion language/llama2-70b/api-endpoint-artifacts/benchmark.yaml
@@ -6,7 +6,7 @@ spec:
   restartPolicy: Never
   containers:
   - name: mlperf-env
-    image: quay.io/meyceoz/mlperf-inference:v3-greedy
+    image: quay.io/meyceoz/mlperf-inference:v4
     resources:
       requests:
         memory: 20000Mi
@@ -68,7 +68,7 @@ spec:
   - name: MAX_BATCH_WEIGHT
     value: "540000"
   - name: MAX_SEQUENCE_LENGTH
-    value: "4096"
+    value: "2048"
   - name: MAX_PREFILL_WEIGHT
     value: "0"
   - name: MAX_NEW_TOKENS
