Replies: 2 comments
-
Thanks for reporting this. I do not think we used more memory, but we will check later. sglang/docs/references/hyperparameter_tuning.md Lines 27 to 31 in 16eb33f |
Beta Was this translation helpful? Give feedback.
0 replies
-
OK. Maybe it's just occasional. Thank you. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I've had a try on the latest release version v0.3.4.post1 recently. I suspect this version consumes more gpu memory which may be related to the cpu-time-hidden feature? I'm not very sure. I encounter
OutOfMemory
errors in v0.3.4.post1 which seldom appeared before in the older version.lanching server commands in
docker-compose.yaml
:client side:
the OutOfMemory logs:
The machine is A100-SXM4-40GB with 8 gpus.
Beta Was this translation helpful? Give feedback.
All reactions