Replies: 1 comment
I also added it, but there was no improvement.
Hi,
I am running a local LLM on CPU only (super slow, I know, but that does not matter for my use case).
When I run inference directly, it just waits (300 s for a '2+2=?' prompt), but when I proxy the request through LiteLLM, the request gets killed after 60 s and then, I believe, retried.
I have
What am I missing?
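
For reference, a minimal `config.yaml` sketch that raises the proxy-side timeout would look roughly like this. The model name, `api_base`, and the keys `timeout`, `request_timeout`, and `num_retries` are assumptions based on my reading of the LiteLLM docs and may differ by version, so please check them against the docs:

```yaml
model_list:
  - model_name: local-model                 # hypothetical alias that clients request from the proxy
    litellm_params:
      model: openai/local-model             # assumes the local server exposes an OpenAI-compatible API
      api_base: http://localhost:8080/v1    # hypothetical address of the local backend
      timeout: 600                          # per-deployment timeout in seconds (assumed key)

litellm_settings:
  request_timeout: 600                      # proxy-wide request timeout in seconds (assumed key)
  num_retries: 0                            # avoid re-sending the slow request after a timeout (assumed key)
```

If the 60 s cut-off persists even with a higher proxy timeout, the client sitting in front of the proxy may be enforcing its own timeout and retrying, which would match the kill-and-retry pattern described above.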