Is there a way to have the LLM timeout or to cancel a request before the LLM starts to respond? #910
Unanswered
nicholusi2021
asked this question in
Q&A
Replies: 1 comment 3 replies
-
Have you debugged it at all to see where it's getting stuck? 10 minutes is extreme, even for a large prompt! |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Most of my requests take less than a minute for a response to come back. However, some requests take over 10 minutes before I stop and restart the service. I played around with the cancelation token, but that seems to only cancels the request once the LLM starts to respond so it didn't speed things up.
Is there a way to have a request timeout after a certain amount of time, or a better way to cancel a request while it's still processing? I would rather respond back with a generic response then have the user wait over 10 minutes.
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions