Why do vllm set default keep-alive timeout to 5s? #8287
Unanswered
SeungminHeo
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
For huge size and long-context models takes more than 5s and no additional connections or requests are sent to server, so server can close connection and trigger exception. For example, openai sdk that uses httpx makes error message
Server disconnected without sending a response.
For this, I want to control this with uvicorn arguments, can this be overriden?
This is a "Guess", but exceptions are actually happening.
Beta Was this translation helpful? Give feedback.
All reactions