vLLM support for DocSum #885
Signed-off-by: sgurunat <[email protected]>
…rt for vllm endpoints
for more information, see https://pre-commit.ci
Codecov Report: All modified and coverable lines are covered by tests ✅
Signed-off-by: sgurunat <[email protected]>
…into vllm-docsum
for more information, see https://pre-commit.ci
Please add comps/llms/summarization/vllm/langchain/Dockerfile to .github/workflows/docker/compose/llms-compose-cd.yaml. That YAML file is used to build the release images.
…workflows Signed-off-by: sgurunat <[email protected]>
@lvliang-intel - Updated the llm-compose-cd.yaml file. It's showing a conflict.
Signed-off-by: sgurunat <[email protected]>
Signed-off-by: sgurunat <[email protected]>
@chensuyue @lvliang-intel Updated the GitHub workflows to include the vLLM DocSum build in the llm-compose.yaml file.
Signed-off-by: sgurunat <[email protected]>
* Add model parameter for DocSumGateway in gateway.py file
  Signed-off-by: sgurunat <[email protected]>
* Add langchain vllm support for DocSum along with authentication support for vllm endpoints
* [pre-commit.ci] auto fixes from pre-commit.com hooks
  for more information, see https://pre-commit.ci
* Updated docker_compose_llm.yaml and README file with vLLM information
  Signed-off-by: sgurunat <[email protected]>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
  for more information, see https://pre-commit.ci
* Updated docsum-vllm Dockerfile into llm-compose-cd.yaml under github workflows
  Signed-off-by: sgurunat <[email protected]>
* Updated llm-compose.yaml file to include vllm sumarization docker build
  Signed-off-by: sgurunat <[email protected]>
---------
Signed-off-by: sgurunat <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: lvliang-intel <[email protected]>

Description
Added LangChain vLLM support for summarization. A new vllm folder under summarization mirrors the file structure of the existing tgi folder. The Dockerfile, compose YAML, and README describe how to build and use summarization with vLLM.
This PR also adds authentication support for remote vLLM endpoints in llm.py.
Because model is a mandatory parameter for vLLM, DocSumGateway in the mega service's gateway.py now accepts a new parameter called "model".
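For reviewers, here is a minimal sketch of why the "model" parameter and the authentication change matter when calling a remote vLLM endpoint. It is not the code in llm.py or gateway.py. It assumes the endpoint is vLLM's OpenAI-compatible completions route and that the remote endpoint accepts a bearer token; the helper name `build_vllm_request`, the example URL, and the model name are hypothetical.

```python
import json
from typing import Dict, Optional, Tuple


def build_vllm_request(
    endpoint: str,
    model: str,
    prompt: str,
    access_token: Optional[str] = None,
    max_tokens: int = 512,
) -> Tuple[str, Dict[str, str], str]:
    """Assemble URL, headers, and JSON body for a completions request.

    Hypothetical helper for illustration only; it performs no network I/O.
    """
    # The OpenAI-compatible API needs to know which model to use,
    # so reject an empty value before building the request.
    if not model:
        raise ValueError("'model' is required for vLLM requests")

    headers = {"Content-Type": "application/json"}
    # Assumption: a protected remote endpoint expects a bearer token.
    if access_token:
        headers["Authorization"] = f"Bearer {access_token}"

    payload = {"model": model, "prompt": prompt, "max_tokens": max_tokens}
    url = endpoint.rstrip("/") + "/v1/completions"
    return url, headers, json.dumps(payload)


if __name__ == "__main__":
    # Placeholder endpoint, model name, and token.
    url, headers, body = build_vllm_request(
        "http://localhost:8008", "example-model", "Summarize: ...", "example-token"
    )
    print(url)
    print(headers)
    print(body)
```

When no token is supplied, the Authorization header is simply omitted, so the same helper would cover both local and authenticated remote endpoints.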
Issues
N/A
Type of change
Dependencies
N/A