
[Inference] Integrate vllm example #262

Merged: 25 commits merged into intel:main on Aug 16, 2024

Conversation

KepingYan (Contributor)

No description provided.

carsonwang (Contributor)

Thanks for the work! Can you also update all the model YAMLs so they use vLLM by default, unless a model is not supported? Remove the IPEX- and DeepSpeed-related configs from the YAMLs and disable both backends by default.
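
Below is a minimal sketch of what such a per-model YAML might look like after the requested change. The field names (`vllm.enabled`, `ipex.enabled`, `deepspeed`, `model_description`) are assumptions for illustration only; the actual schema used in this repo's configs, and the final shape chosen in the PR diff, may differ.

```yaml
# Hypothetical model YAML after the requested change (field names are
# assumed, not taken from the PR diff).
name: llama-2-7b-chat-hf
route_prefix: /llama-2-7b-chat-hf

# vLLM becomes the default serving backend for supported models.
vllm:
  enabled: true

# IPEX and DeepSpeed are disabled by default; their detailed tuning
# options are removed from the YAML per the review request.
ipex:
  enabled: false
deepspeed: false

model_description:
  model_id_or_path: meta-llama/Llama-2-7b-chat-hf
  tokenizer_name_or_path: meta-llama/Llama-2-7b-chat-hf
```

For a model that vLLM does not support, the same file would presumably keep `vllm.enabled: false` and fall back to the default (non-vLLM) serving path.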

KepingYan marked this pull request as ready for review on July 23, 2024 at 07:12.
KepingYan (Contributor, Author)

Gently ping @xwu99: all review comments are resolved.

xwu99 merged commit da6d9f9 into intel:main on Aug 16, 2024. All 13 checks passed.