Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Question]: It seems that minference does not currently support tensor parallelism under vllm, right? Because in a multi-card environment, the head_id here is incorrect compared to a single card #62

Open
zh2333 opened this issue Aug 4, 2024 · 2 comments
Assignees
Labels
feature request New feature or request question Further information is requested

Comments

@zh2333
Copy link

zh2333 commented Aug 4, 2024

Describe the issue

多卡
@zh2333 zh2333 added the question Further information is requested label Aug 4, 2024
@iofu728 iofu728 self-assigned this Aug 5, 2024
@iofu728 iofu728 added the feature request New feature or request label Aug 5, 2024
@iofu728
Copy link
Contributor

iofu728 commented Aug 5, 2024

Hi @zh2333,

Thanks for your support.

Currently, the vllm version does not support TP, but we expect to add this feature by the middle of next month. I'll close issue #63 due to duplicate content.

@zh2333
Copy link
Author

zh2333 commented Aug 5, 2024

Hi @zh2333,

Thanks for your support.

Currently, the vllm version does not support TP, but we expect to add this feature by the middle of next month. I'll close issue #63 due to duplicate content.

Thank you very much for your reply. Looking forward to supporting TP features in minference!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request New feature or request question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants