Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

build(deps): changes needed to support mamba/jamba model #400

Draft
wants to merge 8 commits into
base: main
Choose a base branch
from

Conversation

anhuong
Copy link
Collaborator

@anhuong anhuong commented Dec 3, 2024

Description of the change

In order to support Mamba2ForCausalLM and JambaForCausalLM models, we needed to install deps mamba_ssm and transformers with changes from Fabian's fork. In order to install mamba_ssm, we needed package cudnn9-cuda-12 otherwise we hit error ImportError: libcudnn.so.9: cannot open shared object file: No such file or directory. Then, causal-conv1d failed to install from mamba_ssm due to ModuleNotFoundError: No module named 'torch' which is why we are installing mamba_ssm as a separate dependency after the base deps are installed.

Related issue number

How to verify the PR

Built image with these changes and was able to run tuning on Mamba and Jamba models.

Was the PR tested

  • I have added >=1 unit test(s) for every new method I have added.
  • I have ensured all unit tests pass

Ssukriti and others added 7 commits November 21, 2024 15:19
Signed-off-by: Sukriti-Sharma4 <[email protected]>
Signed-off-by: Sukriti-Sharma4 <[email protected]>
Signed-off-by: Sukriti-Sharma4 <[email protected]>
Signed-off-by: Sukriti-Sharma4 <[email protected]>
Signed-off-by: Sukriti-Sharma4 <[email protected]>
Copy link

github-actions bot commented Dec 3, 2024

Thanks for making a pull request! 😃
One of the maintainers will review and advise on the next steps.

@github-actions github-actions bot added the build label Dec 3, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants