You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Congratulations on the great work you have done! I am very interested in your work. Specifically, I want to know how you allow multiple serving processes to share the same Cuda memory spaces (for the frozen parameters in the LoRA models).
Could you please point out the code? I want to study your implementation. Thanks!
BR//Zizhao
The text was updated successfully, but these errors were encountered:
Hi,
Congratulations on the great work you have done! I am very interested in your work. Specifically, I want to know how you allow multiple serving processes to share the same Cuda memory spaces (for the frozen parameters in the LoRA models).
Could you please point out the code? I want to study your implementation. Thanks!
BR//Zizhao
The text was updated successfully, but these errors were encountered: