-
Notifications
You must be signed in to change notification settings - Fork 130
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[C#] Regression on 0.5.0 with DML #1071
Comments
Well, DML didn't really work before in 0.40 . I mean it works up to a point then breaks. I took a look at the closed pull requests and didn't see anything relating to any DML fixes which is dissapointing. |
Just updated my code to 0.51 to try it out c# directml using same model as OP. Quadro P5000 GPU Same line:
|
@RyanUnderhill This is the one we caught with the validation pipeline. I thought it was the same error but turns out it wasn't. This crash is reason why there's no log message printed. I can reproduce this locally. |
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
We get very similar error/exception since also fail to run with EDIT: it seems v0.5.0 works well too, but why only |
@elephantpanda Is the crash in |
For our scenario, downgrading from 0.5.1 to 0.5.0 fixed the issue, so @elephantpanda is probably having a separate issue. |
I am using 0.5.1 (I have never tried 0.5.0) Presumably it's the same issue as it's the same line it crashes on. BTW, just tried this in CPU mode and it works fine so only crashes in DML mode. |
@elephantpanda You could try 0.5.0 first. We're preparing a 0.5.2 patch release that should fix the crash. |
I installed 0.5.0 onnxruntimegenai.directml and onnxruntimeGenai.managed keeping the other libraries the same. It now doesn't crash. It just outputs the first token then fails on the second token:
I'll just wait for the patch I think. |
Describe the bug
C# Version 0.5.0 broke DML models, such as microsoft--Phi-3-mini-4k-instruct-onnx directml-int4-awq-block-128.
The model loads, but the Generator's constructor throws an Access violation exception.
To Reproduce
Steps to reproduce the behavior:
Expected behavior
Works just as 0.4.0.
Desktop (please complete the following information):
The text was updated successfully, but these errors were encountered: