-
Notifications
You must be signed in to change notification settings - Fork 68
Lower stream to for loop #5229
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Lower stream to for loop #5229
Conversation
70e7023 to
321e137
Compare
|
Review updated until commit daf57cf Description
Changes walkthrough 📝
PR Reviewer Guide 🔍Here are some key observations to aid the review process:
|
NFC. Needed by #5229. This way, I can pass round only the map without the containing fusion.
NFC. Needed by #5229. This way, I can pass round only the map without the containing fusion.
744f167 to
003e429
Compare
|
!test |
This PR changed reorderDIDToFront to reorder all parallelized dimensions to front. This is less controversial than I expected because currently we only call reorderDIDToFront before intra-GPU scheduling kicks in. Needed by #5229
12c0001 to
daf57cf
Compare
|
@wujingyue it might be good to merge some of the changes in this or the matmul PR so we can actually test #5309 end-to-end. That is, once we update Do you think that is feasible? |
Sure. Which parts do you need? Feel free to cherry-pick if it's faster to do it yourself. |
My idea was to push the changes in
Sounds good. Once I have a draft ready, we can try doing the above, or cherry pick things into that PR. |
Sure! I'm waiting for #5323 to be merged. Meanwhile, feel free to cherry-pick. |
For #5289