The second memory node not working when trying 1P-2M-1S with a GMM #10
Comments
By the way, when we tried multiple processor nodes with GPM configured as 'Y', LegoOS failed to compile. It seemed that some code was using the |
@fyc1007261, I'm traveling this week and will get back to this next Wed/Thur. Sorry for the delay. Meanwhile, @hythzz, will you have time to help out? |
Hi @lastweek, We are still struggling with the multiple P/M problem. Could you please help us check where our configuration might be wrong, or provide some instructions on this? |
Hi @fyc1007261, Sorry for the delay. I have been moving around a lot recently. According to your first post, it seems that at least all machines are connected. You mentioned "the #1 node (the default memory node configured on all machines) used up all its memory and panicked". Did you see an OOM message? I might have a clue where the issue is, but I need to take a look at your .config files. Could you share your P and M's .config files with me? Thank you. |
Hi @lastweek, I have put all the config files and logs that I consider important at this link. We are running the programming Thanks a lot for your help!! Config files and logs: |
Hi @fyc1007261 , I just checked your config files, looks like you didn't enable the Processor node side:
Memory node side:
Please keep the
|
Hi @lastweek @hythzz , Thanks for your help! It works now! |
Hi @fyc1007261, we are back on schedule and will update the repo more frequently. |
Hi @fyc1007261, If your issue has been solved, please close this thread. |
Hi @hythzz, |
Hi @lastweek ,
We have successfully deployed 1P-1M-1S on CloudLab and we are now trying to run some experiments on multiple processor/memory nodes. We tried with 5 nodes: #0 as processor; #1 and #4 as memory; #2 as storage; and #3 as global resource monitor. We have also configured linux-modules/monitor/include/monitor_config.h to let the GMM know the IDs of the memory nodes. After rebooting the processor and memory nodes, we ran make fit_install on the storage and GMM nodes, then make monitor_install on the GMM node and make storage_install on the storage node. However, when we tried to run an application that required a large amount of memory, the #1 node (the default memory node configured on all machines) used up all its memory and panicked, while the #4 node seemed not to be working. Is there anything that we have left unconfigured, or anything that we did wrong? Thanks very much for your help!
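For readers checking a similar layout, here is a minimal sketch of the bookkeeping described in the post: which node IDs are memory nodes, and whether the list given to the monitor covers all of them. It is not LegoOS code. The `ROLES` table mirrors the five nodes listed above, while `memory_nodes`, `check_monitor_list`, and the example ID lists are hypothetical names introduced only for illustration. It does not read monitor_config.h or any .config file, so it cannot detect a disabled kernel option.

```python
# Hypothetical sanity check for the 5-node layout in the post.
# None of these names come from LegoOS; they are illustrative only.

# Node ID -> role, as listed in the post (#3 is the global resource monitor).
ROLES = {0: "processor", 1: "memory", 2: "storage", 3: "monitor", 4: "memory"}


def memory_nodes(roles):
    """Return the sorted IDs of all nodes assigned the memory role."""
    return sorted(nid for nid, role in roles.items() if role == "memory")


def check_monitor_list(roles, monitor_memory_ids):
    """Compare the memory IDs given to the monitor with the layout.

    Returns (missing, unexpected): IDs that are memory nodes in the layout
    but absent from the monitor's list, and IDs in the monitor's list that
    are not memory nodes in the layout.
    """
    expected = set(memory_nodes(roles))
    listed = set(monitor_memory_ids)
    return sorted(expected - listed), sorted(listed - expected)


if __name__ == "__main__":
    print("memory nodes:", memory_nodes(ROLES))
    # An assumed, incomplete list: only the default memory node is registered.
    print("missing/unexpected:", check_monitor_list(ROLES, [1]))
```

If the first element of the returned pair is non-empty, the monitor would never hand out memory from those nodes, which would match the symptom of one memory node filling up while the other stays idle. A clean result here only rules out that one cause.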