Hello,
I am trying to run on 3 GPUs on a remote server over SSH, but I keep getting this error whenever I use more than one GPU. The single-GPU case works perfectly.
This is the main error message:
RuntimeError: The server socket has failed to listen on any local network address. port: 29500, useIpv6: 0, code: -98, name: EADDRINUSE, message: address already in use
My command looks something like this:
I have tried changing MASTER_PORT to other values such as 29501, but the error message still refers to port 29500. I have also checked port 29500 with lsof and grep, but found no process using it. Does anyone know how to fix this?
Here is the full error message, if you want to take a look:
error.txt
Thanks in advance!
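The lsof check on port 29500 described above can be cross-checked from Python by trying to bind the port directly, since EADDRINUSE is the error that bind raises when the address is taken. This is only a minimal sketch: the helper names `port_is_free` and `find_free_port` are hypothetical, and it assumes a plain IPv4 TCP socket, matching the `useIpv6: 0` in the error message.

```python
import errno
import socket


def port_is_free(port: int, host: str = "") -> bool:
    """Return True if an IPv4 TCP socket can bind to (host, port) right now."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        try:
            sock.bind((host, port))
        except OSError as exc:
            if exc.errno == errno.EADDRINUSE:
                # Same condition as "code: -98, name: EADDRINUSE" in the error.
                return False
            raise
        return True


def find_free_port() -> int:
    """Ask the OS for an unused TCP port by binding to port 0."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.bind(("", 0))
        return sock.getsockname()[1]


if __name__ == "__main__":
    print("29500 free:", port_is_free(29500))
    print("a free port:", find_free_port())
```

If `port_is_free(29500)` reports the port as free before launching, yet the error still names 29500 after MASTER_PORT was changed, the new value is probably not reaching the process that opens the socket. Whether the launcher reads the MASTER_PORT environment variable or takes the port from a command-line option is something to verify against the actual command, which is not shown here.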