Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Question]: rpc #10547

Open
xzp0208 opened this issue Jul 30, 2024 · 0 comments
Open

[Question]: rpc #10547

xzp0208 opened this issue Jul 30, 2024 · 0 comments

Comments

@xzp0208
Copy link

xzp0208 commented Jul 30, 2024

Description

(xzp) ai@ubuntu:~/of$ python -m oneflow.distributed.launch --nproc_per_node 2 ./ddp_train.py


Setting OMP_NUM_THREADS environment variable for each process to be 1 in default, to avoid your system being overloaded, please further tune the variable for optimal performance in your application as needed.


W20240730 07:54:40.090461 27910 rpc_client.cpp:190] LoadServer 127.0.0.1 Failed at 0 times error_code 14 error_message failed to connect to all addresses
W20240730 07:54:40.124917 27911 rpc_client.cpp:190] LoadServer 127.0.0.1 Failed at 0 times error_code 14 error_message failed to connect to all addresses

Alternatives

No response

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant