Tests fail often for different reasons #129

Open
FliegendeWurst opened this issue Nov 27, 2024 · 3 comments

FliegendeWurst commented Nov 27, 2024

Describe the bug
The tests occasionally fail when running with default settings. I have also observed the tests to hang indefinitely.

ERROR traitlets:client.py:568 Error occurred while starting new kernel client for kernel 95bf912a-0d9e-419a-8ae5-3b4fd9c31f6e: Kernel died before replying to kernel_info
ERROR traitlets:client.py:568 Error occurred while starting new kernel client for kernel fc8ac83e-1017-412a-95de-475b0d83c8ba: Kernel didn't respond in 60 seconds
None = NotebookResult(nb={'cells': [{'cell_type': 'code', 'execution_count': 1, 'metadata': {'execution': {'iopub.status.busy...'949777d72b0d2535278d3dc13498b2535136f6dfe0678499012e853ee9abcab1'}}}, 'nbformat': 4, 'nbformat_minor': 2}, error=None).error
zmq.error.ZMQError: Address already in use (addr='tcp://127.0.0.1:39979')

^ It is a very bad idea to use ports >32k (here 39979), since that range is also used for the source ports of outgoing TCP connections (the Linux default ephemeral range is 32768-60999).
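
To illustrate the "Address already in use" error above, here is a minimal sketch using only the Python standard library. It does not reproduce how the Jupyter stack actually picks its ports (that is not shown in this thread); the helper names `ephemeral_port_range` and `second_bind_fails` are hypothetical. It reads the local ephemeral port range where Linux exposes it, and shows that a second bind to a port that is already taken fails with `EADDRINUSE`.

```python
import errno
import socket
from pathlib import Path


def ephemeral_port_range():
    """Return (low, high) of the ephemeral port range on Linux, else None."""
    path = Path("/proc/sys/net/ipv4/ip_local_port_range")
    try:
        low, high = path.read_text().split()
        return int(low), int(high)
    except (OSError, ValueError):
        # Not Linux, or the file is unreadable or malformed.
        return None


def second_bind_fails(host="127.0.0.1"):
    """Bind one socket to an OS-chosen port, then try to bind a second one to it."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as first:
        first.bind((host, 0))  # port 0 asks the OS for any free port
        port = first.getsockname()[1]
        with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as second:
            try:
                second.bind((host, port))  # same address while `first` holds it
            except OSError as exc:
                return exc.errno == errno.EADDRINUSE
    return False


if __name__ == "__main__":
    print("ephemeral range:", ephemeral_port_range())
    print("second bind rejected:", second_bind_fails())
```

Any scheme that picks a port number first and binds to it later leaves a window in which another process, or an outgoing connection using that number as its source port, can take the port. More parallel workers make that window easier to hit.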

To Reproduce
Steps to reproduce the behavior:

  1. Build and test nbmake using Nix (https://github.com/NixOS/nixpkgs)

Expected behavior
No test errors.

Logs
https://gist.github.com/FliegendeWurst/807356cbe8f273045a167198350e3d9c

mweinelt commented Dec 8, 2024

This looks like a race condition due to using xdist with high parallelism. A workaround is to set --maxprocesses to something lower.
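
As a rough sketch of that workaround, the helper below builds pytest-xdist arguments with a capped worker count. The function name `xdist_args` and the default cap of 4 are illustrative assumptions, not values suggested in this thread. `--maxprocesses` only takes effect when the worker count is chosen automatically, so it is paired with `-n auto` here.

```python
import os


def xdist_args(max_workers=4):
    """Build pytest-xdist arguments that cap the number of workers.

    max_workers=4 is an arbitrary illustrative default.
    """
    cpus = os.cpu_count() or 1
    workers = max(1, min(max_workers, cpus))
    # "-n auto" sizes the pool from the CPU count; "--maxprocesses" caps it.
    return ["-n", "auto", f"--maxprocesses={workers}"]


if __name__ == "__main__":
    # Print the arguments so they can be appended to a pytest command line.
    print(" ".join(xdist_args()))
```

With pytest and pytest-xdist installed, the returned list can be passed to `pytest.main(...)` or appended to a `pytest` command line.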

@alex-treebeard
Member

This definitely feels like something best worked around. For reference, we depend on nbclient, which in turn depends on a stack of Jupyter libraries that are not the most amenable to parallelism.

https://github.com/jupyter/nbclient

What degree of parallelism are you running with?

@FliegendeWurst
Author

> What degree of parallelism are you running with?

I was running the tests with 32 jobs, but I sometimes encountered the issue with 12 jobs too.
