Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Networking failures on Marist/s390x systems #699

Open
ben-walsh opened this issue Feb 12, 2019 · 25 comments
Open

Networking failures on Marist/s390x systems #699

ben-walsh opened this issue Feb 12, 2019 · 25 comments

Comments

@ben-walsh
Copy link

ben-walsh commented Feb 12, 2019

Multicast networking is non-functional on the zLinux test machines.

I'm confident this is what is causing the consistent failure of the tests identified in ibmruntimes/openj9-openjdk-jdk8#143.

I have confirmed using the example multicast client and server code at https://examples.javacodegeeks.com/core-java/net/multicastsocket-net/java-net-multicastsocket-example that a fresh and unmodified zLinux machine provisioned on demand on my intranet works.

@sxa
Copy link
Member

sxa commented Mar 7, 2019

The network interfaces are definitely enabled for multicast so not clear why this is failing

@sxa sxa added this to the 2019 March milestone Mar 7, 2019
@karianna karianna modified the milestones: 2019 March, 2019 April Apr 1, 2019
@karianna karianna modified the milestones: 2019 April, 2019 May May 2, 2019
@karianna karianna modified the milestones: 2019 May, 2019 June Jun 3, 2019
@sxa sxa modified the milestones: 2019 June, July 2019 Jul 2, 2019
@sxa sxa modified the milestones: July 2019, August 2019 Jul 30, 2019
@andrew-m-leonard
Copy link
Contributor

@sxa From my investigation I am coming to the same conclusion as Ben on this.... thse Adopt zLinux machine don't support multicast correctly

@sxa sxa added the testFail label Aug 26, 2019
@karianna karianna modified the milestones: August 2019, September 2019 Sep 2, 2019
@adam-thorpe
Copy link

I believe these two tests are also linked to this issue:

  • java/net/MulticastSocket/PromiscuousIPv6.java
  • java/nio/channels/DatagramChannel/PromiscuousIPv6.java

@karianna karianna modified the milestones: September 2019, October 2019 Oct 4, 2019
@sxa
Copy link
Member

sxa commented Oct 28, 2019

@smlambert I believe the OpenJ9 team use zLinux machines from Marist as we do - does the same problem occur on the OpenJ9 CI?

@smlambert
Copy link
Contributor

those tests also fail at OpenJ9

java/net/MulticastSocket/Promiscuous.java:
https://ci.eclipse.org/openj9/view/Test/job/Grinder/547/

java/nio/channels/DatagramChannel/Promiscuous.java:
https://ci.eclipse.org/openj9/view/Test/job/Grinder/548/

@sxa sxa modified the milestones: October 2019, November 2019 Nov 1, 2019
@sxa sxa modified the milestones: November 2019, December 2019 Nov 29, 2019
@karianna karianna modified the milestones: January 2020, February 2020 Feb 3, 2020
@sxa sxa modified the milestones: February 2020, March 2020 Feb 25, 2020
@sxa sxa modified the milestones: March 2020, April 2020 Mar 31, 2020
@sxa
Copy link
Member

sxa commented May 4, 2020

@Haroon-Khel Can you verify whether this problem still exists on the new machines we got in December please?

@sxa sxa modified the milestones: April 2020, May 2020 May 4, 2020
@Haroon-Khel
Copy link
Contributor

This problem persists on the following marist machines:
build-marist-rhel77-s390x-1
build-marist-rhel77-s390x-2
test-marist-sles12-s390x-1
The two RHEL machines fail on ipv6 multicast tests, while the SLES machine fails the ipv4 multicast tests. Ill inform our contact to see whether there has been an update to our case from our emails in January

@Haroon-Khel Haroon-Khel modified the milestones: May 2020, June 2020 Jun 1, 2020
@Haroon-Khel Haroon-Khel modified the milestones: June 2020, Backlog Jul 3, 2020
@adamfarley
Copy link
Contributor

Running some grinders to see if this error is still occurring:

build-marist-rhel77-s390x-1 - Machine doesn't appear to exist, though there is a "1x". Same machine?
build-marist-rhel77-s390x-2 - OpenJ9 JDK8 Promiscuous.java x5 grinder - Passed
test-marist-sles12-s390x-1 - OpenJ9 JDK11 MulticastSocket/Test.java x5 grinder - Fails due to:

Unable to find remote helper for 'https'

Could be connected to this issue. @sxa - What do you think?

@sxa
Copy link
Member

sxa commented Jun 9, 2021

To summarise earlier call ... 1X is a renamed version of 1 to resolve an issue with the Jenkins connection. It is likely it can be renamed back but it is the same machine.

Ignore the sles failure for the purposes of this. The machine does not currently have the ci.role.test label

@adamfarley
Copy link
Contributor

adamfarley commented Jun 11, 2021

Ok, here's a list of all of the multicast tests that were associated with this issue, with the results by VM, version, and machine.

build-marist-rhel77-s390x-1x
java/net/MulticastSocket:

Test Name 8HS 11HS 16HS 8J9 11J9 16J9
Promiscuous.java Pass Pass Pass Pass Pass Pass
PromiscuousIPv6.java N/A N/A Pass N/A N/A Pass
SetLoopbackMode.java Fail Fail Fail Fail Fail Fail
Test.java Fail Fail Fail Fail Fail Fail

java/nio/channels/DatagramChannel:

Test Name 8HS 11HS 16HS 8J9 11J9 16J9
MulticastSendReceiveTests.java Fail Fail Fail Fail Fail Fail
Promiscuous.java Pass Pass Pass Pass Pass Pass
PromiscuousIPv6.java N/A N/A Pass N/A N/A Pass



build-marist-rhel77-s390x-2
java/net/MulticastSocket:

Test Name 8HS 11HS 16HS 8J9 11J9 16J9
Promiscuous.java Pass Pass Pass Pass Pass Pass
PromiscuousIPv6.java N/A N/A Pass N/A N/A Pass
SetLoopbackMode.java Fail Fail Fail Fail Fail Fail
Test.java Fail Fail Fail Fail Fail Fail

java/nio/channels/DatagramChannel:

Test Name 8HS 11HS 16HS 8J9 11J9 16J9
MulticastSendReceiveTests.java Wait Wait Wait Wait Wait Wait
Promiscuous.java Wait Wait Wait Wait Wait Wait
PromiscuousIPv6.java N/A N/A Wait N/A N/A Wait

@SehrishHussain
Copy link

Set of tests fail for openjdk18, aix-ppc64
java/net/MulticastSocket:

java/net/MulticastSocket/B6427403.java
java/net/MulticastSocket/IPMulticastIF.java
java/net/MulticastSocket/NoLoopbackPackets.java
java/net/CookieHandler/CookieManagerTest.java

Jenkins URL: https://ci.adoptopenjdk.net/job/Test_openjdk18_hs_extended.openjdk_ppc64_aix_testList_1/6/#showFailuresLink
TRSS Link: https://trss.adoptium.net/allTestsInfo?buildId=616d7be7c6182d02177fff72&limit=5&hasChildren=true

@sxa
Copy link
Member

sxa commented Feb 6, 2023

Ill inform our contact to see whether there has been an update to our case from our emails in January

Since this issue is still open this needs to be re-tested in light of the Marist machines having been rebuilt in the last few months.

@sxa
Copy link
Member

sxa commented Nov 5, 2024

Will link this to other issues as an "umbrella" for Marist/s390x networking failures

@sxa sxa changed the title Multicast Networking Non-Functional On zLinux Machines Networking failures on Marist/s390x systems Nov 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: Todo
Development

No branches or pull requests

9 participants