Skip to content

[BUG] NDS2.0 query16 validation failure due to unmatched results(intermittent) #14076

@yinqingh

Description

@yinqingh

Describe the bug
Build:

  • nds2-parquet-30k-snappy-gh-weekly/39
  • nds2-parquet-10k-snappy-sparkh-weekly/42

Query16 validation failed in the third test iteration (202512270151-GPU) while two previous iterations passed successfully. The query execution completed but validation check returned 'Fail' status. This is an intermittent issue affecting query16 in the NDS2.0 30TB parquet benchmark test on Grace Hopper cluster.

Also observed the same failure in nds2-parquet-10k-snappy-sparkh-weekly/42.

Error logs:

[2025-12-27T02:14:15.244Z] Collected 1 rows in 0.08379197120666504 seconds
[2025-12-27T02:14:15.244Z] Row 0: 
[2025-12-27T02:14:15.244Z] [151636, Decimal('634135438.30'), Decimal('-129843205.33')]
[2025-12-27T02:14:15.244Z] [151636, Decimal('634139560.60'), Decimal('-129841602.82')]

Environment details

  • Spark Version: 3.4.3
  • Hadoop Version: 3.3.6
  • Failed Iteration: 3rd run (202512270151)
  • Successful Iterations: 1st run (202512270102), 2nd run (202512270126)

Metadata

Metadata

Assignees

Labels

bot_watchSlack bot watched issue for LLM analyzerbugSomething isn't working

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions