Skip to content

[VL][Delta] Add JVM-side Delta DV scan handoff#3

Draft
malinjawi wants to merge 1 commit into
split/delta-dv-native-reader-prfrom
split/delta-dv-java-scan-pr
Draft

[VL][Delta] Add JVM-side Delta DV scan handoff#3
malinjawi wants to merge 1 commit into
split/delta-dv-native-reader-prfrom
split/delta-dv-java-scan-pr

Conversation

@malinjawi
Copy link
Copy Markdown
Owner

What changes are proposed in this pull request?

This draft PR is the third step in the split Delta deletion-vector stack.

It depends on apache#12040 and contains the JVM-side Delta scan and deletion-vector handoff pieces split out from apache#11963.

Main changes:

  • add Delta-version-specific metadata utilities for extracting materialized deletion-vector payloads
  • preprocess Delta scans with DVs so native Delta scans receive metadata columns and serialized DV payload handles
  • register task-local DV payloads through DeltaDeletionVectorRegistry and retain direct ByteBuffers for the native kernel lifetime
  • extend the Java/JNI/native split path so per-split payload buffers are passed beside LocalFiles split info
  • map Delta scan split metadata to the native Delta connector and HiveDeltaSplit path introduced by PR2
  • add Delta deletion-vector handoff suites for Delta 3.3 and Delta 4.0 layouts

This PR intentionally builds on PR2 rather than reintroducing the roaring bitmap or native Delta reader infrastructure.

How was this patch tested?

  • ./build/mvn -q spotless:apply -pl gluten-arrow,gluten-substrait,gluten-delta,backends-velox -Pbackends-velox -Pdelta -Pspark-3.5
  • ./build/mvn -q spotless:check -pl gluten-arrow,gluten-substrait,gluten-delta,backends-velox -Pbackends-velox -Pdelta -Pspark-3.5
  • ./build/mvn -q -pl gluten-arrow,gluten-substrait,gluten-delta,backends-velox -am -Pbackends-velox -Pdelta -Pspark-3.5 -DskipTests -DskipUTs -DskipITs test-compile
  • PATH=/opt/homebrew/opt/llvm@15/bin:$PATH /tmp/gluten-format-venv/bin/python dev/check.py format tree <touched C++ files>
  • git diff --check && git diff --cached --check

Was this patch authored or co-authored using generative AI tooling?

Generated-by: IBM BOB

@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@malinjawi malinjawi force-pushed the split/delta-dv-java-scan-pr branch from 05109ae to 280cb51 Compare May 14, 2026 10:21
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@malinjawi malinjawi force-pushed the split/delta-dv-java-scan-pr branch from 280cb51 to 9bf0f3d Compare May 14, 2026 12:31
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@malinjawi malinjawi force-pushed the split/delta-dv-native-reader-pr branch from 08353bc to 6be05d2 Compare May 14, 2026 12:32
@malinjawi malinjawi force-pushed the split/delta-dv-java-scan-pr branch from 9bf0f3d to 1534612 Compare May 14, 2026 12:33
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@malinjawi malinjawi force-pushed the split/delta-dv-java-scan-pr branch from 1534612 to b061144 Compare May 14, 2026 19:41
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@malinjawi malinjawi force-pushed the split/delta-dv-native-reader-pr branch from 0338c08 to 4843cb8 Compare May 14, 2026 19:48
@malinjawi malinjawi force-pushed the split/delta-dv-java-scan-pr branch from b061144 to 205da0a Compare May 14, 2026 19:49
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant