Skip to content

Conversation

karuppayya
Copy link
Contributor

What changes were proposed in this pull request?

Generate code for MergeRowsExec operator

Why are the changes needed?

This would help operator fusing and improve performance of MERGE operations

Does this PR introduce any user-facing change?

No

How was this patch tested?

All existing MergeRowsExec tests pass with codegen

Was this patch authored or co-authored using generative AI tooling?

No

@github-actions github-actions bot added the SQL label Sep 21, 2025
@karuppayya
Copy link
Contributor Author

@karuppayya
Copy link
Contributor Author

karuppayya commented Sep 29, 2025

@aokolnychyi @cloud-fan @dongjoon-hyun @rdblue Could one of you please help review this PR?

This change specifically improves MERGE INTO operations by fusing the Join Operator (typically SortMerge) and the MergeRow operator.

This fusion is expected to significantly boost MERGE performance at scale by reducing virtual function call overhead. Furthermore, it enables this path to be code-generated and potentially interoperate with other codegen-supported operators(most operators in Spark) in the future.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant