Skip to content

Conversation

@zifeif2
Copy link
Contributor

@zifeif2 zifeif2 commented Dec 8, 2025

What changes were proposed in this pull request?

Introduce Repartition Writer that supports operator with multiple column families. This include
TransformWithState

  • Support List, Map, Value state variables
  • event time timers, processing time timers and TTLs
  • Stream Stream Join V3

Other things that this PR makes:

  • When reading a LIST TWS state, it's returned as unnested operator (one element per row) PR. Therefore, when writing a list back to the State Store, writer will use "db.put()" followed by "db.merge()"

Why are the changes needed?

To complete the Offline Repartitioning project

Does this PR introduce any user-facing change?

No

How was this patch tested?

See integration test added for the operators mentioned above

Was this patch authored or co-authored using generative AI tooling?

@zifeif2 zifeif2 force-pushed the repartition-writer-multi-cf branch from 67d8600 to cc1321b Compare December 9, 2025 00:41
@zifeif2 zifeif2 marked this pull request as ready for review December 9, 2025 00:53
@zifeif2 zifeif2 force-pushed the repartition-writer-multi-cf branch from cc1321b to 74e6741 Compare December 10, 2025 23:27
@zifeif2 zifeif2 force-pushed the repartition-writer-multi-cf branch from 74e6741 to 275130b Compare December 11, 2025 05:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant