Skip to content

Conversation

ravi-databricks
Copy link
Contributor

  • Added support for CDC Multiple sequence cols PR
  • Added custom function support for kafka and delta tables PR
  • Update project overview and features tables in docs + readme
  • Updated release note and change logs

@ravi-databricks ravi-databricks self-assigned this Sep 12, 2025
@ravi-databricks ravi-databricks added the enhancement New feature or request label Sep 12, 2025
@ravi-databricks ravi-databricks added this to the v0.0.10 milestone Sep 12, 2025
sequence_by = cdc_apply_changes.sequence_by
if ',' in sequence_by:
sequence_cols = [col.strip() for col in sequence_by.split(',')]
sequence_by = struct(*sequence_cols) # Use struct() from pyspark.sql.functions

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

how do we guarantee consistent ordering of the sequencing columns. "col1, col2" vs. "col2, col1" are different structs and will have a different sorting and uniqueness logic.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes! Code preserves the order exactly as provided:

  • split(',') returns tokens in the original order
  • struct(sequence_cols) builds the struct with fields in that same order

So "col1, col2" and "col2, col1" produce different structs, which is expected and will change sequencing semantics.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

sounds good.

Copy link

@ganeshchand ganeshchand left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

overall, it looks good. just one concern regarding the sequency by columns ordering.

Copy link

@ganeshchand ganeshchand left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Copy link

@brij-raghuwanshi-db brij-raghuwanshi-db left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good

@ravi-databricks ravi-databricks merged commit 4690761 into main Sep 16, 2025
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants