[SPARK-54670][CONNECT] Rework Connect Literal handling #53430

hvanhovell · 2025-12-10T19:40:34Z

Why are the changes needed?

This PR fixes a number of problems with literal handling in Spark Connect:

This removes the dataType field from the literal; that field was an unshipped change. The problem with it is that it makes it far too easy to create an inconsistent state (e.g. an int can now be declared as a binary), and it causes duplication.
... TBD...
...TBD...
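
To make the first point concrete, here is a minimal sketch of the inconsistency. The types below (`LooseLiteral`, `TypedLiteral`, and the two-member `DataType`) are hypothetical stand-ins invented for illustration, not the Spark Connect protobuf messages. The sketch assumes the removed field sat next to the value as an independent field.

```scala
object LiteralShapes {
  // Hypothetical, simplified types for illustration only; these are not
  // the Spark Connect protobuf messages.
  sealed trait DataType
  case object IntegerType extends DataType
  case object BinaryType extends DataType

  // Shape A: the value and its declared type are independent fields,
  // so nothing stops them from disagreeing.
  final case class LooseLiteral(value: Any, dataType: DataType)

  // Shape B: one case per kind of literal; the type follows from the case.
  sealed trait TypedLiteral { def dataType: DataType }
  final case class IntLiteral(value: Int) extends TypedLiteral {
    val dataType: DataType = IntegerType
  }
  final case class BinaryLiteral(value: Array[Byte]) extends TypedLiteral {
    val dataType: DataType = BinaryType
  }

  // Shape A needs a runtime check that Shape B gets from the compiler.
  def isConsistent(l: LooseLiteral): Boolean = (l.value, l.dataType) match {
    case (_: Int, IntegerType)        => true
    case (_: Array[Byte], BinaryType) => true
    case _                            => false
  }

  def main(args: Array[String]): Unit = {
    println(isConsistent(LooseLiteral(42, IntegerType)))
    println(isConsistent(LooseLiteral(42, BinaryType))) // an int declared as binary
    println(IntLiteral(42).dataType)
  }
}
```

With Shape B, "an int that is a binary" cannot be written down at all, and the type is stored in one place only.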

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Modified existing tests.

Was this patch authored or co-authored using generative AI tooling?

No.

hvanhovell · 2025-12-10T19:44:13Z

sql/connect/common/src/test/resources/query-tests/explain-results/function_lit_array.explain

Difference in Decimal handling:

[89.976200000000000000,89.976210000000000000] AS ARRAY(89.976200000000000000BD, 89.976210000000000000BD)#0, [89889.766723100000000000,89889.766723100000000000] AS ARRAY(89889.766723100000000000BD, 89889.766723100000000000BD)#0
[89.97620,89.97621] AS ARRAY(89.97620BD, 89.97621BD)#0, [89889.7667231,89889.7667231] AS ARRAY(89889.7667231BD, 89889.7667231BD)#0
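
The two plan renderings in this comment appear to differ only in decimal scale: one carries 18 fractional digits, the other keeps the scale each value was written with. The comment does not say which is the old output and which is the new. The sketch below uses plain `java.math.BigDecimal` (not Spark's `Decimal` class) to show how the same numeric value prints differently at two scales; the object name is made up for illustration.

```scala
import java.math.{BigDecimal => JBigDecimal}

object DecimalScaleSketch {
  // The value as written in the test, with scale 5.
  val asWritten: JBigDecimal = new JBigDecimal("89.97620")

  // The same value widened to scale 18; widening never requires rounding.
  val widened: JBigDecimal = asWritten.setScale(18)

  def main(args: Array[String]): Unit = {
    println(asWritten.toPlainString)
    println(widened.toPlainString)
    // Numerically equal, but equals() also compares scale.
    println(asWritten.compareTo(widened) == 0)
    println(asWritten.equals(widened))
  }
}
```

Because the printed form depends on scale, a golden-file comparison such as an explain-result test will see a difference even when the numeric values are identical.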

hvanhovell · 2025-12-10T19:47:57Z

sql/connect/common/src/test/resources/query-tests/explain-results/function_typedLit.explain

Difference in Binary handling:

0x0806 AS X'0806'#0, [8,6] AS ARRAY(8Y, 6Y)#0
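
For readers unfamiliar with the two forms shown here: the same two bytes can be presented as a single binary value or as an array whose elements are bytes. The helpers below are hypothetical and only mimic the two textual shapes; they are not Spark's rendering code.

```scala
object BinaryVsByteArraySketch {
  val bytes: Array[Byte] = Array[Byte](8, 6)

  // Render the bytes as one binary value in hex, e.g. 0x0806.
  def asBinary(bs: Array[Byte]): String =
    bs.map(b => f"${b & 0xff}%02X").mkString("0x", "", "")

  // Render the bytes as an array of individual byte elements, e.g. [8,6].
  def asByteArray(bs: Array[Byte]): String =
    bs.mkString("[", ",", "]")

  def main(args: Array[String]): Unit = {
    println(asBinary(bytes))
    println(asByteArray(bytes))
  }
}
```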

hvanhovell · 2025-12-11T19:46:28Z

...er/src/main/scala/org/apache/spark/sql/connect/planner/LiteralExpressionProtoConverter.scala

+    expressions.Literal(value, dataType)
+  }
+
+  private object FromProtoToCatalystConverter extends FromProtoConvertor {


This directly converts the literal to its Catalyst representation.
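
As a rough illustration of what "directly" can mean here, the sketch below assumes a shared traversal with per-target hooks, so that one converter yields external values and another yields an internal form with no intermediate step. Every name in it (`ProtoLiteral`, `FromProto`, `ToExternal`, `ToInternal`, `InternalString`) is hypothetical; the actual `FromProtoConvertor` in this PR may be structured quite differently.

```scala
import java.nio.charset.StandardCharsets.UTF_8

object DirectConversionSketch {
  // Hypothetical stand-ins; not the Spark Connect proto or Catalyst classes.
  sealed trait ProtoLiteral
  final case class ProtoInt(value: Int) extends ProtoLiteral
  final case class ProtoString(value: String) extends ProtoLiteral

  // Stand-in for an engine-internal string (assumed here to be UTF-8 bytes).
  final case class InternalString(utf8: Vector[Byte])

  // Shared traversal; subclasses decide how leaf values are represented.
  trait FromProto {
    protected def string(s: String): Any
    final def convert(l: ProtoLiteral): Any = l match {
      case ProtoInt(i)    => i
      case ProtoString(s) => string(s)
    }
  }

  // Produces ordinary external values.
  object ToExternal extends FromProto {
    protected def string(s: String): Any = s
  }

  // Produces the internal form in one step, with no external value in between.
  object ToInternal extends FromProto {
    protected def string(s: String): Any =
      InternalString(s.getBytes(UTF_8).toVector)
  }

  def main(args: Array[String]): Unit = {
    println(ToExternal.convert(ProtoString("a")))
    println(ToInternal.convert(ProtoString("a")))
  }
}
```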

hvanhovell added 4 commits December 9, 2025 19:49

WIP

8c4b8c3

Merge remote-tracking branch 'apache/master' into undo-SPARK-53578

c5ca99b

Fix verbatims

9bc6566

Fix ML tests

7b75e12

github-actions bot added SQL ML CONNECT labels Dec 10, 2025

Fix tests

b85ab62

github-actions bot added the PYTHON label Dec 11, 2025
