Does the arrow protocol require unique dictionary values? #47134
-
Is it explicitly required that dictionary values be unique? Currently the at least the C++ API Example of creating non-unique dicts: #25845 Curious if this is a hard-requirement, or if it's an implementation detail open to change. |
Beta Was this translation helpful? Give feedback.
Replies: 4 comments
-
Code processing dictionary-encoded arrays should not assume values to be unique. |
Beta Was this translation helpful? Give feedback.
-
From the Arrow format spec:
(Thanks Docs chat bot!) So forcing the de-duplication seems to go against the spec and there for is a bug? |
Beta Was this translation helpful? Give feedback.
-
Is there an issue opened to track here? Otherwise, @kdkavanagh can you open an issue, please? |
Beta Was this translation helpful? Give feedback.
-
Created #47151, thanks |
Beta Was this translation helpful? Give feedback.
From the Arrow format spec:
(Thanks Docs chat bot!)
So forcing the de-duplication seems to go against the spec and there for is a bug?