[v24.3.x] iceberg: fix spec inconsistency in manifest list files_count#24650
Merged
andrwng merged 2 commits intoredpanda-data:v24.3.xfrom Jan 8, 2025
Merged
Conversation
Collaborator
Author
Retry command for Build#60094please wait until all jobs are finished before running the slash command |
Collaborator
Author
CI test resultstest results on build#60094
test results on build#60133
test results on build#60374
|
038e933 to
2cfb307
Compare
Collaborator
Author
Retry command for Build#60129please wait until all jobs are finished before running the slash command |
2cfb307 to
d9c34cb
Compare
Collaborator
Author
Retry command for Build#60133please wait until all jobs are finished before running the slash command |
Contributor
|
/ci-repeat |
Some characters that may be used in topic names must be treated specialy in SQL. Table names must be quoted when they contains the special character. Quotation mark is specific to query engine. Added quotation for table names for query engines used in tests. Signed-off-by: Michał Maślanka <michal@redpanda.com> (cherry picked from commit 8fe08cd)
The schema we are using was pulled some time ago and appears to be outdated. The Apache Iceberg Java implementation has since renamed added_data_files_count and friends to added_files_count, to match the documented spec. This meant that after updating the table with an external non-Redpanda writer, Redpanda wouldn't be able to download the current manifest list when appending and get stuck, complaining about an EOF (presumably the Avro C++ library throws this when there is an unknown field). I suspect that this may have also been the cause of an EOF seen when trying to read a manifest list with BigQuery: Error while reading data, error message: The Apache Avro failed to read data with the following error: EOF reached File: [...]/metadata/snap-[...]-0.avro The old names are added as an alias to ensure Redpanda can still download Iceberg manifest lists from 24.3. (cherry picked from commit a3e6880)
d9c34cb to
c5ff7e4
Compare
Collaborator
Author
Retry command for Build#60374please wait until all jobs are finished before running the slash command |
ztlpn
approved these changes
Jan 8, 2025
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Backport of PR #24602