
Conversation

zhangjun0x01 commented Sep 17, 2025

When we run a query that does not select the equality delete fields, it throws the exception "Equality deletes need the relevant columns to be selected". This PR fixes that issue.

Steps:

  1. Add the equality delete fields to the projection columns when the query SQL does not contain the relevant columns.
  2. Execute ApplyEqualityDeletes.
  3. Remove the equality delete fields from the output data chunk before returning the query result (a toy model of this flow follows below).
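
A minimal, self-contained sketch of that flow, with std::vector standing in for DuckDB's DataChunk; all names and values here are illustrative, not the extension's actual API:

	#include <cstddef>
	#include <iostream>
	#include <set>
	#include <vector>

	// A "chunk" is a vector of columns; each column is a vector of ints.
	using Column = std::vector<int>;
	using Chunk = std::vector<Column>;

	int main() {
		// Step 1: the query selects 1 column, but the scan also reads the
		// 2 extra columns that the equality deletes need.
		std::size_t selected_count = 1;
		Chunk scanned = {{10, 20, 30},  // the selected column
		                 {1, 2, 3},     // extra: equality delete field "id"
		                 {7, 8, 9}};    // extra: equality delete field "name" (encoded)

		// Step 2: apply the equality deletes; they reference the extra columns.
		std::set<std::size_t> deleted_rows;
		for (std::size_t row = 0; row < scanned[0].size(); row++) {
			if (scanned[1][row] == 1 || scanned[2][row] == 8) { // id = 1 OR name = 'b'
				deleted_rows.insert(row);
			}
		}

		// Step 3: emit only the selected columns, minus the deleted rows.
		Chunk output(selected_count);
		for (std::size_t col = 0; col < selected_count; col++) {
			for (std::size_t row = 0; row < scanned[col].size(); row++) {
				if (!deleted_rows.count(row)) {
					output[col].push_back(scanned[col][row]);
				}
			}
		}
		std::cout << "rows after deletes: " << output[0].size() << "\n"; // prints 1
	}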

The review comments below refer to this hunk:

		equalities.push_back(expression->Copy());
		continue;
	}
	idx_t index = field_id_to_result_id.at(field_id);
Collaborator commented:

I don't understand what's happening here, can you explain and add a clarifying comment?
Especially the fact that there's an if, else if but no else, meaning we could have a situation where an equality delete condition is essentially skipped?

Collaborator commented:

Hmm, I think I understand: you're rewriting the expressions so they match the modifications made to the output_chunk earlier on.

Can't this be done directly when constructing the expressions?
This feels rather error-prone, because any additional expressions added later would also need to be handled here.

In any case, it looks like this is actually exhaustive, so can we change the else if to just an else and turn the condition into an assertion instead?
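
To spell out that suggestion (a hypothetical before/after; the function and condition names are placeholders, not the actual code):

	#include <cassert>

	// Before: if / else if with no else, so an equality delete expression
	// that matches neither condition is silently skipped.
	// After: the two cases are exhaustive, so branch on the first and
	// assert the second instead of testing it.
	void handle(bool is_selected, bool is_appended_delete_column) {
		if (is_selected) {
			// use the index the expression was constructed with
		} else {
			assert(is_appended_delete_column); // exhaustive: nothing can fall through
			// rewrite the index to point at the appended column
		}
	}

	int main() {
		handle(true, false);
		handle(false, true);
	}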

zhangjun0x01 (Author) commented:

Consider the following case: there is a datafile with the following data:

1,'a', 2025-01-01 
2,'b', 2025-01-02

then we add two delete snapshots:

  • Snapshot 1: delete where id = 1
  • Snapshot 2: delete where name = 'b'

Both of these delete snapshots are valid for the datafile. When we execute a query:

	-- without the equality delete fields id and name
	select count(*) from mytable

In the IcebergMultiFileReader::InitializeReader method, we add all equality delete fields (id and name) to the scan columns.

In the function IcebergMultiFileReader::FinalizeChunk, the input_chunk parameter will contain the additional equality delete fields (id and name). We copy these two fields into the output_chunk. At this point, in the output_chunk, the index of the field 'id' should be 1 and the index of the field 'name' should be 2. However, the equality delete filters we obtain here were all constructed with index 1, for both 'id' and 'name'. Therefore, we rewrite the expressions here to be compatible with the logic above.

The reason for this situation is that the delete filters on the 'id' and 'name' fields come from different snapshots (in IcebergMultiFileList::ScanEqualityDeleteFile). At that point, we do not know the order of these two delete snapshots, so we cannot pass the correct index when constructing the BoundReferenceExpression (see the toy model below).
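
A self-contained toy model of that ordering problem (the names are hypothetical stand-ins for the extension's internals):

	#include <cstddef>
	#include <iostream>
	#include <unordered_map>
	#include <vector>

	int main() {
		// Two delete files are scanned in an order we don't control. When each
		// expression is built, it can only assume "my column comes right after
		// the projected columns", so both are built with the same index.
		struct DeleteExpr {
			int field_id;       // Iceberg field id the filter applies to
			std::size_t index;  // column index the expression was built with
		};
		std::vector<DeleteExpr> exprs = {{1, 1},   // filter on "id"
		                                 {2, 1}};  // filter on "name": same index!

		// Later, InitializeReader appends the extra columns in a fixed order,
		// which yields the authoritative mapping: field id -> result column.
		std::unordered_map<int, std::size_t> field_id_to_result_id = {{1, 1}, {2, 2}};

		// The rewrite pass fixes each expression to match the output chunk.
		for (auto &expr : exprs) {
			expr.index = field_id_to_result_id.at(expr.field_id);
		}
		for (const auto &expr : exprs) {
			std::cout << "field " << expr.field_id << " -> column " << expr.index << "\n";
		}
	}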

Tishj (Collaborator) commented Sep 18, 2025

Hmm, I'm not entirely following yet. It's been a while since I implemented this logic, so excuse me while I recollect some thoughts.

In ScanEqualityDeleteFile we create the expressions referencing the index in output_chunk, in order to make them projection-agnostic, so the expressions don't have to be rewritten later.

You're adding to the column_indexes (now new_column_indexes) for all the columns that aren't selected but are required by the equality deletes.

What I'm hearing is that you're losing the context of the order you add these in, which you need in order to replicate it correctly later, here:

	//! Add the extra equality delete fields to output chunk.
	int32_t diff = 0;
	if (executor.expressions.size() != output_chunk.ColumnCount()) {
		diff = executor.expressions.size() - output_chunk.ColumnCount();
		for (int32_t i = diff; i > 0; i--) {
			int32_t index = input_chunk.ColumnCount() - i;
			output_chunk.data.emplace_back(input_chunk.data[index]);
		}
	}
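
To make the arithmetic concrete, a toy version of that loop (assumed column counts, not the extension's code): with 3 expressions and 1 projected column, diff is 2, so the last 2 columns of input_chunk are appended to output_chunk in their original order:

	#include <cassert>
	#include <cstddef>
	#include <vector>

	int main() {
		// Stand-ins: each int labels a column.
		std::vector<int> input_chunk = {100, 200, 300}; // 3 columns scanned
		std::vector<int> output_chunk = {100};          // 1 column projected
		std::size_t expression_count = 3;               // executor.expressions.size()

		// Same arithmetic as the snippet above.
		int diff = static_cast<int>(expression_count - output_chunk.size()); // 2
		for (int i = diff; i > 0; i--) {
			std::size_t index = input_chunk.size() - i; // column 1, then column 2
			output_chunk.push_back(input_chunk[index]);
		}
		assert(output_chunk.size() == 3); // {100, 200, 300}
	}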

Collaborator commented:

I've made a commit to the branch to remove the need to rewrite expressions. Can you take some time to double-check my logic? I left a FIXME in there because I'm not entirely confident that piece of logic will work in all cases.

It assumes a direct relation between new_global_column_ids and the output_chunk sizes, which might break if there is projection pushdown into Parquet? I'm not entirely sure.

Collaborator commented:

For future reference: it's about the connection between the logic in ScanEqualityDeleteFile, InitializeReader and FinalizeChunk. Those now use equality_id_to_result_id to synchronize the mapping that needs to be done in those methods.
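
A rough sketch of that shared-mapping pattern (the map name comes from the comment above; everything else is hypothetical):

	#include <cstddef>
	#include <unordered_map>

	// One mapping, built once, read by every phase that needs it.
	struct EqualityDeleteState {
		// Iceberg equality delete field id -> index in the result chunk.
		std::unordered_map<int, std::size_t> equality_id_to_result_id;
	};

	// InitializeReader: appends the missing columns and records where they land.
	void InitializeReader(EqualityDeleteState &state) {
		state.equality_id_to_result_id[1] = 1; // e.g. field "id"
		state.equality_id_to_result_id[2] = 2; // e.g. field "name"
	}

	// ScanEqualityDeleteFile: builds expressions against the recorded indexes,
	// so no rewrite pass is needed later. FinalizeChunk reads the same map to
	// place (and later drop) the extra columns consistently.
	std::size_t ExpressionIndexFor(const EqualityDeleteState &state, int field_id) {
		return state.equality_id_to_result_id.at(field_id);
	}

	int main() {
		EqualityDeleteState state;
		InitializeReader(state);
		return ExpressionIndexFor(state, 2) == 2 ? 0 : 1;
	}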

zhangjun0x01 (Author) commented:

@Tishj, by the way, how do I configure Iceberg to use relative paths for the test cases in the data directory? I checked Iceberg's configuration and it seems there is no such option. Thanks!

Tishj (Collaborator) commented Sep 18, 2025

I'm not sure I understand what you're asking. Iceberg works with absolute paths; that's a limitation of the format.
We try to hack around that somewhat with allow_moved_paths, which I believe only works with iceberg_scan(...), not with an attached Iceberg catalog.

zhangjun0x01 (Author) commented:

In the data/persistent directory, all test cases use relative paths in v1.metadata.json and xxx.avro; I do not know how to generate the data with relative paths.

For example, the location field uses a relative path, not an absolute path like /data/persistent/equality_deletes/warehouse/mydb/mytable:

{
  "format-version" : 2,
  "table-uuid" : "7c269e29-15d2-48a6-bc83-4919d38e3041",
  "location" : "data/persistent/equality_deletes/warehouse/mydb/mytable",
  "last-sequence-number" : 0,
  "last-updated-ms" : 1746122109432,
  "last-column-id" : 2,
  "current-schema-id" : 0,
xxxxxx

Tishj (Collaborator) commented Sep 18, 2025

@zhangjun0x01 check out some of the scripts, like scripts/persistent/partition_bool.py; those are the scripts used to generate the persistent data.

I imagine it's because the DATA_GENERATION_DIR is a relative path.

zhangjun0x01 (Author) commented:

I tested it: when we create an Iceberg Hadoop catalog with a relative path, the table will use relative paths.

zhangjun0x01 marked this pull request as draft on September 25, 2025.