Fix record skipping when querying paginated data across shards #3061

Swiddis · 2024-10-08T23:24:52Z

Description

When pulling unordered data from an index with multiple shards, records are skipped if the fetchSize is not a multiple of the shard count. The persisted cursor position used to continue paging is based on the last seen _doc, and _doc values are duplicated across shards when the primary shard count exceeds 1. This PR currently adds a reproducer for the bug -- finding a fix is still in progress.
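
The failure mode can be illustrated with a small, self-contained simulation. This is a simplified model and not the plugin's code: the `Doc` class, `buildIndex`, and `pageAll` are hypothetical names, and the "resume strictly after the last seen sort key" rule is an assumption standing in for how the cursor behaves. Each simulated shard numbers its documents from 0, so the same doc number appears once per shard.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

class SearchAfterSketch {
  /** Hypothetical document: a per-shard doc number plus a globally unique id. */
  static final class Doc {
    final int docNumber;
    final String id;

    Doc(int docNumber, String id) {
      this.docNumber = docNumber;
      this.id = id;
    }
  }

  /** Builds shardCount shards, each numbering its docs from 0 (so doc numbers repeat). */
  static List<Doc> buildIndex(int shardCount, int docsPerShard) {
    List<Doc> docs = new ArrayList<>();
    for (int shard = 0; shard < shardCount; shard++) {
      for (int n = 0; n < docsPerShard; n++) {
        docs.add(new Doc(n, "s" + shard + "-d" + n));
      }
    }
    return docs;
  }

  /** Pages through the index, resuming each page strictly after the last seen sort key. */
  static List<Doc> pageAll(List<Doc> index, int fetchSize, boolean tieBreakOnId) {
    Comparator<Doc> order = Comparator.comparingInt(d -> d.docNumber);
    if (tieBreakOnId) {
      // Adding a unique secondary key makes every sort position distinct.
      order = order.thenComparing(d -> d.id);
    }
    List<Doc> sorted = new ArrayList<>(index);
    sorted.sort(order);

    List<Doc> seen = new ArrayList<>();
    Doc cursor = null;
    while (true) {
      List<Doc> page = new ArrayList<>();
      for (Doc d : sorted) {
        if (page.size() == fetchSize) {
          break;
        }
        // Docs that compare equal to the cursor are treated as already returned.
        if (cursor == null || order.compare(d, cursor) > 0) {
          page.add(d);
        }
      }
      if (page.isEmpty()) {
        return seen;
      }
      seen.addAll(page);
      cursor = page.get(page.size() - 1);
    }
  }

  public static void main(String[] args) {
    List<Doc> index = buildIndex(2, 3); // 2 shards x 3 docs = 6 docs
    System.out.println(pageAll(index, 3, false).size()); // prints 5: one doc is skipped
    System.out.println(pageAll(index, 3, true).size()); // prints 6: nothing is skipped
  }
}
```

In this model a fetchSize of 3 over 2 shards ends the first page in the middle of a group of equal doc numbers, and the rest of that group is never returned. A fetchSize of 2 happens to end each page on a group boundary, which matches the "multiple of the shard count" condition above.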

Related Issues

N/A

Check List

New functionality includes testing.
New functionality has been documented.
New functionality has javadoc added.
New functionality has a user manual doc added.
API changes companion pull request created.
Commits are signed per the DCO using --signoff.
Public documentation issue/PR created.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: Simeon Widdis <[email protected]>

penghuo

Does this also impact Join in the legacy module?

penghuo · 2024-10-09T00:26:04Z

integ-test/src/test/java/org/opensearch/sql/sql/PaginationWindowIT.java

+
+  @Test
+  public void testMultiShardPagesEqualsActualData() throws IOException {
+    // A bug made it so when pulling unordered data from an index with multiple shards, data gets lost if the fetchSize


"pulling unordered data from an index"
This is not accurate; isn't the data ordered (sorted by _doc)?

penghuo · 2024-10-09T00:29:05Z

opensearch/src/main/java/org/opensearch/sql/opensearch/request/OpenSearchQueryRequest.java

@@ -189,6 +189,9 @@ public OpenSearchResponse searchWithPIT(Function<SearchRequest, SearchResponse>
      // Set sort field for search_after
      if (this.sourceBuilder.sorts() == null) {
        this.sourceBuilder.sort(DOC_FIELD_NAME, ASC);
+        // Workaround to preserve sort location more exactly,


If this is a workaround, could you link an issue tracking the long-term solution?

fddattal · 2024-10-09T01:08:35Z

opensearch/src/main/java/org/opensearch/sql/opensearch/request/OpenSearchQueryRequest.java

@@ -189,6 +189,9 @@ public OpenSearchResponse searchWithPIT(Function<SearchRequest, SearchResponse>
      // Set sort field for search_after
      if (this.sourceBuilder.sorts() == null) {
        this.sourceBuilder.sort(DOC_FIELD_NAME, ASC);
+        // Workaround to preserve sort location more exactly,
+        // see https://github.com/opensearch-project/sql/pull/3061
+        this.sourceBuilder.sort("_id", ASC);


I see there are a couple of places in the code where we sort on the doc field.
Can you help me understand why we only need this here?

https://github.com/search?q=repo%3Aopensearch-project%2Fsql+%22DOC_FIELD_NAME%22&type=code

Add reproducer for pagination skipping bug

e143507

Signed-off-by: Simeon Widdis <[email protected]>

Swiddis force-pushed the bugfix/pagination-mismatch branch from ba741cd to e143507 on October 8, 2024 23:49

Swiddis added the bug label (Something isn't working) Oct 9, 2024

Fix the bug

a8af162

Signed-off-by: Simeon Widdis <[email protected]>

Swiddis force-pushed the bugfix/pagination-mismatch branch from 3946e6a to a8af162 on October 9, 2024 00:10

Swiddis marked this pull request as ready for review October 9, 2024 00:11

Swiddis requested review from ps48, kavithacm, derek-ho, joshuali925, dai-chen, YANG-DB, rupal-bq, mengweieric, vamsi-amazon, penghuo, seankao-az, MaxKsyunz, Yury-Fridlyand, anirudha, forestmvey, acarbonetto, GumpacG, ykmr1224, LantaoJin and noCharger as code owners October 9, 2024 00:11

penghuo reviewed Oct 9, 2024

fddattal reviewed Oct 9, 2024
