Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add patch to fix arm segfault in nmslib during ingestion #1541

Merged
merged 1 commit into from
Mar 14, 2024

Conversation

jmazanec15
Copy link
Member

@jmazanec15 jmazanec15 commented Mar 14, 2024

Description

Adds patch to fix the segfault happening in nmslib. For more information on the patch, see nmslib/nmslib#544.

Existing tests will cover this change. The issue can only be reproduced at large scales. I can confirm the patch has remedied seeing this issue in large OpenSearch clusters.

Issues Resolved

#1406

Check List

  • Commits are signed as per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

@jmazanec15 jmazanec15 added Bug Fixes Changes to a system or product designed to handle a programming bug/glitch backport 2.x labels Mar 14, 2024
Copy link

codecov bot commented Mar 14, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 85.13%. Comparing base (bfcf7dc) to head (9b66eb1).

Additional details and impacted files
@@             Coverage Diff              @@
##               main    #1541      +/-   ##
============================================
- Coverage     85.17%   85.13%   -0.04%     
  Complexity     1299     1299              
============================================
  Files           171      171              
  Lines          5295     5295              
  Branches        506      506              
============================================
- Hits           4510     4508       -2     
- Misses          572      574       +2     
  Partials        213      213              

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@vamshin
Copy link
Member

vamshin commented Mar 14, 2024

Thanks Jack. Just Curious why is this race condition CPU architecture dependent?

@jmazanec15 jmazanec15 merged commit 2b0f5a3 into opensearch-project:main Mar 14, 2024
52 checks passed
opensearch-trigger-bot bot pushed a commit that referenced this pull request Mar 14, 2024
Signed-off-by: John Mazanec <[email protected]>
(cherry picked from commit 2b0f5a3)
@navneet1v
Copy link
Collaborator

Thanks Jack. Just Curious why is this race condition CPU architecture dependent?

all details related to this is added on the GH issue of nmslib. Its more around how different CPU architectures are creating the assembly code.

@jmazanec15
Copy link
Member Author

Right, the issue doesnt happen on x86 (that weve seen) because gcc on x86 doesnt re-order the assignment operations. But gcc on arm does - probably as some kind of optimization

navneet1v pushed a commit that referenced this pull request Mar 14, 2024
Signed-off-by: John Mazanec <[email protected]>
(cherry picked from commit 2b0f5a3)

Co-authored-by: John Mazanec <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backport 2.x Bug Fixes Changes to a system or product designed to handle a programming bug/glitch
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants