Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Disable track_and_verify_wals=1 with write fault injection only when pessimistic txn in stress test #13360

Closed
wants to merge 1 commit into from

Conversation

hx235
Copy link
Contributor

@hx235 hx235 commented Jan 31, 2025

Context/Summary:

#13263 temporally disabled track_and_verify_wals=1 with write fault injection in all cases to mitigate a WAL hole surfaced by track_and_verify_wals=1 not fully debugged at that time. Fully debugging shows the WAL hole only happens under pessimistic TXN when two-phase-commit (2pc) was used.

The bug essentially is about 2pc won't be able to discard the corrupted WAL as it would in non-2pc case as part of the WAL write error recovery. So the corrupted WAL will still present in the next DB open and caught by track_and_verify_wals=1.

This fix is going to take a while. So for now, let's reduce the scope of disabling the testing.

Test:
Monitor stress test for WAL recovery error/corruption

@facebook-github-bot
Copy link
Contributor

@hx235 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@hx235
Copy link
Contributor Author

hx235 commented Feb 3, 2025

@jaykorean ping :)

Copy link
Contributor

@jaykorean jaykorean left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks!

@facebook-github-bot
Copy link
Contributor

@hx235 merged this pull request in a10b4aa.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants