Data race in `internals/overlord/servstate:S.startTestServices(c)` #264

barrettj12 · 2023-07-28T02:16:18Z

This is a utility function called in many of the servstate tests.

pebble/internals/overlord/servstate/manager_test.go

Line 256 in 00bcd1f

func (s *S) startTestServices(c *C) {

It starts two services

test1: /bin/sh -c "echo test1 | tee -a .../log.txt; sleep 10"
test2: /bin/sh -c "echo test2 | tee -a .../log.txt; sleep 10"

and then asserts the contents of log.txt are

test1
test2

However, there is a data race, since we may read log.txt before the test2 service has had a chance to write to it.

manager_test.go:203:
    c.Assert(string(data), Matches, "(?s)"+expected)
... value string = "test1\n"
... regex string = "" +
...     "(?s).*test1\n" +
...     ".*test2\n"

To reproduce

Create a new test in internals/overlord/servstate/manager_test.go:

func (s *S) TestStartTestServices(c *C) {
	s.startTestServices(c)
}

Install the stress utility (useful for finding sporadic test failures):

go install golang.org/x/tools/cmd/stress@latest

Run

stress go test ./internals/overlord/servstate -check.f TestStartTestServices -count=1

and start observing failures.

The text was updated successfully, but these errors were encountered:

flotter · 2023-07-28T16:02:31Z

I will add this to #266 - its not included right now, but easy to add.

The startTestServices() test helper uses a special entry in the service command under test to write the service standard output also to a log file that can be inspected. This mechanism suffers from a race condition (as highlighted in canonical#264) because when the content of the log file is loaded, the service may not yet have completed writing to the log. Since standard output is also verified separately through a different mechanism, the following changes are made: - Enhance the global "done check" (previous called "done file") to check completion per service. This effectively adds the capability previously provided by the log assert mechanism. - Use the existing "done check" mechanism to wait until the service command-line is complete up to the point of the check. - Only now verify the stdout buffer content as checked previously. - Remove the log mechanism all together.

flotter · 2023-08-01T11:52:27Z

#266 Should now fix this.

The startTestServices() test helper uses a special entry in the service command under test to write the service standard output also to a log file that can be inspected. This mechanism suffers from a race condition (as highlighted in #264) because when the content of the log file is loaded, the service may not yet have completed writing to the log. Since standard output is also verified separately through a different mechanism, the following changes are made: - Enhance the global "done check" (previous called "done file") to check completion per service. This effectively adds the capability previously provided by the log assert mechanism. - Use the existing "done check" mechanism to wait until the service command-line is complete up to the point of the check. - Only now verify the stdout buffer content as checked previously. - Remove the log mechanism all together.

flotter mentioned this issue Aug 1, 2023

servstate: test stability improvements #266

Merged

jnsgruk linked a pull request Aug 1, 2023 that will close this issue

servstate: test stability improvements #266

Merged

jnsgruk closed this as completed in #266 Aug 1, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Data race in `internals/overlord/servstate:S.startTestServices(c)` #264

Data race in `internals/overlord/servstate:S.startTestServices(c)` #264

barrettj12 commented Jul 28, 2023

flotter commented Jul 28, 2023

flotter commented Aug 1, 2023

Data race in internals/overlord/servstate:S.startTestServices(c) #264

Data race in internals/overlord/servstate:S.startTestServices(c) #264

Comments

barrettj12 commented Jul 28, 2023

To reproduce

flotter commented Jul 28, 2023

flotter commented Aug 1, 2023

Data race in `internals/overlord/servstate:S.startTestServices(c)` #264

Data race in `internals/overlord/servstate:S.startTestServices(c)` #264