Reduce raciness in test #1996

lukedirtwalker · 2018-10-17T13:10:32Z

fetchSegsFromDBRetry selects on ctx.Done() and time.After().
If the setup/cancellation of the context takes longer than the duration we pass to time.After,
both channels can be ready at the same time, and the test might then fail.

By increasing the timeout in the test this should no longer be a problem.

sgmonroy

For a bit more context: this is the result of investigating a CI failure on an unrelated change.
CI seems to be heavily resource constrained, so race conditions show up more often there.

func (h *baseHandler) fetchSegsFromDBRetry(ctx context.Context,
        params *query.Params) ([]*seg.PathSegment, error) {

        for {
                upSegs, err := h.fetchSegsFromDB(ctx, params)
                if err != nil || len(upSegs) > 0 {
                        return upSegs, err
                }
                select {
                case <-ctx.Done():
                        return nil, ctx.Err()
                case <-time.After(h.retryInt):
                        // retry
                }
        }
}

We reasoned that on each evaluation of the select, time.After creates a new timer and channel that fire independently of the current goroutine. It is thus basically a race between the timer and the current goroutine. If the current goroutine is not scheduled soon enough to evaluate the select cases, it is theoretically possible that the timer has already expired, in which case both cases are ready and one of them is chosen at random.
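The random choice can be reproduced with a minimal sketch. It assumes both channels are already ready when the select runs: the context is cancelled up front and a short sleep lets the zero-duration timer expire. `countSelections` is a hypothetical helper, not part of the code under review.

```go
package main

import (
	"context"
	"fmt"
	"time"
)

// countSelections runs a select n times with both channels already ready
// and reports how often each case was chosen.
func countSelections(n int) (ctxWins, timerWins int) {
	ctx, cancel := context.WithCancel(context.Background())
	cancel() // ctx.Done() is now closed, so that case is always ready.

	for i := 0; i < n; i++ {
		timer := time.After(0)
		// Give the timer time to expire so that both cases are ready.
		time.Sleep(time.Millisecond)
		select {
		case <-ctx.Done():
			ctxWins++
		case <-timer:
			timerWins++
		}
	}
	return ctxWins, timerWins
}

func main() {
	ctxWins, timerWins := countSelections(200)
	fmt.Printf("ctx.Done() chosen %d times, timer chosen %d times\n",
		ctxWins, timerWins)
}
```

The exact counts differ from run to run, but both cases should be chosen a substantial number of times. A test that expects only one of the two outcomes will therefore fail intermittently when both channels happen to be ready.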

Reviewed 1 of 1 files at r1.
Reviewable status: complete! all files reviewed, all discussions resolved (waiting on @kormat)

kormat

Reviewable status: complete! all files reviewed, all discussions resolved (waiting on @kormat)

lukedirtwalker assigned kormat and sgmonroy Oct 17, 2018

sgmonroy approved these changes Oct 18, 2018

lukedirtwalker force-pushed the pubStabilizeTest branch from 64def34 to 295c736 Compare October 18, 2018 08:46

lukedirtwalker force-pushed the pubStabilizeTest branch from 295c736 to edc2d12 Compare October 18, 2018 08:47

kormat approved these changes Oct 18, 2018

lukedirtwalker merged commit a83dfac into scionproto:master Oct 18, 2018

lukedirtwalker deleted the pubStabilizeTest branch October 18, 2018 10:22
