Handle an interesting race condition (bug #1747367) #8385

jameinel · 2018-02-15T12:11:13Z

Description of change

The static analysis showed that this might be a problem, and we're able to set
up a case where it really does fail.

Essentially, we have one loop which waits to try and send a notification of an
event to users. While waiting for that event to be sent, it allows new requests
to be made. So if we start watching for an event, but never handle when it
triggers, and then queue up a bunch more watch events that cause us to
reallocate the event slice, and then unwatch the original event, we will
never notice that the original event is no longer watched because we'll be hung
on an object that doesn't get updated.

QA steps

The test properly handles the error conditions. If you uncomment that one line in presence.go, it properly fails the test case (and fails after waiting LongWait).

Documentation changes

None.

Bug reference

lp:1747367

The static analysis showed that this might be a problem, and we're able to set up a case where it really does fail. Essentially, we have one loop which waits to try and send a notification of an event to users. While waiting for that event to be sent, it allows new requests to be made. So if we start watching for an event, but never handle when it triggers, and then queue up a bunch more watch events that cause us to reallocate the event slice, and *then* unwatch the original event, we will never notice that the original event is no longer watched because we'll be hung on an object that doesn't get updated.

jameinel · 2018-02-15T13:43:01Z

!!build!!

manadart

That is a subtle one.

manadart · 2018-02-15T14:14:24Z

$$merge$$

rogpeppe

Nice catch! LGTM with one minor thought.

rogpeppe · 2018-02-15T17:32:42Z

state/presence/presence.go

@@ -353,6 +353,11 @@ func (w *Watcher) flush() {
 				return
 			case req := <-w.request:
 				w.handle(req)
+				// handle may append to the w.pending array, and/or it may unwatch something that was previously pending


ISTM that we're already handling the unwatch case (that's the e.ch != nil loop condition) but that the real issue here is the append, because the append can cause a realloc and thus e might be pointing to the old slice and hence be invalid.

I think I'd probably program this a little more defensively;

for { // Note: w.pending can be appended to by w.handle so we can't // safely keep the same e between iterations. e := &w.pending[i] if e.ch == nil { break } select { etc

?

jameinel force-pushed the 2.3-presence-1747367 branch from 49500bf to ba26f5c Compare February 15, 2018 13:54

manadart approved these changes Feb 15, 2018

View reviewed changes

jujubot merged commit d7a0009 into juju:2.3 Feb 15, 2018

rogpeppe reviewed Feb 15, 2018

View reviewed changes

jameinel mentioned this pull request Feb 16, 2018

2.3 into develop #8391

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handle an interesting race condition (bug #1747367) #8385

Handle an interesting race condition (bug #1747367) #8385

jameinel commented Feb 15, 2018

jameinel commented Feb 15, 2018

manadart left a comment

manadart commented Feb 15, 2018

rogpeppe left a comment

rogpeppe Feb 15, 2018

Handle an interesting race condition (bug #1747367) #8385

Handle an interesting race condition (bug #1747367) #8385

Conversation

jameinel commented Feb 15, 2018

Description of change

QA steps

Documentation changes

Bug reference

jameinel commented Feb 15, 2018

manadart left a comment

Choose a reason for hiding this comment

manadart commented Feb 15, 2018

rogpeppe left a comment

Choose a reason for hiding this comment

rogpeppe Feb 15, 2018

Choose a reason for hiding this comment