Handle incoming resolve messages. #480

zenhack · 2023-03-16T22:58:57Z

Still TODO:

We need to handle disembargos with target = (importedCap = ...).
Testing.

Still TODO: - We need to handle disembargos with target = (importedCap = ...). - Testing.

The := below is sufficient, since this isn't actually used before that.

Needs testing.

This caught a bug: we need to check if ClientState.Metadata is nil, which is possible if the client itself is null, and seems to happen to the promise after it is resolved. TODO: we should de-dup the logic between releaseExport and releaseExports. This is not entirely trivial though, because the latter is executed after we've wiped the exports table.

zenhack · 2023-03-19T01:51:56Z

Alright, this is done.

This basically completes the promise resolution detour from 3PH, Though there's a semantic thing that I don't like, which is that Fulfill(client) feels like it should take ownership of client, but doesn't. I may submit another PR changing this before moving on.

lthibault · 2023-03-20T16:37:13Z

I'm very much in favor of taking the time to make reference ownership semantics consistent. 👍

lthibault

LGTM.

lthibault · 2023-03-20T17:03:10Z

Test is timing out.

Re-running.

zenhack · 2023-03-20T17:33:41Z

Note that there does appear to be a real deadlock that needs to be addressed; the test failures are not spurious.

These are much cleaner.

...now that the resolver takes ownership.

zenhack · 2023-03-22T04:16:16Z

Status update: there are at least two bugs, one which is a deadlock, and another that is a message-reordering problem.

The deadlock is "solved" by this change to the test:

diff --git a/rpc/senderpromise_test.go b/rpc/senderpromise_test.go
index 3bcaec4..7cb5ea2 100644
--- a/rpc/senderpromise_test.go
+++ b/rpc/senderpromise_test.go
@@ -392,6 +392,7 @@ func TestPromiseOrdering(t *testing.T) {
 		if i == 100 {
 			go func() {
 				bs := testcapnp.PingPong(c1.Bootstrap(ctx))
+				assert.NoError(t, bs.Resolve(ctx))
 				r.Fulfill(bs)
 			}()
 		}

So the problem I think arises when a promise resolves to another promise (in this case a question; not sure if that matters).

I'm attaching a log for the ordering problem. Need to dig in still.
log.txt

zenhack · 2023-03-23T13:33:23Z

Solved the deadlock (kinda... see the commit message), still need to work out the ordering problem.

Worryingly, this was manifesting in the test as a deadlock: we hit the error complaining about it not being an import, but then connection shutdown hung, waiting on tasks. I haven't pinned down exactly what was going on there, but this sidesteps the issue by fixing the thing that was causing a connection abort in the first place.

lthibault · 2023-03-23T14:39:05Z

@zenhack Do we have an issue open for the shutdown bug?

zenhack · 2023-03-23T14:42:26Z

Not that I know of; I have no memory of having seen something like this before.

zenhack · 2023-03-30T04:16:03Z

I merged back in main, and also reworked the way NewLocalPromise works to avoid some dodgy code that I think has race conditions. I held out some hope that this would fix the ordering bug, but it seems to have no effect.

zenhack · 2023-03-30T04:28:31Z

Discussed some on matrix, but for posterity: it's not clear to me that the tribble 4-way race condition actually requires three distinct vats, so it could be a problem even in a level 1 implementation, if each link in the promise chain crosses the network. The failure we're seeing matches this description, so I am wondering if it is a symptom of the 4-way race condition. The flow looks something like:

vat A requests vat B's bootstrap interface, and starts making calls on it (before receiving the return). The question for the bootstrap is the first promise.
vat B returns a senderPromsie as its bootstrap.
At some point later, vat B requests vat A's bootstrap interface and fulfills the promise it sent with the result. Note that the bug crops up regardless of whether or not vat B waits for the return for the bootstrap message. If it does not there are three promises, but even if it does there are still two.

I'm going to put this PR down until I've more deeply understood the implications. I still haven't totally grokked what exactly goes wrong in the 4-way race condition, only that there is an assumption being violated (i.e. I don't know what an example failure due to this issue should look like). I plan on figuring this out, and probably also stepping back to fix the way we do promise resolution in the library; regardless of whether this hunch is correct, we'll need to for 3PH.

zenhack · 2023-03-31T18:54:07Z

Per #494 (comment), my suspicion was correct: 3PH is not required for the race, and it seems like a good bed that's what's causing the failure. So I'll work on that refactor before coming back to this.

...instead of a Client. This is a step towards addressing the tribble 4-way race condition. Currently there are some test failures.

The call to waitRef.Release() in export.go was resolving the receiver too soon, resulting in a leak.

zenhack · 2023-05-27T04:05:01Z

I've merged in main and #520, still reconciling the latter.

Has build errors that need fixing.

I plan on adding *Snapshot versions and eventually storing snapshots internally.

This seems like an unnecessary complication, and I don't want to deal with it when porting stuff over to snapshots.

...and use it to catch a bug; the test is failing, need to track it down.

Otherwise the target message steals the caps, resulting in errors later when we try to read them.

Per capnproto#523, this is currently failing frequently. Mark it as flaky until we're ready to actually deal with it.

In addition to letting us hand out borrowed references, Storing the snapshots will facilitate code that needs them not to resolve further after AddSnapshot() or SetSnapshot().

There are test failures that need debugging though.

zenhack · 2023-06-17T05:54:02Z

Superseded by #530, closing.

zenhack marked this pull request as draft March 16, 2023 22:59

zenhack added 5 commits March 16, 2023 18:59

WIP: handle incoming resolve messages.

a3ffc4e

Still TODO: - We need to handle disembargos with target = (importedCap = ...). - Testing.

cleanup: remove unnecessary declaration of importClient.

36b13bf

The := below is sufficient, since this isn't actually used before that.

Factor some logic out of handleDisembargo

3cbff97

First stab at disembargos on imports.

9fbd8c6

Needs testing.

zenhack marked this pull request as ready for review March 19, 2023 01:49

zenhack requested a review from lthibault March 19, 2023 01:50

zenhack changed the title ~~WIP: handle incoming resolve messages.~~ Handle incoming resolve messages. Mar 19, 2023

Add another test wrt promise resolution.

d50b6ce

lthibault approved these changes Mar 19, 2023

View reviewed changes

lthibault previously approved these changes Mar 20, 2023

View reviewed changes

zenhack dismissed lthibault’s stale review via b11ac49 March 21, 2023 00:33

zenhack added 2 commits March 20, 2023 20:35

Fix the ownership semantics for NewLocalPromise

b11ac49

These are much cleaner.

Remove no longer necessary .Release()

a1028be

...now that the resolver takes ownership.

lthibault previously approved these changes Mar 21, 2023

View reviewed changes

Merge remote-tracking branch 'origin/main' into handle-resolve

0799051

Minor style cleanup

9a40e87

zenhack dismissed lthibault’s stale review via e46bf47 March 23, 2023 13:32

lthibault previously approved these changes Mar 23, 2023

View reviewed changes

Merge remote-tracking branch 'origin/main' into handle-resolve

14b7864

Merge branch 'localpromise-fixes' into handle-resolve

d7861ab

zenhack added 4 commits May 26, 2023 17:07

WIP: put a ClientSnapshot in the exports table

2e138f0

...instead of a Client. This is a step towards addressing the tribble 4-way race condition. Currently there are some test failures.

ClientSnapshot.Release(): take pointer receiver.

9999902

The call to waitRef.Release() in export.go was resolving the receiver too soon, resulting in a leak.

Do leak reporting for ClientSnapshots, not just Clients.

7213fc1

Merge remote-tracking branch 'origin/main' into handle-resolve

1cc3130

zenhack added 14 commits May 27, 2023 00:06

Merge branch 'export-snapshot' into handle-resolve

8d71f0b

Has build errors that need fixing.

CapTable: rename Get -> GetClient, Add -> AddClient

91d062a

I plan on adding *Snapshot versions and eventually storing snapshots internally.

CapTable: Add *Snapshot variants of Add & Get

5e0023a

Separate client/snapshot versions of CapTable.At()

abf193b

Remove argument to CapTable.Reset()

76f95d5

This seems like an unnecessary complication, and I don't want to deal with it when porting stuff over to snapshots.

CapTable.Set: split into client/snapshot variants

61ca9e5

CapTable: change internal table to store snapshots.

f967703

WIP: Add a Steal() method to Client and ClientSnapshot

fe08d87

...and use it to catch a bug; the test is failing, need to track it down.

Add missing call to .AddRef()

afeb534

pogs tests: clone input before inserting.

7a23f9b

Otherwise the target message steals the caps, resulting in errors later when we try to read them.

Mark TestDuplicateBootstrap as flaky.

2372b53

Per capnproto#523, this is currently failing frequently. Mark it as flaky until we're ready to actually deal with it.

Merge branch 'client.Steal' into captable-snapshots

1acb315

Merge branch '523-flaky' into captable-snapshots

843e8b6

CapTable: maintain snapshots & clients in parallel.

575916e

In addition to letting us hand out borrowed references, Storing the snapshots will facilitate code that needs them not to resolve further after AddSnapshot() or SetSnapshot().

zenhack mentioned this pull request May 27, 2023

Add a Steal() method to Client and ClientSnapshot #522

Open

zenhack added 2 commits May 27, 2023 17:07

Merge branch 'captable-snapshots' into handle-resolve

75200fc

Fix build errors.

0a99d60

There are test failures that need debugging though.

zenhack mentioned this pull request May 29, 2023

Store ClientSnapshots in the CapTable. #525

Closed

zenhack mentioned this pull request Jun 17, 2023

WIP: Handle incoming resolve messages, take 2 #530

Merged

zenhack closed this Jun 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Handle incoming resolve messages. #480

Handle incoming resolve messages. #480

Uh oh!

zenhack commented Mar 16, 2023

Uh oh!

zenhack commented Mar 19, 2023

Uh oh!

lthibault commented Mar 20, 2023

Uh oh!

lthibault left a comment

Uh oh!

lthibault commented Mar 20, 2023

Uh oh!

zenhack commented Mar 20, 2023

Uh oh!

zenhack commented Mar 22, 2023

Uh oh!

zenhack commented Mar 23, 2023

Uh oh!

lthibault commented Mar 23, 2023

Uh oh!

zenhack commented Mar 23, 2023

Uh oh!

zenhack commented Mar 30, 2023

Uh oh!

zenhack commented Mar 30, 2023

Uh oh!

zenhack commented Mar 31, 2023

Uh oh!

zenhack commented May 27, 2023

Uh oh!

zenhack commented Jun 17, 2023

Uh oh!

Uh oh!

Uh oh!

Handle incoming resolve messages. #480

Handle incoming resolve messages. #480

Uh oh!

Conversation

zenhack commented Mar 16, 2023

Uh oh!

zenhack commented Mar 19, 2023

Uh oh!

lthibault commented Mar 20, 2023

Uh oh!

lthibault left a comment

Choose a reason for hiding this comment

Uh oh!

lthibault commented Mar 20, 2023

Uh oh!

zenhack commented Mar 20, 2023

Uh oh!

zenhack commented Mar 22, 2023

Uh oh!

zenhack commented Mar 23, 2023

Uh oh!

lthibault commented Mar 23, 2023

Uh oh!

zenhack commented Mar 23, 2023

Uh oh!

zenhack commented Mar 30, 2023

Uh oh!

zenhack commented Mar 30, 2023

Uh oh!

zenhack commented Mar 31, 2023

Uh oh!

zenhack commented May 27, 2023

Uh oh!

zenhack commented Jun 17, 2023

Uh oh!

Uh oh!