Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

clarification on semantics of ??= #11

Open
sigmundch opened this issue May 19, 2015 · 6 comments
Open

clarification on semantics of ??= #11

sigmundch opened this issue May 19, 2015 · 6 comments

Comments

@sigmundch
Copy link

Question for @gbracha and @stereotype441

The proposal says that the semantics of a ??= b are the same as:

  a = a ?? b

This means, a will be assigned it's own value when it's not null. This can be observable if you do:

get v => 1
set v(value) { print('setter was called with $value');
main() {
  v ??= 2; // prints "setter was called with 1"
  v ??= 3; // prints "setter was called with 1"
}

However, some tests are written with a different semantics that avoids the assignment if the value is not null. In other words:

  ((v) => v == null ? a = b : v)(a)

I currently have the former implemented in dart2js and I'm inclined to change those tests to match, but I wanted to verify this was the intended behavior.

@stereotype441
Copy link

I raised this question with Gilad in March, and I believe that is what prompted him to change the semantics from a = a ?? b to ((v) => v == null ? a = b : v)(a). (Note that the spec changes that were made in https://codereview.chromium.org//1031323002 are consistent with the tests). So yes, it was a deliberate change and my preference would be to keep it the way it's currently spec'ed (and tested).

For context, here's the line of reasoning that motivated me to suggest the change:

  • Given the short-cutting semantics of ?? (RHS is only evaluated when necessary), I think it's reasonable for a user to expect further short-cutting of ??= (assignment is only performed when necessary).
  • It's very rare (outside of contrived tests) to write code where a = a has deliberate side-effects. So for practical purposes the only difference between the two possible interpretations of ??= is execution speed. And unfortunately, we can't rely on the implementation to optimize away the redundant assignment. Consider the sophistication of analysis that has to be performed in order to determine that the assignment is redundant in the case of a[b] ??= c. In addition to proving some subtle invariants about the implementation of Map (or whatever the type of a is), you also have to check that the evaluation of c won't affect a. The practical upshot of this is that if we interpret a ??= b as shorthand for a = a ?? b, then ??= will often be slower than what the user intends, making the benefit of the syntactic sugar dubious.
  • If we interpret a ??= b as a = a ?? b, then a?.b ??= c becomes a?.b = a?.b ?? c, which has the very odd semantics that if a is null, then c is evaluated and then its result is thrown away! It's far more likely that what the user intends is (a == null ? null : a.b ??= c).

@sigmundch
Copy link
Author

Thanks for the detailed answer Paul - and thanks for the link to the spec changes! I was only aware of the proposal in this repo, I didn't know the changes were already added to the actual spec, I'll take a look there as well.

Regarding:

which has the very odd semantics that if a is null, then c is evaluated and then its result is thrown away!

Not sure I follow this, given that a?.b = exp semantics are:

((x) => x == null ? null : x.b = exp)(a)

It seems that desugarizing a ??= b as a = a ?? b should yield the same result in this case:

((x) => x == null ? null : x.b = a?.b ?? c)(a)

@sigmundch
Copy link
Author

Follow up question: how many times do we print read in this code?

var _x = new A();
get x {
  print('read');
  return _x;
}

main() {
  x?.y ??= 3;
}

It seems like under the old semantics we'd print it twice:

x?.y ??= 3

// is equivalent to:
x?.y = x?.y ?? 3

// which corresponds to:
t1 = x;  // (x in the lhs, prints "read")
if (t1 != null) {
  t2 = x; // (x in the rhs, prints "read")
  if (t2 != null) {
   t3 = t2.y;
  }
  if (t3 == null) t3 = 3;
  t1.y = t3;
}

It appears that under the new semantics we print it 3 times:

x?.y ??= 3
// expands to:
t1 = x?.y;  // prints "read"
if (t1 == null) {
  x?.y = x?.y ?? 3 // prints read twice like above.
}

is that correct?

@stereotype441
Copy link

Regarding a?.b ??= c, you are right; please ignore what I previously said about this case.

Regarding the question of how many times "read" should be printed in the expression x?.y ??= 3, it should only be printed once. Using the equivalent expression in the spec (https://codereview.chromium.org/1031323002/diff/1/docs/language/dartLangSpec.tex#newcode4481):

x?.y ??= 3
// expands to:
((z) => z == null ? null : z.y ??= 3)(x)

(to avoid confusion I used z instead of x for the parameter in the function literal). This means that the getter for x is only called once, prior to invoking the function expression.

In general all the null-aware operators should evaluate subexpressions once if necessary, and zero times otherwise. Otherwise subexpressions with side effects will behave unexpectedly. Consider that a user might want to do myList[i++]?.y ??= 3, in which case they expect i to be incremented exactly once. I believe all the equivalent expressions in the spec achieve this; if you find one that doesn't I'd love to hear about it.

@stereotype441
Copy link

Sorry I didn't point you to the spec changes earlier. I've updated the tracking bugs dartbug.com/23454 and dartbug.com/23455 to point to the spec changes so that hopefully Matthias won't fall into the same trap.

@sigmundch
Copy link
Author

Thanks Paul!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants