[spec] Reverse subtyping #110

rossberg · 2020-10-01T16:19:38Z

This (finally) adds the ability to do reverse subtyping on records and variants. I tried to include various motivation, explanation, and some simple examples to make those rather exotic rules clearer.

Also filled in the missing pieces of lexical syntax for values.

chenyan-dfinity

LGTM. Just some typo.

spec/Candid.md

Co-authored-by: Yan Chen <[email protected]>

spec/Candid.md

nomeata · 2020-10-02T06:55:59Z

spec/Candid.md

+In order to be able to evolve and extend variant types that also occur in outbound position (i.e., are used both as function results and function parameters), the subtype relation also supports *adding* tags to variants, provided the variant itself is optional.
+```
+opt variant { <fieldtype>;* } <: opt variant { <fieldtype'>;* }
+----------------------------------------------------------------------------------------------
+opt variant { <nat> : opt <datatype>; <fieldtype>;* } <: opt variant { <fieldtype'>;* }
+```


It seems that this rule is actually just an instance of the

not (<datatype> <: <datatype'>) --------------------------------- opt <datatype> <: opt <datatype'>

rule, isn’t it?

Ah, but they have different elaboration of course… this makes your rules overlapping (I think you avoided that before…) A problem? Since this is just defining a binary relation on types at this point (and not an elaboration) it feels strange to use more rules than necessary.

Maybe it’s worth defining <: in a non-overlapping way, with simply an unrestricted

--------------------------------- opt <datatype> <: opt <datatype'>

And then doing the different cases in the elaborated version.

Or simply only define the elaborated version…

But I thought we’d agree we need different relations for “sensible evolution” and “decoding”?

Ah, but they have different elaboration of course… this makes your rules overlapping (I think you avoided that before…) A problem?

Yes, it would be better if this wasn't there. Same problem as above. For now, I added similar handwaving to pepper over the non-determinism.

But I thought we’d agree we need different relations for “sensible evolution” and “decoding”?

I realised that doesn't actually work. You need to allow all of decoding in the evolution check, because cases like that might not be under your own control: some type you use may originate from another canister, that has performed multiple evolutionary steps that are transitively non-coherent, but you have missed the intermediate step that would avoid the "ugly" transitive case when you upgrade yourself.

Hence the informal language in L785+ instead, about discouraging users to create that case and implementations to warn about it.

IOW, we need the completeness property after all, even if some cases hopefully never occur in practice if everybody follows good style.

I don't follow the example. Even if you miss a step, and even if there there is incoherence, the “interface evolution” relation would still be transitive, woudn’t it? But maybe there is some complicate higher-order case where some canister is in the unfortunate situation where it is force to upgrade it’s interface in an undesirable way.

Our previous conclusion was (unless I misremember something) that the transitive rule (opt t <: opt t' with incompatible t, t') should not be allowed in the upgrade check, because that check always deals with a single evolutionary step where you simply shouldn't be doing that. And that we hence should define two relations, one with and the other without this rule (or something like that).

But the assumption that the check always deals with a single evolutionary step is bogus if your interface has to adapt to (the latest version of) somebody else's interface types, whose evolution might have a different pace, and who makes bad transitive evolutionary choices outside your control. In that case you may need the "bad" rule on options even during upgrade checks.

Our previous conclusion was (unless I misremember something) that the transitive rule (opt t <: opt t' with incompatible t, t') should not be allowed in the upgrade check,

yes… but

because that check always deals with a single evolutionary step where you simply shouldn't be doing that

wasn’t the reason, I think. I thought the reason was that we only had problems when you need the bad rules due to transitivity because you composed two serivces, and each was doing one (or more) sensible steps. But the transtive relation was only relevant for decoding, not an actual service evolution.

That’s why I hope we can have a “can sensibly evolve” relation (probably with polarities), that’s transitive, and a separate relation “can be decoded at” relation that has the unwanted relations.

But maybe I am too optimistic, and there will always be corner cases where a service needs to change its interface is a “bad” way.

nomeata · 2020-10-02T06:59:44Z

spec/Candid.md

+not (<datatype> <: <datatype'>)
+---------------------------------
+opt <datatype> <: opt <datatype'>
+  ~> \x.join_opt (\y.


I don’t think we can allow such subtyping premises on the elaborated rules, as these define deserialization.

Shouldn’t this rule (without the premise, and with the if ∃ elaboration) subsume the normal rule?

Yeah, I tried to keep the structure of the rules without elaboration. But I think the problem is that this rules try to express a phase separation that doesn't actually exist. All the rules should be considered "runtime". I removed the other rule and reformulated this one non-deterministically and added a comment admitting that the formulation is somewhat hand-wavy. Doing it properly would require a 4-place relation like v:t <: v':t', I think, but I'd prefer leaving that for another time.

Yes, we can clean up separtely.

I don’t think it’d be a four-place relation: Didn’t we determine that the input data should be considered untyped (and the type description just be an encoding help)? So I’d expect v <: v' : t or maybe more suggestively v ~> v':t.

I suppose we could phrase it like that, but that would be a bit of a disconnect with the actual serialisation format, which does include a type -- it just may not be "principal".

But that disconnect would be intentional: Didn't we just last week realize that it was misguided to take that type as more than just a way to help understand the binary structure of the value?

I think the conceptual phase distinction between “decoding binary to abstract value” and “interpreting a value at the expected type” is very helpful (and how many implementatoins will naturally approach this, too).

spec/Candid.md

rossberg

@nomeata, PTAL.

spec/Candid.md

rossberg · 2020-10-05T12:06:18Z

spec/Candid.md

+not (<datatype> <: <datatype'>)
+---------------------------------
+opt <datatype> <: opt <datatype'>
+  ~> \x.join_opt (\y.


Yeah, I tried to keep the structure of the rules without elaboration. But I think the problem is that this rules try to express a phase separation that doesn't actually exist. All the rules should be considered "runtime". I removed the other rule and reformulated this one non-deterministically and added a comment admitting that the formulation is somewhat hand-wavy. Doing it properly would require a 4-place relation like v:t <: v':t', I think, but I'd prefer leaving that for another time.

spec/Candid.md

rossberg · 2020-10-06T10:37:11Z

spec/Candid.md

+In order to be able to evolve and extend variant types that also occur in outbound position (i.e., are used both as function results and function parameters), the subtype relation also supports *adding* tags to variants, provided the variant itself is optional.
+```
+opt variant { <fieldtype>;* } <: opt variant { <fieldtype'>;* }
+----------------------------------------------------------------------------------------------
+opt variant { <nat> : opt <datatype>; <fieldtype>;* } <: opt variant { <fieldtype'>;* }
+```


Ah, but they have different elaboration of course… this makes your rules overlapping (I think you avoided that before…) A problem?

Yes, it would be better if this wasn't there. Same problem as above. For now, I added similar handwaving to pepper over the non-determinism.

But I thought we’d agree we need different relations for “sensible evolution” and “decoding”?

I realised that doesn't actually work. You need to allow all of decoding in the evolution check, because cases like that might not be under your own control: some type you use may originate from another canister, that has performed multiple evolutionary steps that are transitively non-coherent, but you have missed the intermediate step that would avoid the "ugly" transitive case when you upgrade yourself.

Hence the informal language in L785+ instead, about discouraging users to create that case and implementations to warn about it.

IOW, we need the completeness property after all, even if some cases hopefully never occur in practice if everybody follows good style.

nomeata

Maybe not in this PR, but reading this I get a strong urge to re-write the spec in terms of

A relation T <:: T' of “good” updates, that is more restrictive that what we allow (in particular, if your service defines an optional field and later removes it, this is marked using opt null).
A set V of untyped values (essentially the AST of the encoded values).
A relation V : T to describe the values of the types (likely without built-in subtyping, i.e. {foo = true} /: record {}, as it is not needed here)
A type-indexed decoding function T ~> (V -> T ∪ {fail}), which by construction ignores the type annotation on the input value (V is the set of Candid values), and models decoding failure explicitly (which is needed to describe the backtracking behavior).

And then we can describe (and prove!) properties that we want to hold for these things, e.g.

∀ t ∀ v ∈ V. t ~> f ⟹ (f v) = fail ∨ (f v) : t
∀ t ∀ v : t. (f v) = v (round-tripping is possible)
<:: is transitive
∀ t1 t2 ∀ v ∈ V. t1 <:: t2 ⟹ t1 ~> f1 ⟹ t2 ~> f2 ⟹ (f2 v) ≠ fail ⟹(f1 v) ≠ fail
(Something similar for the polarity switched version of <::, if it needs polarities)

But maybe not in this PR.

spec/Candid.md

Co-authored-by: Joachim Breitner <[email protected]>

the changes in #110 are definitely more than editorial, so even if we don’t bump the Magic Bytes, I think we should definitely bump the version of the spec.

* Bump spec version the changes in #110 are definitely more than editorial, so even if we don’t bump the Magic Bytes, I think we should definitely bump the version of the spec. * Update spec/Candid.md Co-authored-by: Joachim Breitner <[email protected]> Co-authored-by: Yan Chen <[email protected]>

this adds some tests to account for the new behaviour of #110 and #128 and #134 and #137. Co-authored-by: chenyan-dfinity <[email protected]>

[spec] Reverse subtyping

5308d5e

rossberg requested review from nomeata and chenyan-dfinity October 1, 2020 16:19

chenyan-dfinity reviewed Oct 2, 2020

View reviewed changes

Apply suggestions from code review

6122335

Co-authored-by: Yan Chen <[email protected]>

nomeata reviewed Oct 2, 2020

View reviewed changes

rossberg added 3 commits October 2, 2020 09:11

Say something about covert channels and lack of coherence

fb06196

Tweak rules

638e80f

Joachim's comments

5e84198

rossberg commented Oct 6, 2020

View reviewed changes

nomeata approved these changes Oct 6, 2020

View reviewed changes

spec/Candid.md Outdated Show resolved Hide resolved

spec/Candid.md Outdated Show resolved Hide resolved

Apply suggestions from code review

ee84d78

Co-authored-by: Joachim Breitner <[email protected]>

rossberg merged commit c1662ab into master Oct 6, 2020

rossberg deleted the sub branch October 6, 2020 13:38

nomeata added a commit that referenced this pull request Oct 7, 2020

Bump spec version

1678f67

the changes in #110 are definitely more than editorial, so even if we don’t bump the Magic Bytes, I think we should definitely bump the version of the spec.

nomeata mentioned this pull request Oct 7, 2020

Bump spec version #112

Merged

This was referenced Oct 27, 2020

Candid subtyping meta-issue dfinity/motoko#1523

Closed

Candid subtyping: Dynamic solution with type hash dfinity/motoko#1525

Closed

Candid record extension: The asymmetric solution dfinity/motoko#1509

Closed

nomeata mentioned this pull request Nov 2, 2020

Some more tests related to subtyping #126

Merged

nomeata added a commit that referenced this pull request Nov 21, 2020

Tests for new subtyping rules (#126)

f6f794f

this adds some tests to account for the new behaviour of #110 and #128 and #134 and #137. Co-authored-by: chenyan-dfinity <[email protected]>

This was referenced Nov 22, 2020

Still not sound: Gradual typing vs. opportunistic decoding #141

Closed

[IDL] Optimistic subtyping dfinity/motoko#1959

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[spec] Reverse subtyping #110

[spec] Reverse subtyping #110

rossberg commented Oct 1, 2020 •

edited

Loading

chenyan-dfinity left a comment

nomeata Oct 2, 2020 •

edited

Loading

rossberg Oct 6, 2020 •

edited

Loading

nomeata Oct 6, 2020

rossberg Oct 6, 2020

nomeata Oct 6, 2020

nomeata Oct 2, 2020

rossberg Oct 5, 2020

nomeata Oct 6, 2020

rossberg Oct 6, 2020

nomeata Oct 6, 2020 •

edited

Loading

rossberg left a comment

rossberg Oct 5, 2020

rossberg Oct 6, 2020 •

edited

Loading

nomeata left a comment •

edited

Loading

[spec] Reverse subtyping #110

[spec] Reverse subtyping #110

Conversation

rossberg commented Oct 1, 2020 • edited Loading

chenyan-dfinity left a comment

Choose a reason for hiding this comment

nomeata Oct 2, 2020 • edited Loading

Choose a reason for hiding this comment

rossberg Oct 6, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

nomeata Oct 6, 2020 • edited Loading

Choose a reason for hiding this comment

rossberg left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rossberg Oct 6, 2020 • edited Loading

Choose a reason for hiding this comment

nomeata left a comment • edited Loading

Choose a reason for hiding this comment

rossberg commented Oct 1, 2020 •

edited

Loading

nomeata Oct 2, 2020 •

edited

Loading

rossberg Oct 6, 2020 •

edited

Loading

nomeata Oct 6, 2020 •

edited

Loading

rossberg Oct 6, 2020 •

edited

Loading

nomeata left a comment •

edited

Loading