Support for `funcref`s, `ref.func`, and `table.grow` with `funcref`s #1901

fitzgen · 2020-06-18T18:35:06Z

No description provided.

github-actions · 2020-06-18T19:10:26Z

Subscribe to Label Action

cc @bnjbvr, @peterhuene

This issue or pull request has been labeled: "cranelift", "cranelift:wasm", "wasmtime:api"

Thus the following users have been cc'd because of the following labels:

bnjbvr: cranelift
peterhuene: wasmtime:api

To subscribe or unsubscribe from this label, edit the .github/subscribe-to-label.json configuration file.

Learn more.

alexcrichton

Nice, that was smaller than I expected! One thing we'll need to handle is that ValueType::from_wasmtime_type is incorrect if both externref and funcref translate to the same underlying cranelift type.

Other than that, though, I was hoping that we wouldn't have to allocate an ExternRef for dealing with funcref. There's no real need to GC them since they have the same lifetime of the Instance in theory, so it seems like wasted time to GC them and to allocate new data structures for them.

Thinking a bit about this, can we perhaps make the representation of funcref to be *mut VMCallerCheckedAnyfunc? That way it would still be word-sized (easy to manage) and we could easily implement calling that and such. Additionally it's very easy to convert that into a wasmtime::Func, and we'll probably just need to update wasmtime::Func to internally contain *mut VMCallerCheckedAnyfunc to make it easy to create a Func. "morally" at least I'd expect that going from funcref to wasmtime::Func should basically be "pair the internal representation with a Store and you're off to the races"

We statically know all possible indices that can be referenced by ref.func, so I think what we can do is something like this:

The function index may be referenced by an element segment (passive/declared/active). In that case we don't currently ever drop those until the end of the Store, so assuming that we store all these element segments as Vec<VMCallerCheckedAnyfunc> (as we do today for tables) we could just load a constant offset from an element segment (e.g function N is at element segment A at offset B)
Otherwise we can allocate a Vec<VMCallerCheckedAnyfunc> specifically for non-element-segment-referenced indices (those referenced through globals for now) and attach that the VMContext. I guess not really allocate a Vec<T> but we would just make the VMContext allocation bigger.

In any case I think it'd be best if we could leverage the current state of things to avoid GC/allocation overhead for funcref if we can. WDYT?

alexcrichton · 2020-06-18T19:03:40Z

crates/runtime/src/table.rs

+            #[cfg(target_pointer_width = "32")]
+            TableElements::ExternRefs(_) => TableElementType::Val(ir::types::R32),
+            #[cfg(target_pointer_width = "64")]
+            TableElements::ExternRefs(_) => TableElementType::Val(ir::types::R64),


FWIW this pattern with #[cfg] might be good to encapsulate somewhere with something like ir::types::rsize()

fitzgen · 2020-06-18T20:01:35Z

That definitely sounds like a better design, albeit more implementation effort. How strongly do you feel vs waiting until we actually care about the performance of funcref?

We statically know all possible indices that can be referenced by ref.func,

I know that this is true at the spec level with the refs member of the validation context, but do you know if we actually retain this info anywhere? Not sure this is exposed by wasmparser or cranelift_wasm.

alexcrichton · 2020-06-18T20:05:19Z

I think it's fine to take the shortest path for now regardless of perf to a complete implementation of reference types, but looking forward to things like table.copy and table.init on funcref tables I feel like we're gonna be jumping through a lot of hoops trying to convert back and forth between VMCallerCheckedAnyfunc and VMFuncRef. I'm fine deferring to you, but it feels to me that it'll be basically equivalent the amount of work to take either strategy (in that I'm not sure one is that much more onerous than the other).

Currently AFAIK the refs member isn't exposed, so cranelift-wasm would likely need to construct this.

fitzgen · 2020-06-18T20:16:29Z

My one concern with your proposed design is just figuring out where to stick the new side tables for VMCallerCheckedAnyfuncs and how to plumb that through everywhere. Its the kind of thing that seems like it might run into the annoying wasmtime/wasmtime-runtime crate boundary again. I'll investigate a little more and see if I can't make it work.

alexcrichton · 2020-06-18T20:25:15Z

Perhaps we could temporarily allocate a VMCallerCheckedAnyfunc for every function in the module, and shove that into the VMContext? That way the index of the function is the same as the index into that table. We could later optimize that to not allocate such a big table for functions that aren't referenced.

fitzgen · 2020-06-18T23:44:01Z

Pushed a wip commit that switches to using *const VMCallerCheckedAnyfunc for funcrefs. It compiles at least, but tests are failing.

github-actions · 2020-06-22T22:08:22Z

Subscribe to Label Action

cc @fitzgen

This issue or pull request has been labeled: "fuzzing"

Thus the following users have been cc'd because of the following labels:

fitzgen: fuzzing

To subscribe or unsubscribe from this label, edit the .github/subscribe-to-label.json configuration file.

Learn more.

fitzgen · 2020-06-22T22:52:43Z

@alexcrichton okay I think this is ready for review again.

github-actions · 2020-06-22T23:17:17Z

Subscribe to Label Action

cc @peterhuene

This issue or pull request has been labeled: "wasmtime:c-api"

Thus the following users have been cc'd because of the following labels:

peterhuene: wasmtime:c-api

To subscribe or unsubscribe from this label, edit the .github/subscribe-to-label.json configuration file.

Learn more.

alexcrichton

Looking great! My main comment below is about the change to Extern which had a pretty large impact, but I don't think should be necessary.

Otherwise, though, could you add some tests which acquire a Val::FuncRef and then test whether it's None or Some, and if Some calls the function to make sure it gets the right result?

Additionally I think some tests would be good for passing in various FuncRef values. For example Some, None, something from another store, something from another instance, etc.

crates/wasmtime/src/externals.rs

crates/environ/src/func_environ.rs

crates/runtime/src/instance.rs

crates/wasmtime/src/func.rs

crates/wasmtime/src/values.rs

alexcrichton · 2020-06-23T14:31:54Z

crates/wasmtime/src/values.rs

-            }
-        }
+        Val::FuncRef(None) => store.null_func_ref(),
+        Val::FuncRef(Some(f)) => f.caller_checked_anyfunc(),


An idle thought, but we could perhaps have something like:

impl Func { pub(crate) fn into_raw(self, dst: &Store) -> Result<NonNull<VMCallerCheckedAnyfunc>> { // ... } }

to bake in how you can't access the raw value without using the right store. Anyway not necessary to do here, just a thought!

crates/wasmtime/src/values.rs

crates/wast/src/spectest.rs

fitzgen · 2020-06-23T18:55:15Z

@alexcrichton okay, I've deduped the null func refs into the store, and made Extern::Func non-nullable.

Still need to write a couple tests that pass in Val::Func(None) to various things, but I think you can take another look a this while I do that.

alexcrichton · 2020-06-23T19:09:58Z

Looks great to me! I've got one final question though before landing, which I just thought of after reading this. Could we be even more clever and make the null funcref a literal null pointer? It looks like you were worried about the cost of call_indirect in terms of number of checks, but I think that it's the same number of checks as before, we're just testing anyfunc_ptr for whether it's null instead of func_addr, which we assume is always non-null.

With that in place translation of ref.null would be super simple, just a bag of zeros all the time!

fitzgen · 2020-06-23T20:55:12Z

Looks great to me! I've got one final question though before landing, which I just thought of after reading this. Could we be even more clever and make the null funcref a literal null pointer? It looks like you were worried about the cost of call_indirect in terms of number of checks, but I think that it's the same number of checks as before, we're just testing anyfunc_ptr for whether it's null instead of func_addr, which we assume is always non-null.

With that in place translation of ref.null would be super simple, just a bag of zeros all the time!

The concern wasn't about the path for call_indirect with a null funcref (which should be a super rare event, and leads to a trap anyways, so it isn't ever gonna be fast). The concern was introducing an extra potential cache miss in front of every non-trapping call_indirect before we actually end up calling the function.

Originally, we couldn't use a null pointer for a null func ref, but after all the refactorings this PR has been through, that might not be the case anymore. Let me double check. Either way, I think ref.null will be cheap enough either way. But this actually made me discover a latent bug: cranelift-wasm doesn't allow you customize ref.null's translation (currently). It also always assumes that all references are of the reference type (which is not true for funcrefs for us). So I have a bit more work to do here either way.

alexcrichton · 2020-06-23T21:18:28Z

Oh I'm not actually worried about the cost of ref.null, it's more that using a null pointer for ref.null felt clearer and easier to understand.

Could you elaborate on the cache misses, though? With this current incantation a call_indirect is: load anyref, load funcptr, check null, load type, check match, call. With a null pointer representation it would instead be load anyref, check null, load type, check match, load funcptr, call. I'm not sure how the cache misses would be different there?

fitzgen · 2020-06-23T21:24:35Z

The extra potential cache miss is from changing table elements from VMCallerCheckedAnyfunc to a pointer to a VMCallerCheckedAnyfunc (regardless whether that is a nullable pointer or not).

It makes it so that there is an extra pointer chase to get to the func_ptr member of an element in the table.

alexcrichton · 2020-06-23T21:30:08Z

Ah ok, I remember talking about that yeah. I thought you meant that the extra cache miss was about the representation of the null function in the sense of is it a null pointer or is it a sentinel.

…thods This serves two purposes: 1. It ensures that we call `get_or_create_table` to ensure that the embedder already had a chance to create the given table (although this is mostly redundant due to validation). 2. It allows the embedder to easily get the `ir::TableData` associated with this table, and more easily emit whatever inline JIT code to translate the table instruction (rather than falling back to VM calls).

* Allow different Cranelift IR types to be used for different Wasm reference types. * Do not assume that all Wasm reference types are always a Cranelift IR reference type. For example, `funcref`s might not need GC in some implementations, and can therefore be represented with a pointer rather than a reference type.

…ncIndex` It was previously taking a raw `u32`. This change makes it more clear what index space that index points into.

fitzgen · 2020-06-23T23:41:38Z

phew. okay.

I made ref.null func be a null pointer, rather than a VMCallerCheckedAnyfunc with a null func_ptr member.
similarly, VMCallerCheckedAnyfunc::func_ptr is now NonNull.
I added the extra translate_* hooks to cranelift_wasm::FuncEnvironment to allow for non-uniform reference types representations.

@alexcrichton I think this is ready for another (and hopefully the last!) round of review. Thanks for your patience with this one!

alexcrichton

Looks like there's some warnings on aarch64 causing CI to fail, but otherwise 👍 from me, thanks for being patient with me too :)

Mind adding a test or two for ref.is_null while you're at it?

crates/wasmtime/src/values.rs

`funcref`s are implemented as `NonNull<VMCallerCheckedAnyfunc>`. This should be more efficient than using a `VMExternRef` that points at a `VMCallerCheckedAnyfunc` because it gets rid of an indirection, dynamic allocation, and some reference counting. Note that the null function reference is *NOT* a null pointer; it is a `VMCallerCheckedAnyfunc` that has a null `func_ptr` member. Part of bytecodealliance#929

fitzgen requested a review from alexcrichton June 18, 2020 18:35

github-actions bot added cranelift Issues related to the Cranelift code generator cranelift:wasm wasmtime:api Related to the API of the `wasmtime` crate itself labels Jun 18, 2020

alexcrichton reviewed Jun 18, 2020

View reviewed changes

fitzgen force-pushed the ref-func branch from 9379e9c to a432872 Compare June 22, 2020 21:48

github-actions bot added the fuzzing Issues related to our fuzzing infrastructure label Jun 22, 2020

fitzgen force-pushed the ref-func branch from a432872 to baa295a Compare June 22, 2020 22:51

fitzgen requested a review from alexcrichton June 22, 2020 22:52

github-actions bot added the wasmtime:c-api Issues pertaining to the C API. label Jun 22, 2020

fitzgen force-pushed the ref-func branch 2 times, most recently from 0a9f160 to 7c0d77a Compare June 23, 2020 00:04

alexcrichton reviewed Jun 23, 2020

View reviewed changes

fitzgen force-pushed the ref-func branch from 7c0d77a to b9d223c Compare June 23, 2020 18:52

fitzgen requested a review from alexcrichton June 23, 2020 18:53

fitzgen force-pushed the ref-func branch from b9d223c to 88a12a1 Compare June 23, 2020 18:57

fitzgen added 4 commits June 23, 2020 16:36

cranelift-wasm: Retain both the Wasm and Cranelift types of tables

c6f32f6

cranelift-wasm: Make FuncEnvironment::translate_ref_func take a `Fu…

ddc2ce8

…ncIndex` It was previously taking a raw `u32`. This change makes it more clear what index space that index points into.

fitzgen force-pushed the ref-func branch from 88a12a1 to 6def311 Compare June 23, 2020 23:37

fitzgen force-pushed the ref-func branch from 6def311 to ce5d07f Compare June 24, 2020 00:28

alexcrichton approved these changes Jun 24, 2020

View reviewed changes

crates/wasmtime/src/values.rs Outdated Show resolved Hide resolved

fitzgen force-pushed the ref-func branch 2 times, most recently from c9cf3ec to 818a814 Compare June 24, 2020 17:01

fitzgen force-pushed the ref-func branch from 818a814 to 58bb5dd Compare June 24, 2020 17:08

fitzgen merged commit 1ea2f47 into bytecodealliance:master Jun 24, 2020

fitzgen deleted the ref-func branch June 24, 2020 17:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for `funcref`s, `ref.func`, and `table.grow` with `funcref`s #1901

Support for `funcref`s, `ref.func`, and `table.grow` with `funcref`s #1901

fitzgen commented Jun 18, 2020

github-actions bot commented Jun 18, 2020

alexcrichton left a comment

alexcrichton Jun 18, 2020

fitzgen commented Jun 18, 2020

alexcrichton commented Jun 18, 2020

fitzgen commented Jun 18, 2020

alexcrichton commented Jun 18, 2020

fitzgen commented Jun 18, 2020

github-actions bot commented Jun 22, 2020

fitzgen commented Jun 22, 2020

github-actions bot commented Jun 22, 2020

alexcrichton left a comment

alexcrichton Jun 23, 2020

fitzgen commented Jun 23, 2020

alexcrichton commented Jun 23, 2020

fitzgen commented Jun 23, 2020

alexcrichton commented Jun 23, 2020

fitzgen commented Jun 23, 2020

alexcrichton commented Jun 23, 2020

fitzgen commented Jun 23, 2020

alexcrichton left a comment

Support for funcrefs, ref.func, and table.grow with funcrefs #1901

Support for funcrefs, ref.func, and table.grow with funcrefs #1901

Conversation

fitzgen commented Jun 18, 2020

github-actions bot commented Jun 18, 2020

Subscribe to Label Action

alexcrichton left a comment

Choose a reason for hiding this comment

alexcrichton Jun 18, 2020

Choose a reason for hiding this comment

fitzgen commented Jun 18, 2020

alexcrichton commented Jun 18, 2020

fitzgen commented Jun 18, 2020

alexcrichton commented Jun 18, 2020

fitzgen commented Jun 18, 2020

github-actions bot commented Jun 22, 2020

Subscribe to Label Action

fitzgen commented Jun 22, 2020

github-actions bot commented Jun 22, 2020

Subscribe to Label Action

alexcrichton left a comment

Choose a reason for hiding this comment

alexcrichton Jun 23, 2020

Choose a reason for hiding this comment

fitzgen commented Jun 23, 2020

alexcrichton commented Jun 23, 2020

fitzgen commented Jun 23, 2020

alexcrichton commented Jun 23, 2020

fitzgen commented Jun 23, 2020

alexcrichton commented Jun 23, 2020

fitzgen commented Jun 23, 2020

alexcrichton left a comment

Choose a reason for hiding this comment

Support for `funcref`s, `ref.func`, and `table.grow` with `funcref`s #1901

Support for `funcref`s, `ref.func`, and `table.grow` with `funcref`s #1901