
Add support for funcrefs inside GC objects #9341

Merged

fitzgen merged 1 commit into bytecodealliance:main from funcrefs-in-gc-heap on Oct 1, 2024

Conversation

fitzgen
Member

@fitzgen fitzgen commented Oct 1, 2024

No description provided.

@fitzgen fitzgen requested a review from a team as a code owner October 1, 2024 00:01
@fitzgen fitzgen requested review from alexcrichton and removed request for a team October 1, 2024 00:01
@github-actions github-actions bot added wasmtime:api Related to the API of the `wasmtime` crate itself wasmtime:ref-types Issues related to reference types and GC in Wasmtime labels Oct 1, 2024

@fitzgen
Member Author

fitzgen commented Oct 1, 2024

Rebased to resolve conflicts.

Member

@alexcrichton alexcrichton left a comment


I think this is all reasonable enough, but it definitely amplifies my worries about the cost of GC on runtime performance. Every write of a funcref now requires at least a hash table lookup, and reads can require taking a read lock on a global RwLock. On one hand I realize that the goal is to get GC working, but on the other hand I have a hard time seeing how this will be a suitable performance profile for anyone actually using GC. I do realize, though, that improving on this will be significantly difficult: for example, exposing the slab to JIT code, or implementing a GC of the slab entries themselves.

In that sense I'm going to go ahead and approve this as-is, since I don't believe there are any correctness issues with it. (One minor worry is that the hash map isn't GC'd right now, but that mirrors the fact that we don't implement instance-level GC within a Store, so the two are more-or-less the same size, O(N).) Do you think it would be useful, though, to build up a list of the possible performance optimizations we can think of, to keep track of things? I still agree that this is the best route for getting things implemented to a point where we can measure performance and see what to optimize, but I'm also afraid of losing context on the myriad of possible ways to optimize things as PRs land incrementally.
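To make the cost model concrete, here is a minimal sketch (not Wasmtime's actual API; all names are illustrative) of the design under review: funcrefs are interned into a slab so that GC objects store a small id, a hash map deduplicates entries on the write path, and a typed read consults a shared registry modeled here as a global RwLock.

```rust
use std::collections::HashMap;
use std::sync::RwLock;

#[derive(Clone, Copy, PartialEq, Eq, Hash, Debug)]
struct FuncRef(usize); // stand-in for a raw `*mut VMFuncRef`

#[derive(Clone, Copy, PartialEq, Eq, Debug)]
struct FuncRefId(u32); // the small id stored inside the GC heap

#[derive(Default)]
struct FuncRefTable {
    slab: Vec<FuncRef>,
    dedup: HashMap<FuncRef, FuncRefId>,
}

impl FuncRefTable {
    // Write path: every store of a funcref into a GC object pays at
    // least this hash-map lookup.
    fn intern(&mut self, f: FuncRef) -> FuncRefId {
        if let Some(&id) = self.dedup.get(&f) {
            return id;
        }
        let id = FuncRefId(self.slab.len() as u32);
        self.slab.push(f);
        self.dedup.insert(f, id);
        id
    }

    // Read path: id-to-funcref itself is a cheap indexed load...
    fn get(&self, id: FuncRefId) -> FuncRef {
        self.slab[id.0 as usize]
    }
}

// ...but a typed read also needs a subtype check against shared type
// state, modeled here as a global RwLock.
static TYPE_REGISTRY: RwLock<Vec<u32>> = RwLock::new(Vec::new());

fn checked_get(table: &FuncRefTable, id: FuncRefId) -> FuncRef {
    let _types = TYPE_REGISTRY.read().unwrap(); // read lock on every check
    table.get(id)
}
```

The hash map is what makes writes more than a pointer store, and the read lock is what the later discussion about a lock-free supertype arena aims to remove.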

@fitzgen
Member Author

fitzgen commented Oct 1, 2024

Do you think it would be useful though to build up a list of the possible performance optimizations we can think of to keep track of things?

Yeah, I can write up an issue with a list of these.

@fitzgen
Member Author

fitzgen commented Oct 1, 2024

I share your concerns. It certainly is not going to be performant as-is. My most immediate thoughts are that we would do roughly the following to speed this up:

  • Expose the slab to Wasm, allowing id-to-funcref conversion to happen fully within Wasm code.
  • For funcref-to-id conversion, add an LRU/associative cache to the vmctx (or maybe the runtime limits) to cache the results of the libcall and allow the happy path to stay within Wasm code. The slow path would still fall back to a libcall, however (I do not want to implement hashing in Wasm code and try to keep it in sync with the Rust hashing).

My hope is that the above would result in good enough perf for us to not have to revisit this for quite a while.
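The second bullet could look roughly like the following sketch: a small direct-mapped cache (a simple variant of the associative cache described above) whose lookup compiles down to a masked index plus one comparison, so JIT code can try it inline before falling back to the libcall. This is purely illustrative; the struct and field names are hypothetical, not Wasmtime's vmctx layout.

```rust
const CACHE_SLOTS: usize = 16;

#[derive(Clone, Copy)]
struct Entry {
    funcref: usize, // raw funcref address; 0 means the slot is empty
    id: u32,
}

struct FuncRefIdCache {
    entries: [Entry; CACHE_SLOTS],
}

impl FuncRefIdCache {
    fn new() -> Self {
        Self { entries: [Entry { funcref: 0, id: 0 }; CACHE_SLOTS] }
    }

    // Fast path, intended to be inlined into JIT code: one masked index
    // and one comparison. A miss falls back to the libcall.
    fn lookup(&self, funcref: usize) -> Option<u32> {
        let e = self.entries[funcref % CACHE_SLOTS];
        (e.funcref == funcref).then(|| e.id)
    }

    // Slow path: after the libcall computes the id (hashing stays in
    // Rust), record the result for next time.
    fn insert(&mut self, funcref: usize, id: u32) {
        self.entries[funcref % CACHE_SLOTS] = Entry { funcref, id };
    }
}
```

Because most modules convert a small, hot set of funcrefs repeatedly, even a tiny table like this could keep the common case out of the libcall entirely.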

@fitzgen fitzgen added this pull request to the merge queue Oct 1, 2024
@alexcrichton
Member

Two questions actually:

  • On the write side, I was hoping we could do something purely in JIT code, where it allocates space from the slab itself and the slab slots are managed by the normal GC. Would that be possible? For example, during tracing we'd record which slab slots were in use, and deallocation would iterate over the existing slab slots (compaction may not be too hard either).
  • On the read side, is there any way to avoid the lock? It seems required currently for our threat model, but I'm basically wondering if there's any way we can get away with just an index comparison as opposed to a full subtype check.

Merged via the queue into bytecodealliance:main with commit f07c441 Oct 1, 2024
39 checks passed
@fitzgen fitzgen deleted the funcrefs-in-gc-heap branch October 1, 2024 16:51
@fitzgen
Member Author

fitzgen commented Oct 1, 2024

  • On the read side is there any way to avoid the lock?

I think we solve this via the approach we have talked about previously: a shared, lock-free arena for the supertype arrays, so that subtyping checks don't need to lock anything.

Will write up an issue about this soon.
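For context, the supertype-array technique mentioned here (sometimes called a "display") makes a subtype check constant-time: each type stores the array of all its supertypes by depth, so `a <: b` becomes one bounds check plus one indexed comparison, and if the arrays live in a shared, immutable arena there is nothing to lock. A minimal sketch, with hypothetical names:

```rust
#[derive(Clone, Copy, PartialEq, Eq, Debug)]
struct TypeId(u32);

struct TypeInfo {
    // supertypes[d] is this type's ancestor at depth d in the subtyping
    // chain; the last element is the type itself.
    supertypes: Vec<TypeId>,
}

impl TypeInfo {
    fn depth(&self) -> usize {
        self.supertypes.len() - 1
    }
}

// `a` is a subtype of `b` iff `b` appears in `a`'s supertype array at
// `b`'s own depth: a bounds check plus one comparison, no locking.
fn is_subtype(a: &TypeInfo, b: &TypeInfo, b_id: TypeId) -> bool {
    let d = b.depth();
    a.supertypes.len() > d && a.supertypes[d] == b_id
}
```

This is why the lock can go away: the check reads only the candidate type's own immutable array, never the shared registry's mutable state.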

@fitzgen
Member Author

fitzgen commented Oct 1, 2024

On the write side, I was hoping we could do something purely in JIT code, where it allocates space from the slab itself and the slab slots are managed by the normal GC. Would that be possible? For example, during tracing we'd record which slab slots were in use, and deallocation would iterate over the existing slab slots (compaction may not be too hard either).

I think the tracing part is definitely doable, and compaction should also be possible. But the tricky part will be deduplicating funcrefs in the slab (i.e., the reason the hash map is there now). Without that deduping, I fear it will be too easy to fill the funcref table, require a GC, and end up thrashing the collector.
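The tracing idea, including the tricky part of keeping the dedup map consistent, could be sketched as follows: during a collection, mark the slab slots referenced by live GC objects, then sweep unmarked slots and drop their dedup-map entries so later interning doesn't hand out stale ids. This is a hypothetical illustration, not Wasmtime code.

```rust
use std::collections::HashMap;

struct Slab {
    slots: Vec<Option<usize>>,    // Some(raw funcref) or free
    dedup: HashMap<usize, usize>, // funcref -> slot index
}

impl Slab {
    // `marked[i]` is true iff slot i was found reachable during tracing.
    fn sweep(&mut self, marked: &[bool]) {
        for (i, slot) in self.slots.iter_mut().enumerate() {
            if let Some(f) = *slot {
                if !marked[i] {
                    // Keep the dedup map in sync so a later intern of the
                    // same funcref allocates a fresh slot.
                    self.dedup.remove(&f);
                    *slot = None;
                }
            }
        }
    }
}
```

Without the dedup map, every store of the same funcref would consume a fresh slot, which is exactly the table-filling and collector-thrashing failure mode described above.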

@fitzgen
Member Author

fitzgen commented Oct 1, 2024

#9352

@fitzgen
Member Author

fitzgen commented Oct 1, 2024

#9351
