[wgpu-core] Return submission index for `map_async` and `on_submitted_work_done` to track down completion of async callbacks #6360

eliemichel · 2024-10-03T07:11:36Z

Problem
wgpu-native is upgrading to latest versions of webgpu-headers (see gfx-rs/wgpu-native#427), and a notable change since the last sync is the introduction of WGPUFuture. Such structure holds a u64, from which we must be able to tell whether the async operation has completed.

A good candidate for the payload of WGPUFuture is the submission index already used in some places (and u64::MAX to mean "a future that is already resolved, probably because submission failed"). The only issue is that submit operations do not necessarily return a submission index.

Description
This PR makes functions buffer_map_async, add_work_done_closure, map_async, queue_on_submitted_work_done return the submission index that can later be used to check that the operation completed.

Testing
This is a WIP, I need feedback on the use of last_successful_submission_index in queue_on_submitted_work_done (is this the right fallback?), feedback about how to check that an operation completed from its submission index, and there is something to discuss about map_async:

What I do here is to increment the active_submission_index to make sure the operation is not considered complete too early, but maybe this can have the callback be invoked with some delay if the buffer mapping operation was not the last in queue?
The problem is that triage comes later, so when map_async returns, we do not know yet the true submission index of the map operation (it may be lower than the active_submission_index, right? If not then the proposed solution is ok, otherwise should we trigger triage right away? I guess there is a good reason why it is done separately though.

Wumpf · 2024-10-20T19:52:08Z

I looked a tiny little bit into this and I think the approach with just looking at whatever the active or last successful submission index is not going to work well:
The WebGPU spec formulates the conditions for when a buffer mapping promise should resolve quite well: It's either when the device is lost or:

The [device timeline](https://www.w3.org/TR/webgpu/#device-timeline) becomes informed of the completion of an unspecified [queue timeline](https://www.w3.org/TR/webgpu/#queue-timeline) point:
* after the completion of currently-enqueued operations that use this
* and no later than the completion of all currently-enqueued operations (regardless of whether they use this).

The second bullet points means that the mapping has to be available at the very latest point when all active submissions at the time of calling map_async are done. This means that we can be certain that things are done if you're waiting for last_successful_submission_index (no plus one, not quite following why you think that's necessary, maybe I'm missing something?)
However, as the first bullet point indicates the mapping may succeed much much earlier! Namely if all submissions are that are using the buffer are completed. Imagine you issue 4 submissions of which submission 0 and 2 use the buffer. Then call map_async. It should now finish once submission 2 is done, before submission 3 and not even considering that the next submission index would be 4.
In wgpu this is handled by LifetimeTracker and it should be possible to prod it such to determine the point in time a buffer becomes available, but naturally this will be a bit tricker than what you've attempted so far.

Another high level issue that hasn't been adressed here so far (and wtf why is the CI not tripping on this 😱 ) is that by exposing this as part of the wgpu interface, you'll have to come up with a way to express this in WebGPU. My kneejerk reaction would be to just not to expose it in the first place (wgpu-native is using wgpu-core only, right?). But there has been some interest in this recently, see #6426. But still not sure I understand the need for as of now (see #6426 (comment), that said, the issue & discussion already highlighted at least major docs issues)

eliemichel · 2024-10-22T07:00:57Z

Thanks for taking the time to have a look! 🙏

no plus one, not quite following why you think that's necessary, maybe I'm missing something?

You're not missing anything, this +1 was added out of my doubt about how submission indices must be interpreted, I wanted to avoid reading the mapped buffer too early so conservativatively that I end up doing it too late!

I've read the code of the LifetimeTracker, but it is unclear to me how it determines when an async map operation terminates. It seems to be only about submitting things to the queue, rather than reading things back.

By exposing this as part of the wgpu interface, you'll have to come up with a way to express this in WebGPU.

If by "WebGPU" you mean the JavaScript API, this is supposedly exposed as a Promise object. I do not know how Firefox exactly uses wgpu to implement this API, I just modified wgpu to please the compiler as I am working on the wgpu-core (which indeed is all what wgpu-native needs). I replied with more details in #6426 because if I understand correctly this is all very remated!

Wumpf · 2024-10-22T11:22:01Z

but it is unclear to me how it determines when an async map operation

I'd need to dig in deeper again, but it tracks which buffers are waiting for mapping. triage_submissions is deciding given a submission id which buffer is ready to be mapped. So I believe it should be possible to figure out that id ahead of time.
If not we should document why not and use a different mechanism for the wgpu-native future ids

If by "WebGPU" you mean the JavaScript API, this is supposedly exposed as a Promise object.

No, that's not what I meant: The wgpu crate is an interface to either a host implementing WebGPU or an implementation of WebGPU (confusingly, one of them based on WebGL!). The former is found here, the later (that's what you adjusted so far) is here. Since wgpu is supposed to have a consistent api across both, we make those promises be a callback under all circumstances.
But so far you touched wgpu-core in such a way that compiling with the webgpu backend should fail - which is why I'm a bit aghast that the ci didn't catch that.

eliemichel · 2024-10-22T17:32:23Z

The wgpu crate is an interface to either a host implementing WebGPU or an implementation of WebGPU

Ow, I did not remember that! I get it now. And that explains part of my confusion. I'll try and rework this PR in the light of this, thx!

eliemichel · 2024-10-22T22:48:03Z

Okey I've tried adding a WgpuFuture type in the Context interface, do you think it's heading in the right direction?

Wumpf · 2024-10-23T17:09:35Z

do you think it's heading in the right direction?

frankly at this point I don't know.
I think we need to look at a possible promise-like type in a more holistic way across the wgpu surface and explore how this can be used in a sane cross platform manner, including a sample demonstrating that.
It's one thing to add more (hopefully 🤞 ) readily available information to wgpu-core and using that then in wgpu-native and another one to change the way readiness of asynchronous events is advertised in the wgpu interface accross native & web. All those considerations should really go to a separate iteration. Let's focus for now just on the native-only problem of whether wgpu-core can advertise/predict the submission index at which a buffer mapping becomes available!
@teoxoy told me he might be able to give a hand as well on that issue - he's been recently dealing with some aspects of queue workload as part of another work stream.

wgpu-core/src/device/queue.rs

wgpu-core/src/resource.rs

wgpu-core/src/device/life.rs

eliemichel · 2024-10-27T08:43:27Z

Thanks for the feedback!

@Wumpf agreed, I removed everything that was about the wgpu API.

@teoxoy I removed triage_mapped and replaced with '0' where appropriate!

teoxoy

LGTM

eliemichel added 3 commits September 29, 2024 21:53

Switch flag enums to u64

ba7f1a2

try another way to implement futures

2d283e3

Add submission indices

a93cbdc

eliemichel requested a review from a team as a code owner October 3, 2024 07:11

Remove changes not related to this PR

2e7160b

eliemichel changed the title ~~Eliemichel/future~~ Return submission index to track down completion of async callbacks Oct 3, 2024

eliemichel marked this pull request as draft October 3, 2024 07:15

eliemichel mentioned this pull request Oct 3, 2024

Update library to latest webgpu-native headers gfx-rs/wgpu-native#427

Open

eliemichel added 3 commits October 5, 2024 11:52

map_async returns submission index in wgpu-rs

4ae7b01

on_submitted_work_done returns submission index in wgpu-rs

6374487

Update changelog

1c8c0ad

eliemichel marked this pull request as ready for review October 5, 2024 10:08

Merge branch 'trunk' into eliemichel/future

74fcb65

eliemichel mentioned this pull request Oct 13, 2024

Upgrade to latest wgpu gfx-rs/wgpu-native#441

Open

eliemichel and others added 2 commits October 20, 2024 18:42

Merge branch 'trunk' into eliemichel/future

c9283c1

Fix return value of on_submitted_work_done

4e948dc

Wumpf mentioned this pull request Oct 20, 2024

DownloadBuffer: Improve usability and documentation #6426

Open

eliemichel added 6 commits October 22, 2024 19:49

Merge remote-tracking branch 'origin/trunk' into eliemichel/future

2e8d131

WIP define BufferMapFuture and SubmittedWorkDoneFuture

51aebec

Introduce WgpuFuture

404c387

Implement WgpuFuture for webgpu

4ec8eb8

Try introducing instance_wait_any

4f37591

Run cargo fmt

3209897

eliemichel added 2 commits October 23, 2024 08:45

Refine map async submission index

be41ff6

Use Future instead of Promise in webgpu implem of WgpuFuture

353ddbe

teoxoy requested changes Oct 25, 2024

View reviewed changes

wgpu-core/src/device/queue.rs Outdated Show resolved Hide resolved

wgpu-core/src/resource.rs Outdated Show resolved Hide resolved

wgpu-core/src/device/life.rs Outdated Show resolved Hide resolved

eliemichel added 2 commits October 27, 2024 09:30

Revert changes to wgpu, focus on wgpu-core

4c9371f

Remove triage_mapped

c4f9f46

eliemichel and others added 3 commits October 27, 2024 09:46

Style update

22bc30d

Merge branch 'trunk' into eliemichel/future

8ee2c2a

Fix merge with trunk

9393427

teoxoy approved these changes Nov 4, 2024

View reviewed changes

Merge branch 'trunk' into eliemichel/future

e5c553f

teoxoy changed the title ~~Return submission index to track down completion of async callbacks~~ [wgpu-core] Return submission index for map_async and on_submitted_work_done to track down completion of async callbacks Nov 4, 2024

teoxoy enabled auto-merge (squash) November 4, 2024 14:12

teoxoy merged commit 6a75a73 into gfx-rs:trunk Nov 4, 2024
27 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[wgpu-core] Return submission index for `map_async` and `on_submitted_work_done` to track down completion of async callbacks #6360

[wgpu-core] Return submission index for `map_async` and `on_submitted_work_done` to track down completion of async callbacks #6360

eliemichel commented Oct 3, 2024

Wumpf commented Oct 20, 2024 •

edited

Loading

eliemichel commented Oct 22, 2024 •

edited

Loading

Wumpf commented Oct 22, 2024

eliemichel commented Oct 22, 2024

eliemichel commented Oct 22, 2024

Wumpf commented Oct 23, 2024

eliemichel commented Oct 27, 2024

teoxoy left a comment

[wgpu-core] Return submission index for map_async and on_submitted_work_done to track down completion of async callbacks #6360

[wgpu-core] Return submission index for map_async and on_submitted_work_done to track down completion of async callbacks #6360

Conversation

eliemichel commented Oct 3, 2024

Wumpf commented Oct 20, 2024 • edited Loading

eliemichel commented Oct 22, 2024 • edited Loading

Wumpf commented Oct 22, 2024

eliemichel commented Oct 22, 2024

eliemichel commented Oct 22, 2024

Wumpf commented Oct 23, 2024

eliemichel commented Oct 27, 2024

teoxoy left a comment

Choose a reason for hiding this comment

[wgpu-core] Return submission index for `map_async` and `on_submitted_work_done` to track down completion of async callbacks #6360

[wgpu-core] Return submission index for `map_async` and `on_submitted_work_done` to track down completion of async callbacks #6360

Wumpf commented Oct 20, 2024 •

edited

Loading

eliemichel commented Oct 22, 2024 •

edited

Loading