BaseIOStream.write(): support typed memoryviews #2996

madsbk · 2021-03-02T10:01:57Z

This PR adds support for memoryviews with an item size greater than 1 byte.

Currently, Tornado uses len(x) to retrieve the number of bytes of x, which works fine when x is bytes or a memoryview with an item size of 1. However, with an item size greater than 1 we cannot assume that len(x) == x.nbytes.

The change makes sure that ``len(data) == data.nbytes`` by casting memoryviews to bytes.
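To make the mismatch concrete, here is a minimal sketch using only the standard library. The `array("d", ...)` input is just an illustrative stand-in for any buffer with multi-byte items; it is not taken from the PR.

```python
from array import array

# A buffer of three C doubles; each item is several bytes wide.
values = array("d", [1.0, 2.0, 3.0])
typed_view = memoryview(values)

# len() counts items, nbytes counts bytes.
item_count = len(typed_view)
byte_count = typed_view.nbytes
print(item_count, typed_view.itemsize, byte_count)

# For a view with an item size of 1, the two agree.
byte_view = memoryview(bytes(values))
print(len(byte_view) == byte_view.nbytes)
```

Any code that uses `len(x)` as a byte count will under-count a view like `typed_view`.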

madsbk · 2021-03-09T07:38:54Z

Sorry for closing and reopening the PR. We have fixed the issue in Dask/Distributed by always casting to bytes before communication (dask/distributed#4555), but I think this PR is still relevant for other downstream projects that might use typed memoryviews.

bdarnell

Interesting, I hadn't noticed this aspect of memoryview before (I thought it was just a Python-level exposure of the buffer protocol).

It looks like we could alternatively convert the input to a memoryview (unconditionally) and use nbytes instead of len. Is there any particular reason to prefer one form over the other? (I like replacing the union with a concrete type as early as possible, although len() is the natural idiom so choosing a type whose len() is problematic is error-prone).
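A rough sketch of that alternative, assuming a hypothetical helper named `byte_length` (not part of Tornado): wrap the input in a memoryview unconditionally and read `nbytes` rather than calling `len()`.

```python
from array import array
from typing import Union


def byte_length(data: Union[bytes, bytearray, memoryview]) -> int:
    """Hypothetical helper: size of ``data`` in bytes, whatever its item size."""
    # memoryview() accepts any object that supports the buffer protocol,
    # including another memoryview, so the union collapses to one type here.
    view = memoryview(data)
    return view.nbytes


doubles = array("d", [1.0, 2.0])
print(byte_length(b"abc"), byte_length(memoryview(doubles)))
```

The trade-off raised in the comment still applies: every later use of the view has to remember to use `nbytes`, because `len()` on it may still count items.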

madsbk · 2021-03-15T08:15:42Z

> It looks like we could alternatively convert the input to a memoryview (unconditionally) and use nbytes instead of len. Is there any particular reason to prefer one form over the other? (I like replacing the union with a concrete type as early as possible, although len() is the natural idiom so choosing a type whose len() is problematic is error-prone).

I think both approaches make sense, but if you are not planning to support non-contiguous buffers or fancy data compression, casting memoryviews with cast("B") seems fine; then you can use nbytes and len interchangeably.
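As an illustration of the cast-based approach, here is a minimal sketch built around a hypothetical helper, `as_byte_view`. It is not the code merged in this PR. The explicit contiguity check is an assumption added for clarity, since `memoryview.cast` only works on C-contiguous views.

```python
from array import array


def as_byte_view(data) -> memoryview:
    """Hypothetical helper: return a flat view of unsigned bytes over ``data``."""
    view = memoryview(data)
    if not view.c_contiguous:
        # cast() would reject this view anyway; fail with a clearer message.
        raise ValueError("non-contiguous buffers are not supported")
    # After the cast, the item size is 1, so len(view) == view.nbytes.
    return view.cast("B")


flat = as_byte_view(array("d", [1.0, 2.0]))
print(len(flat) == flat.nbytes)

# A strided slice is not contiguous, so the helper refuses it.
strided = memoryview(b"abcdef")[::2]
try:
    as_byte_view(strided)
    rejected = False
except ValueError:
    rejected = True
print(rejected)
```

After the cast, slicing by a byte offset (for example, dropping the first `n` bytes already sent) behaves the same as it would for a plain `bytes` object.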

BaseIOStream.write(): support typed memoryview

c66204a

Making sure that ``len(data) == data.nbytes`` by casting memoryviews to bytes.

This was referenced Mar 2, 2021

[REVIEW] tcp.write(): cast memoryview to "B" dask/distributed#4555

Merged

[REVIEW] Msgpack handles extract serialize dask/distributed#4531

Merged

madsbk closed this Mar 2, 2021

madsbk reopened this Mar 9, 2021

bdarnell approved these changes Mar 14, 2021

bdarnell merged commit 4d47f9d into tornadoweb:master Mar 20, 2021
