Parsing content after a 204 response #26

jugglinmike · 2017-05-24T15:29:44Z

When data is written to a connection following a 204 response, the user agent may interpret the data as it pleases. In web browsers today, this means:

The Chromium and Edge web browsers will inspect the first 4 bytes for the
start of a valid HTTP response. If found, they will parse the data that
follows as a new response (and if any of those first four bytes bytes are
invalid they will be discarded). If more than four invalid bytes are
encountered, the browsers abort parsing and interpret the data as a
HTTP/0.9 response. This is consistent with their general response parsing
behavior (i.e. without a preceding a 204 response)
The Firefox web browser may inspect 1 kilobyte of data or more (the exact
number has been variable in my testing) for a valid response. If found, it
will discard any preceding invalid data. This tolerant behavior is only
observable following a 204 response; otherwise, Firefox seems to parse in the
same way as Chromium and Edge.
the Safari web browser, upon receiving any invalid data, makes no attempt to
recover and discards the remaining data

This variation has led to instability in automated tests written for the Web Platform Tests project--see issue 5037.

I originally reported this inconsistency in issue 5227, where @mnot provided the following context (from RFC7230 section 3.3.3):

If the final response to the last request on a connection has been completely
received and there remains additional data to read, a user agent MAY discard
the remaining data or attempt to determine if that data belongs as part of
the prior response body, which might be the case if the prior message's
Content-Length value is incorrect. A client MUST NOT process, cache, or
forward such extra data as a separate response, since such behavior would be
vulnerable to cache poisoning.

Mark followed up by saying:

What I think's being requested is a recommendation for how much data should
be discarded before the client gives up; possibly a minimum. It feels kind of
analogous to when we established the minimum URL length that should be
supported by implementations, so it's not completely off base.

That said, this is truly a corner case; the right answer is "don't do that."
Anyone depending on interop in this case is doing it wrong to start with.

Though I agree with @annevk: "I think ideally HTTP defines how to parse HTTP." Can the specification language be made more explicit for expected behavior in this situation?

Thanks for your consideration!

mcmanus · 2017-05-26T12:47:31Z

I agree with anne that this was a flaw in http/1 - but the specification has been made more explicit via 7540 - the successor protocol. This is the normal way of fixing problems. In 7540 there is no ambiguity between responses.

As I've said in different forums, the WPT is trying to test the wrong thing. Fundamentally it is trying to test error handling behavior when no behavior is specified in http/1. There are probably an unbounded number of such cases where a server does something that violates 723x and the client behavior is unspecified - I'm not sure why this one is special. I'd like to think the testing framework focused on finding violations of 723x. The server should fail this particular test for sending data after the 204.

otoh - the situation is much better in http/2 and a client test for that would be deterministic.

annevk · 2017-06-28T22:05:58Z

What I don't understand is why undefined client-behavior is acceptable and not a bug that needs fixing. It's almost never acceptable (cannot think of exceptions) anywhere else in the platform.

Concretely, it's a problem for new client implementations that wish to compete with existing ones as they have to reverse engineer existing clients to work with broken servers. That's a time-old problem.

mnot · 2017-06-28T23:01:26Z

@jugglinmike one clarification -- when you see this happening, what request does the newly "found" response after the 204 get assigned to -- the one that caused the 204, or is pipelining in use?

jugglinmike · 2017-06-29T17:43:28Z

@mnot in my tests, the client sends a second request only after it receives a 204 response to the initial request. It is this second request that receives the malformed response.

My test script is available in the following Firefox bug report:

https://bugzilla.mozilla.org/show_bug.cgi?id=1356614

I believe this format precludes any pipelining. Would the effect of pipelining be useful here?

mnot · 2017-06-30T01:53:33Z

OK, so it sounds like those bytes sit in a buffer somewhere (TCP? browser?) and get consumed as part of the next response on the wire.

I can think of a few relevant ways to tighten up HTTP here. None of them are specific to 204.

If a HTTP/1 client receives bytes on a connection that doesn't have any outstanding requests, it MUST (do something). I can imagine that they might get assigned as extra bytes on the previous response if it allows a body, or raise an error, or get silently discarded, but there's no way they should be interpreted as a response to a subsequent request. Clarifying this should help all non-pipelining clients behave correctly.
If a HTTP/1 client receives bytes on a connection that does have outstanding requests, I don't know that there's much we can do to help them; there are a mess of heuristics that are in use, but AIUI very little interest in evolving / aligning that code. The best we might do is put a "here be dragons" warning sign up (which is generally the case for pipelining anyway).
When a HTTP/1 client is expecting a response and gets garbage bytes, it would be nice to align or at least put limits on how much tolerance they have. This is what the bug originally asked for, but that depends on implementers wanting to do so. In the meantime, I think (1) above might help substantially (and it's something browsers should really fix IMO).

The only catch for (1) is that it's naturally racy; there are always going to be cases where bytes are in flight, or the TCP buffers aren't fully drained before the browser starts to write(). So it'll probably have to be a SHOULD with caveats.

@mcmanus thoughts?

mcmanus · 2017-06-30T16:31:09Z

This issue is fixed - where all h1 transport issues were fixed - in 7540. If it were out of the scope of transport I would feel differently - but these are exactly the kinds of things the h2 path is expected to fix. So why refix it in a fork of a legacy protocol?

…

On Thu, Jun 29, 2017 at 6:53 PM, Mark Nottingham ***@***.***> wrote: OK, so it sounds like those bytes sit in a buffer somewhere (TCP? browser?) and get consumed as part of the next response on the wire. I can think of a few relevant ways to tighten up HTTP here. None of them are specific to 204. 1. If a HTTP/1 client receives bytes on a connection that doesn't have any outstanding requests, it MUST (do something). I can imagine that they might get assigned as extra bytes on the previous response if it allows a body, or raise an error, or get silently discarded, but there's no way they should be interpreted as a response to a subsequent request. Clarifying this should help all non-pipelining clients behave correctly. 2. If a HTTP/1 client receives bytes on a connection that *does* have outstanding requests, I don't know that there's much we can do to help them; there are a mess of heuristics that are in use, but AIUI very little interest in evolving / aligning that code. The best we might do is put a "here be dragons" warning sign up (which is generally the case for pipelining anyway). 3. When a HTTP/1 client is expecting a response and gets garbage bytes, it would be *nice* to align or at least put limits on how much tolerance they have. This is what the bug originally asked for, but that depends on implementers wanting to do so. In the meantime, I think (1) above might help substantially (and it's something browsers should really fix IMO). The only catch for (1) is that it's naturally racy; there are always going to be cases where bytes are in flight, or the TCP buffers aren't fully drained before the browser starts to write(). So it'll probably have to be a SHOULD with caveats. @mcmanus <https://github.com/mcmanus> thoughts? — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#26 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AAP5s1n_06q1Y2W8y9eA-dnAJdKEUEUIks5sJFUdgaJpZM4NlR98> .

annevk · 2017-06-30T16:34:27Z

@mcmanus for the reasons explained in #26 (comment)?

mcmanus · 2017-06-30T16:46:39Z

I don't see 'how browsers interop with broken legacy servers document' to be an http item. We don't make things better by having a discussion of "which http/1 do you implement?"

This energy would seem better directed towards fixing the broken actors- to the extent its actually a problem at all.

annevk · 2017-06-30T17:00:35Z

It's much harder to fix millions of servers than aligning a couple of clients. (See also all kinds of other parsers, such as HTML, URL, CSS, etc.)

mnot · 2017-07-02T23:59:09Z

@mcmanus h1 isn't deprecated yet.

mcmanus · 2017-07-10T15:30:02Z

its not a matter of deprecation of 723x (which would mean stop using it) - but of forking 723x into new-h1 and h2.

now h2 incorporates a whole lot of 723x by reference - though definitely not the framing that is part of this particular issue - and pulling that out to a new document that can evolve makes more sense to me.

ekr · 2017-07-25T21:37:50Z

@annevk: "What I don't understand is why undefined client-behavior is acceptable and not a bug that needs fixing. It's almost never acceptable (cannot think of exceptions) anywhere else in the platform"

That's not really true as a general matter for network protocols. Rather, we define what legal behavior from the peer is, and then implementations have quite a bit of freedom in how to handle nonconformant behavior unless the standard specifically tells them what to do.

annevk · 2017-07-26T07:14:54Z

I understand that's the way you go about things. I don't understand why that's good or why lessons we learned elsewhere about that being problematic are not applicable. See also https://annevankesteren.nl/2016/05/client-server.

ekr · 2017-07-26T11:45:14Z

I'm not sure what "you" refers to, because in this case, it's basically how every network protocol I've ever worked on has been defined. As for "lessons we learned elsewhere", you might consider that other people have learned lessons from their experience as well.

annevk · 2017-07-26T12:07:29Z

I understand that and I'd like to learn about them. It's interesting to me how something that applies to a wide variety of formats, would stop applying the moment a format is used for transport.

ekr · 2017-07-26T12:08:13Z

Happy to chat about this offline, but this issue probably isn't the place

mnot · 2018-10-16T02:44:00Z

Now that #145 is done, Associating a Response to a Request seems like a natural place to talk about this in a limited fashion (as I outlined above).

…sued request Fixes #26.

mnot added the h1-messaging label Jun 20, 2017

mnot changed the title ~~Under-specified: parsing behavior following 204 response~~ Parsing content after a 204 response Jun 29, 2018

mnot self-assigned this Oct 16, 2018

mnot added a commit that referenced this issue Nov 29, 2018

Caution against treating data on a connection as part of a not-yet-is…

5597844

…sued request Fixes #26.

mnot added the has-proposal label Nov 29, 2018

mnot mentioned this issue Nov 29, 2018

Caution against treating data on a connection as part of a not-yet-issued request #178

Merged

mnot closed this as completed in #178 Feb 26, 2019

reschke added a commit that referenced this issue Feb 26, 2019

fix change tracking for #26, regen HTML

86b753c

vdbelt mentioned this issue Jul 19, 2019

Fix: return empty response for 204 preflight requests spatie/laravel-cors#61

Merged

github-actions bot mentioned this issue May 26, 2021

Remove redundant BCD comment from web/http, /security, /web_components, /events and /exslt mdn/content#5351

Merged

github-actions bot mentioned this issue Aug 12, 2021

Convert HTTP to Markdown mdn/content#7853

Merged

TBBle mentioned this issue Jan 8, 2022

FastAPI always returns content, even if 204 no content status code is set. fastapi/fastapi#2832

Closed

9 tasks

github-actions bot mentioned this issue Jan 29, 2022

Macro, Specs and Compat fixes for French HTTP pages mdn/translated-content#3794

Merged

github-actions bot mentioned this issue Apr 15, 2022

Update redirected URLs due to sites moved from *.github.io (part 8) mdn/content#15012

Merged

1 task

github-actions bot mentioned this issue May 9, 2022

prepare for zh-{TW,CN}/web/http h2m: removing span and font tag mdn/translated-content#5505

Merged

This was referenced Jun 11, 2022

Fixes multiline markdown link text mdn/content#17180

Merged

chore: Run Markdownlint rules on fr docs - Batch 12 mdn/translated-content#6316

Merged

github-actions bot mentioned this issue Aug 20, 2022

remove duplicated frontmatter keys from web/{a-h} zh-TW mdn/translated-content#7830

Merged

This was referenced Sep 29, 2022

Markdown conversion for ru - Replace - HTTP section ⚠️ Do not squash ⚠️ mdn/translated-content#8905

Merged

convert Web/http html to md mdn/translated-content#8972

Merged

github-actions bot mentioned this issue Oct 6, 2022

Markdown conversion for pt-BR - Replace - HTTP section ⚠️ Do not squash ⚠️ mdn/translated-content#9057

Merged

This was referenced Feb 9, 2023

Web/HTTP/Status/204 を更新 mdn/translated-content#11542

Merged

[ko] HTTP Status 204, 404 수정, vary 헤더 수정 외 mdn/translated-content#11669

Merged

github-actions bot mentioned this issue Mar 3, 2023

HTTP Status: Remove tags/add status where necessary mdn/content#25025

Merged

github-actions bot mentioned this issue Sep 11, 2023

[ko]: revise files for web/http/status/100~301 mdn/translated-content#15778

Merged

github-actions bot mentioned this issue Oct 16, 2023

chore: remove orphaned files mdn/translated-content#16452

Closed

github-actions bot mentioned this issue Apr 10, 2024

[zh-tw]: update HTTP Status 204 mdn/translated-content#19385

Merged

github-actions bot mentioned this issue Jun 26, 2024

feat(HTTP): Refresh 2XX response pages mdn/content#34422

Merged

3 tasks

This was referenced Aug 13, 2024

Infra spell bot mdn/content#35224

Merged

fix more typos mdn/content#35434

Merged

github-actions bot mentioned this issue Aug 21, 2024

[zh-cn]: update compatibility translation in 204 page mdn/translated-content#23181

Merged

JannisBush mentioned this issue Oct 1, 2024

HTTP does not allow content for 204 anymore mdn/content#36146

Open

zhengcan mentioned this issue Nov 18, 2024

Handle HTTP 204 (No Content) Responses zhengcan/apisdk-rs#5

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parsing content after a 204 response #26

Parsing content after a 204 response #26

jugglinmike commented May 24, 2017

mcmanus commented May 26, 2017

annevk commented Jun 28, 2017

mnot commented Jun 28, 2017

jugglinmike commented Jun 29, 2017

mnot commented Jun 30, 2017

mcmanus commented Jun 30, 2017 via email

annevk commented Jun 30, 2017

mcmanus commented Jun 30, 2017

annevk commented Jun 30, 2017

mnot commented Jul 2, 2017

mcmanus commented Jul 10, 2017

ekr commented Jul 25, 2017

annevk commented Jul 26, 2017

ekr commented Jul 26, 2017

annevk commented Jul 26, 2017

ekr commented Jul 26, 2017

mnot commented Oct 16, 2018

Parsing content after a 204 response #26

Parsing content after a 204 response #26

Comments

jugglinmike commented May 24, 2017

mcmanus commented May 26, 2017

annevk commented Jun 28, 2017

mnot commented Jun 28, 2017

jugglinmike commented Jun 29, 2017

mnot commented Jun 30, 2017

mcmanus commented Jun 30, 2017 via email

annevk commented Jun 30, 2017

mcmanus commented Jun 30, 2017

annevk commented Jun 30, 2017

mnot commented Jul 2, 2017

mcmanus commented Jul 10, 2017

ekr commented Jul 25, 2017

annevk commented Jul 26, 2017

ekr commented Jul 26, 2017

annevk commented Jul 26, 2017

ekr commented Jul 26, 2017

mnot commented Oct 16, 2018