From f0d56b01a169963753faf52a67e4f816c1d1a90e Mon Sep 17 00:00:00 2001 From: Greg Wilkins Date: Thu, 7 Oct 2021 18:32:35 +1100 Subject: [PATCH] Issue #18 URI path processing UTF-8 decoding --- spec/src/main/asciidoc/servlet-spec-body.adoc | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/spec/src/main/asciidoc/servlet-spec-body.adoc b/spec/src/main/asciidoc/servlet-spec-body.adoc index cc3babcdc..d3d7caa3a 100644 --- a/spec/src/main/asciidoc/servlet-spec-body.adoc +++ b/spec/src/main/asciidoc/servlet-spec-body.adoc @@ -1324,8 +1324,8 @@ The URI is split by the first occurrence of any '?' character to path and query. ==== Discard fragment A fragment in the path is indicated by the first occurrence of a `\#` character. Any `#` character and following fragment is removed from the path and discarded. -==== Decoding of non-special characters -Characters other than `/`, `;` and `%` that are encoded in `%nn` form are decoded and the resulting octet sequences is treated as UTF-8 and converted to a character sequence. +==== Decoding of characters as UTF-8 +Characters other than `/`, `;` and `%` that are encoded in `%nn` form are decoded. The resulting octet sequences is treated as UTF-8 and converted to a character sequence. ==== Collapse sequences of multiple `"/"` characters Any sequence of more than one `"/"` character in the URI must be replaced with a single `"/"`.