Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Text encoding of spec.html: ascii or utf-8? #1287

Closed
claudepache opened this issue Aug 14, 2018 · 4 comments
Closed

Text encoding of spec.html: ascii or utf-8? #1287

claudepache opened this issue Aug 14, 2018 · 4 comments

Comments

@claudepache
Copy link
Contributor

Currently, the file spec.html contains the declaration

<meta charset="ascii">

which is contradicted by its contents, because it has two non-ascii, utf-8-encoded characters (namely on line 7524 and on line 38018).

That makes my text editor complain each time I want to edit the spec, either:

encoding-mismatch-1

or:

encoding-mismatch-warning

Two options:

  1. Officially allow utf-8-encoded contents, and declare formally spec.html’s charset as utf-8;
  2. Replace the two offending characters by the proper html entities, and add an automated test in order to reject further insertions of non-ascii characters.

Personally, I prefer option 1 ( is more readable, more writable (although I guess that may depend on your working environment), and I never recall the html entity for that character).

@jmdyck
Copy link
Collaborator

jmdyck commented Aug 14, 2018

The U+2019 right single quotation mark on line 38018 should be a U+0027 apostrophe. Commit 64ab8cf in my "Misc editorial" PR fixes this.

So it's really just the U+2265 greater-than or equal to on line 7524. For option 2, this could be changed to &ge;.

@annevk
Copy link
Member

annevk commented Aug 14, 2018

Note that <meta charset=ascii> isn't really conforming HTML anymore (and conforming implementations will treat it as if you wrote windows-1252). Only UTF-8 is allowed.

@jmdyck
Copy link
Collaborator

jmdyck commented Aug 14, 2018

The 'rendering' process generates a file with <meta charset="utf-8">, so at least we're okay on that front.

@claudepache
Copy link
Contributor Author

Resolved by dc1e21c, which follows from the introduction of character ℝ in spec.html.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants