Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consider adding windows-936-2000 as a label for GBK #295

Open
annevk opened this issue Sep 29, 2022 · 2 comments
Open

Consider adding windows-936-2000 as a label for GBK #295

annevk opened this issue Sep 29, 2022 · 2 comments
Labels
i18n-clreq Notifies Chinese script experts of relevant issues interop Implementations are not interoperable with each other normative

Comments

@annevk
Copy link
Member

annevk commented Sep 29, 2022

It seems that both Chromium and WebKit support this label.

@hsivonen what do you think?

Chromium bug: https://bugs.chromium.org/p/chromium/issues/detail?id=1081315.

WPT: encoding/unsupported-labels.window.js.

cc @achristensen07 @inexorabletash

@annevk annevk added normative i18n-clreq Notifies Chinese script experts of relevant issues interop Implementations are not interoperable with each other labels Sep 29, 2022
@hsivonen
Copy link
Member

I'm not against this.

However, if we're adding more labels, I think there's probably more opportunity for benefit from supporting Java 1 encoding names. See https://searchfox.org/comm-central/source/mailnews/intl/charsetalias.properties and the java.io column at https://docs.oracle.com/javase/8/docs/technotes/guides/intl/encoding.doc.html

Particular categories of labels that are relevant to consider:

  • ISO-8859 series labels of the form iso8859_N where N is the part number, e.g. iso8859_1.
  • Code pages shared by Windows and DOS with the ms prefix. Of these, ms932 was already added as an ad hoc special case.
  • Code pages shared by Windows and DOS with the cp prefix.
  • Underscore versions of Unix CJK encodings: euc_cn, big5_hkscs, etc. My vague recollection is that Presto-era Opera found EUC_JP not to be Web-compatible due to Shift_JIS content labeled as EUC_JP. However, it's been long enough that my memory may be generating false recollections of Web compat problems.

@annevk
Copy link
Member Author

annevk commented Sep 29, 2022

https://annevankesteren.nl/2009/09/web-character-encodings agrees with your memory. The fact that no user agent ships those today makes me a bit more hesitant as it would probably require a lot more research than adding this single label to Gecko. Having said that, I'm not opposed if someone is willing to drive that effort.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
i18n-clreq Notifies Chinese script experts of relevant issues interop Implementations are not interoperable with each other normative
Development

No branches or pull requests

2 participants