Static html5ever::tendril::encoding_rs::GBK
source · pub static GBK: &'static Encoding
Expand description
The GBK encoding.
The decoder for this encoding is the same as the decoder for gb18030. The encoder side of this encoding is GBK with Windows code page 936 euro sign behavior and with the changes to two-byte sequences made in GB18030-2022. GBK extends GB2312-80 to cover the CJK Unified Ideographs Unicode block as well as a handful of ideographs from the CJK Unified Ideographs Extension A and CJK Compatibility Ideographs blocks.
Unlike e.g. in the case of ISO-8859-1 and windows-1252, GBK encoder wasn’t unified with the gb18030 encoder in the Encoding Standard out of concern that servers that expect GBK form submissions might not be able to handle the four-byte sequences.
Index visualization for the two-byte sequences, Visualization of BMP coverage of the two-byte index
The encoder of this encoding roughly matches the Windows code page 936. The decoder side is a superset.
This will change from static
to const
if Rust changes
to make the referent of pub const FOO: &'static Encoding
unique cross-crate, so don’t take the address of this
static
.