Expand description
This module provides the lower-level API for UTS 46.
Uts46::process
is the core that the other convenience
methods build on.
UTS 46 flags map to this API as follows:
- CheckHyphens - true:
Hyphens::Check
, false:Hyphens::Allow
; the WHATWG URL Standard sets this to false for normal (non-conformance-checker) user agents. - CheckBidi - Always true; cannot be configured, since this flag is true even when WHATWG URL Standard beStrict is false.
- CheckJoiners - Always true; cannot be configured, since this flag is true even when WHATWG URL Standard beStrict is false.
- UseSTD3ASCIIRules - true:
AsciiDenyList::STD3
, false:AsciiDenyList::EMPTY
; however, the check the WHATWG URL Standard performs right after the UTS 46 invocation corresponds toAsciiDenyList::URL
. - Transitional_Processing - Always false but could be implemented as a preprocessing step. This flag is deprecated and for Web purposes the transition is over in the sense that all of Firefox, Safari, or Chrome set this flag to false.
- VerifyDnsLength - true:
DnsLength::Verify
, false:DnsLength::Ignore
; the WHATWG URL Standard sets this to false for normal (non-conformance-checker) user agents. - IgnoreInvalidPunycode - Always false; cannot be configured. (Not yet covered by the WHATWG URL Standard, but 2 out of 3 major browser clearly behave as if this was false).
Structsยง
- The ASCII deny list to be applied.
- An implementation of UTS #46.
Enumsยง
- AlreadyAsciiLabel ๐
- The UTS 46 VerifyDNSLength flag.
- Policy for customizing behavior in case of an error.
- The CheckHyphens mode.
- The failure outcome of
Uts46::process
- The success outcome of
Uts46::process
- RtlNumeralState ๐For keeping track of what kind of numerals have been seen in an RTL label.
Constantsยง
- DOT_MASK ๐The mask for the ASCII dot.
- GLYPHLESS_MASK ๐Bit set for glyphless ASCII.
- ICU4C-compatible constraint. https://unicode-org.atlassian.net/browse/ICU-13727
- ICU4C-compatible constraint. (Note: ICU4C measures UTF-16 and we measure UTF-32. This means that we allow longer non-BMP inputs. For this implementation, the denial-of-service scaling does not depend on BMP vs. non-BMP: only the scalar values matter.)
- PUNYCODE_PREFIX ๐
- PUNYCODE_PREFIX_MASK ๐
- UPPER_CASE_MASK ๐Bit set for upper-case ASCII.
Functionsยง
- check_hyphens ๐
- glyphless_mask ๐Computes the mask for glyphless ASCII.
- has_punycode_prefix ๐
- in_inclusive_range8 ๐
- is_ascii ๐
- ldh_mask ๐Computes the ASCII deny list for STD3 ASCII rules.
- upper_case_mask ๐Computes the mask for upper-case ASCII.
- Performs the VerifyDNSLength check on the output of the ToASCII operation.
- write_punycode_label ๐