pub enum Meaning {
Whole(char),
LeadSurrogate(u16),
TrailSurrogate(u16),
Prefix(usize),
Suffix,
}
Expand description
Meaning of a complete or partial UTF-8 codepoint.
Not all checking is performed eagerly. That is, a codepoint Prefix
or
Suffix
may in reality have no valid completion.
Variants§
Whole(char)
We found a whole codepoint.
LeadSurrogate(u16)
We found something that isn’t a valid Unicode codepoint, but
it would correspond to a UTF-16 leading surrogate code unit,
i.e. a value in the range U+D800
- U+DBFF
.
The argument is the code unit’s 10-bit index within that range.
These are found in UTF-8 variants such as CESU-8 and WTF-8.
TrailSurrogate(u16)
We found something that isn’t a valid Unicode codepoint, but
it would correspond to a UTF-16 trailing surrogate code unit,
i.e. a value in the range U+DC00
- U+DFFF
.
The argument is the code unit’s 10-bit index within that range.
These are found in UTF-8 variants such as CESU-8 and WTF-8.
Prefix(usize)
We found only a prefix of a codepoint before the buffer ended.
Includes the number of additional bytes needed.
Suffix
We found only a suffix of a codepoint before running off the start of the buffer.
Up to 3 more bytes may be needed.
Trait Implementations§
source§impl Ord for Meaning
impl Ord for Meaning
source§impl PartialEq<Meaning> for Meaning
impl PartialEq<Meaning> for Meaning
source§impl PartialOrd<Meaning> for Meaning
impl PartialOrd<Meaning> for Meaning
1.0.0 · source§fn le(&self, other: &Rhs) -> bool
fn le(&self, other: &Rhs) -> bool
self
and other
) and is used by the <=
operator. Read more