Struct icu_normalizer::ComposingNormalizer

source ·
pub struct ComposingNormalizer {
    pub(crate) decomposing_normalizer: DecomposingNormalizer,
    pub(crate) canonical_compositions: DataPayload<CanonicalCompositionsV1Marker>,
}
Expand description

A normalizer for performing composing normalization.

Fields§

§decomposing_normalizer: DecomposingNormalizer§canonical_compositions: DataPayload<CanonicalCompositionsV1Marker>

Implementations§

source§

impl ComposingNormalizer

source

pub const fn new_nfc() -> Self

NFC constructor using compiled data.

Enabled with the compiled_data Cargo feature.

📚 Help choosing a constructor

source

pub fn try_new_nfc_with_any_provider( provider: &(impl AnyProvider + ?Sized), ) -> Result<Self, NormalizerError>

A version of Self::new_nfc that uses custom data provided by an AnyProvider.

📚 Help choosing a constructor

source

pub fn try_new_nfc_unstable<D>(provider: &D) -> Result<Self, NormalizerError>

A version of Self::new_nfc that uses custom data provided by a DataProvider.

📚 Help choosing a constructor

⚠️ The bounds on provider may change over time, including in SemVer minor releases.
source

pub const fn new_nfkc() -> Self

NFKC constructor using compiled data.

Enabled with the compiled_data Cargo feature.

📚 Help choosing a constructor

source

pub fn try_new_nfkc_with_any_provider( provider: &(impl AnyProvider + ?Sized), ) -> Result<Self, NormalizerError>

A version of Self::new_nfkc that uses custom data provided by an AnyProvider.

📚 Help choosing a constructor

source

pub fn try_new_nfkc_unstable<D>(provider: &D) -> Result<Self, NormalizerError>

A version of Self::new_nfkc that uses custom data provided by a DataProvider.

📚 Help choosing a constructor

⚠️ The bounds on provider may change over time, including in SemVer minor releases.
source

pub(crate) const fn new_uts46() -> Self

This is a special building block normalization for IDNA that implements parts of the Map step and the following Normalize step.

Warning: In this normalization, U+0345 COMBINING GREEK YPOGEGRAMMENI exhibits a behavior that no character in Unicode exhibits in NFD, NFKD, NFC, or NFKC: Case folding turns U+0345 from a reordered character into a non-reordered character before reordering happens. Therefore, the output of this normalization may differ for different inputs that are canonically equivalents with each other if they differ by how U+0345 is ordered relative to other reorderable characters.

source

pub(crate) fn try_new_uts46_unstable<D>( provider: &D, ) -> Result<Self, NormalizerError>

A version of Self::new_uts46 that uses custom data provided by a DataProvider.

📚 Help choosing a constructor

⚠️ The bounds on provider may change over time, including in SemVer minor releases.
source

pub fn normalize_iter<I: Iterator<Item = char>>( &self, iter: I, ) -> Composition<'_, I>

Wraps a delegate iterator into a composing iterator adapter by using the data already held by this normalizer.

source

pub(crate) fn normalize_iter_private<I: Iterator<Item = char>>( &self, iter: I, ignorable_behavior: IgnorableBehavior, ) -> Composition<'_, I>

source

pub fn normalize(&self, text: &str) -> String

Normalize a string slice into a String.

source

pub fn is_normalized(&self, text: &str) -> bool

Check whether a string slice is normalized.

source

pub fn normalize_utf16(&self, text: &[u16]) -> Vec<u16>

Normalize a slice of potentially-invalid UTF-16 into a Vec.

Unpaired surrogates are mapped to the REPLACEMENT CHARACTER before normalizing.

source

pub fn is_normalized_utf16(&self, text: &[u16]) -> bool

Checks whether a slice of potentially-invalid UTF-16 is normalized.

Unpaired surrogates are treated as the REPLACEMENT CHARACTER.

source

pub fn normalize_utf8(&self, text: &[u8]) -> String

Normalize a slice of potentially-invalid UTF-8 into a String.

Ill-formed byte sequences are mapped to the REPLACEMENT CHARACTER according to the WHATWG Encoding Standard.

source

pub fn is_normalized_utf8(&self, text: &[u8]) -> bool

Check if a slice of potentially-invalid UTF-8 is normalized.

Ill-formed byte sequences are mapped to the REPLACEMENT CHARACTER according to the WHATWG Encoding Standard before checking.

source

pub fn normalize_to<W: Write + ?Sized>( &self, text: &str, sink: &mut W, ) -> Result

Normalize a string slice into a Write sink.

source

pub fn normalize_utf8_to<W: Write + ?Sized>( &self, text: &[u8], sink: &mut W, ) -> Result

Normalize a slice of potentially-invalid UTF-8 into a Write sink.

Ill-formed byte sequences are mapped to the REPLACEMENT CHARACTER according to the WHATWG Encoding Standard.

source

pub fn normalize_utf16_to<W: Write16 + ?Sized>( &self, text: &[u16], sink: &mut W, ) -> Result

Normalize a slice of potentially-invalid UTF-16 into a Write16 sink.

Unpaired surrogates are mapped to the REPLACEMENT CHARACTER before normalizing.

Trait Implementations§

source§

impl Debug for ComposingNormalizer

source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more

Auto Trait Implementations§

Blanket Implementations§

source§

impl<T> Any for T
where T: 'static + ?Sized,

source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
source§

impl<T> Borrow<T> for T
where T: ?Sized,

source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
source§

impl<T> From<T> for T

source§

fn from(t: T) -> T

Returns the argument unchanged.

source§

impl<T, U> Into<U> for T
where U: From<T>,

source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

§

type Error = Infallible

The type returned in the event of a conversion error.
source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
source§

impl<T> ErasedDestructor for T
where T: 'static,

source§

impl<T> MaybeSendSync for T