Struct regex_automata::nfa::thompson::range_trie::RangeTrie

pub struct RangeTrie {
    states: Vec<State>,
    free: Vec<State>,
    iter_stack: RefCell<Vec<NextIter>>,
    iter_ranges: RefCell<Vec<Utf8Range>>,
    dupe_stack: Vec<NextDupe>,
    insert_stack: Vec<NextInsert>,
}

A range trie represents an ordered set of sequences of bytes.

A range trie accepts as input a sequence of byte ranges and merges them into the existing set such that the trie can produce a sorted non-overlapping sequence of byte ranges. The sequence emitted corresponds precisely to the sequence of bytes matched by the given keys, although the byte ranges themselves may be split at different boundaries.
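
The idea of splitting ranges "at different boundaries" can be illustrated for a single level of the trie. The following is a minimal sketch, not the crate's algorithm: `ByteRange` is a hypothetical stand-in for `Utf8Range`, and `split_non_overlapping` is a hypothetical helper that turns possibly overlapping byte ranges into sorted, non-overlapping pieces covering exactly the same bytes.

```rust
/// An inclusive range of bytes. Hypothetical stand-in for `Utf8Range`.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
struct ByteRange {
    start: u8,
    end: u8,
}

/// Splits possibly overlapping ranges into sorted, non-overlapping pieces
/// that together cover exactly the same bytes as the input.
fn split_non_overlapping(ranges: &[ByteRange]) -> Vec<ByteRange> {
    // Every piece begins at a range start or just past a range end.
    // u16 is used so that `end + 1` cannot overflow when `end == 0xFF`.
    let mut cuts: Vec<u16> = Vec::new();
    for r in ranges {
        cuts.push(r.start as u16);
        cuts.push(r.end as u16 + 1);
    }
    cuts.sort_unstable();
    cuts.dedup();

    let mut out = Vec::new();
    for w in cuts.windows(2) {
        let (lo, hi) = (w[0], w[1] - 1);
        // A piece lies entirely inside or entirely outside each input
        // range, so checking containment is enough.
        let covered = ranges
            .iter()
            .any(|r| r.start as u16 <= lo && hi <= r.end as u16);
        if covered {
            out.push(ByteRange { start: lo as u8, end: hi as u8 });
        }
    }
    out
}
```

Under these assumptions, the overlapping inputs `a-f` and `d-k` come out as `a-c`, `d-f` and `g-k`: the same bytes are matched, but the boundaries differ from those of the keys that were inserted.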

The time complexity of this data structure seems difficult to analyze. If the size of a byte is held constant, then insertion is clearly O(n), where n is the number of byte ranges in the input key. However, if the alphabet size k = 256 is treated as a variable, then insertion could be O(k^2 * n). In particular, it seems possible for pathological inputs to cause insertion to do a lot of work. In practice, there should be no pathological inputs for the way this data structure is used, since the ultimate source is always a sorted set of Unicode scalar value ranges.

Internally, this trie is set up like a finite state machine. Note, though, that it is acyclic.

Fields§

§states: Vec<State>

The states in this trie. The first is always the shared final state. The second is always the root state. Otherwise, there is no particular order.

§free: Vec<State>

A free-list of states. When a range trie is cleared, all of its states are added to this list. Creating a new state reuses states from this list before allocating a new one.
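
The free-list pattern described above can be sketched as follows. This is an illustration under stated assumptions rather than the crate's code: `Arena`, its `State` type and the `usize` state identifiers are hypothetical, and only the method names `clear` and `add_empty` mirror the ones listed on this page.

```rust
/// A state holding its outgoing transitions as (start, end, next) triples.
/// Hypothetical; the real `State` type is not shown on this page.
#[derive(Debug, Default)]
struct State {
    transitions: Vec<(u8, u8, usize)>,
}

/// Hypothetical container with live states and a free-list of spares.
#[derive(Debug, Default)]
struct Arena {
    states: Vec<State>,
    free: Vec<State>,
}

impl Arena {
    /// Adds an empty state, reusing one from the free-list when possible
    /// so that its transition buffer keeps its allocated capacity.
    fn add_empty(&mut self) -> usize {
        let id = self.states.len();
        let state = match self.free.pop() {
            Some(mut recycled) => {
                // `Vec::clear` drops the elements but keeps the capacity.
                recycled.transitions.clear();
                recycled
            }
            None => State::default(),
        };
        self.states.push(state);
        id
    }

    /// Moves every live state onto the free-list instead of dropping it.
    fn clear(&mut self) {
        self.free.extend(self.states.drain(..));
    }
}
```

The point of the design is that repeatedly building and clearing the structure settles into a steady state in which no new allocations are needed for the states themselves.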

§iter_stack: RefCell<Vec<NextIter>>

A stack for traversing this trie to yield sequences of byte ranges in lexicographic order.

§iter_ranges: RefCell<Vec<Utf8Range>>

A buffer that stores the current sequence during iteration.
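
A traversal that uses an explicit stack plus a buffer for the current sequence might look like the sketch below. It is a simplified illustration, not the crate's implementation: `Trie`, `Transition`, the `FINAL` and `ROOT` constants and the `(u8, u8)` range representation are all hypothetical, and the sketch allocates local vectors on each call instead of reusing `RefCell`-wrapped buffers. It assumes the transitions of each state are sorted and non-overlapping, which is what makes the output lexicographic.

```rust
/// A transition over an inclusive byte range to state `next`. Hypothetical.
struct Transition {
    start: u8,
    end: u8,
    next: usize,
}

/// Index of the shared final state and of the root state (assumed layout).
const FINAL: usize = 0;
const ROOT: usize = 1;

/// Hypothetical trie: one sorted transition list per state.
struct Trie {
    states: Vec<Vec<Transition>>,
}

impl Trie {
    /// Calls `f` on every sequence of ranges from the root to the final
    /// state, stopping early if `f` returns an error.
    fn iter<E, F>(&self, mut f: F) -> Result<(), E>
    where
        F: FnMut(&[(u8, u8)]) -> Result<(), E>,
    {
        // Each entry is (state id, index of the next transition to visit).
        let mut stack: Vec<(usize, usize)> = vec![(ROOT, 0)];
        // The ranges on the path from the root to the current position.
        let mut ranges: Vec<(u8, u8)> = Vec::new();

        while let Some((id, i)) = stack.pop() {
            let state = &self.states[id];
            if i >= state.len() {
                // State exhausted: drop the range that led into it.
                // At the root the buffer is already empty.
                ranges.pop();
                continue;
            }
            let t = &state[i];
            ranges.push((t.start, t.end));
            // Resume this state at its next transition later on.
            stack.push((id, i + 1));
            if t.next == FINAL {
                f(&ranges)?;
                ranges.pop();
            } else {
                stack.push((t.next, 0));
            }
        }
        Ok(())
    }
}
```

Keeping the stack and the buffer outside the call stack avoids recursion, and storing them in the trie, as the fields here do, would let repeated iterations reuse the same allocations.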

§dupe_stack: Vec<NextDupe>

A stack used for traversing the trie in order to (deeply) duplicate a state. States are recursively duplicated when ranges are split.

§insert_stack: Vec<NextInsert>

A stack used for traversing the trie during insertion of a new sequence of byte ranges.

Implementations§

impl RangeTrie

pub fn new() -> RangeTrie

pub fn clear(&mut self)

pub fn iter<E, F: FnMut(&[Utf8Range]) -> Result<(), E>>(&self, f: F) -> Result<(), E>

pub fn insert(&mut self, ranges: &[Utf8Range])

pub fn add_empty(&mut self) -> StateID

fn duplicate(&mut self, old_id: StateID) -> StateID

fn add_transition(&mut self, from_id: StateID, range: Utf8Range, next_id: StateID)

fn add_transition_at(&mut self, i: usize, from_id: StateID, range: Utf8Range, next_id: StateID)

fn set_transition_at(&mut self, i: usize, from_id: StateID, range: Utf8Range, next_id: StateID)

fn state(&self, id: StateID) -> &State

fn state_mut(&mut self, id: StateID) -> &mut State

Trait Implementations§

impl Clone for RangeTrie

fn clone(&self) -> RangeTrie

fn clone_from(&mut self, source: &Self)

impl Debug for RangeTrie

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Auto Trait Implementations§

impl !Freeze for RangeTrie

impl !RefUnwindSafe for RangeTrie

impl Send for RangeTrie

impl !Sync for RangeTrie

impl Unpin for RangeTrie

impl UnwindSafe for RangeTrie

Blanket Implementations§

impl<T> Any for T where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for T where T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for T where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> CloneToUninit for T where T: Clone,

unsafe fn clone_to_uninit(&self, dst: *mut T)

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Into<U> for T where U: From<T>,

fn into(self) -> U

impl<T> ToOwned for T where T: Clone,

type Owned = T

fn to_owned(&self) -> T

fn clone_into(&self, target: &mut T)

impl<T, U> TryFrom<U> for T where U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for T where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>
