Struct regex_automata::dfa::onepass::InternalBuilder

source ·

struct InternalBuilder<'a> {
    dfa: DFA,
    uncompiled_nfa_ids: Vec<StateID>,
    nfa_to_dfa_id: Vec<StateID>,
    stack: Vec<(StateID, Epsilons)>,
    seen: SparseSet,
    matched: bool,
    config: Config,
    nfa: &'a NFA,
    classes: ByteClasses,
}

Expand description

An internal builder for encapsulating the state necessary to build a one-pass DFA. Typical use is just InternalBuilder::new(..).build().

There is no separate pass for determining whether the NFA is one-pass or not. We just try to build the DFA. If during construction we discover that it is not one-pass, we bail out. This is likely to lead to some undesirable expense in some cases, so it might make sense to try an identify common patterns in the NFA that make it definitively not one-pass. That way, we can avoid ever trying to build a one-pass DFA in the first place. For example, ‘\w*\s’ is not one-pass, and since ‘\w’ is Unicode-aware by default, it’s probably not a trivial cost to try and build a one-pass DFA for it and then fail.

Note that some (immutable) fields are duplicated here. For example, the ‘nfa’ and ‘classes’ fields are both in the ‘DFA’. They are the same thing, but we duplicate them because it makes composition easier below. Otherwise, since the borrow checker can’t see through method calls, the mutable borrow we use to mutate the DFA winds up preventing borrowing from any other part of the DFA, even though we aren’t mutating those parts. We only do this because the duplication is cheap.

Fields§

§dfa: DFA

The DFA we’re building.

§uncompiled_nfa_ids: Vec<StateID>

An unordered collection of NFA state IDs that we haven’t yet tried to build into a DFA state yet.

This collection does not ultimately wind up including every NFA state ID. Instead, each ID represents a “start” state for a sub-graph of the NFA. The set of NFA states we then use to build a DFA state consists of that “start” state and all states reachable from it via epsilon transitions.

§nfa_to_dfa_id: Vec<StateID>

A map from NFA state ID to DFA state ID. This is useful for easily determining whether an NFA state has been used as a “starting” point to build a DFA state yet. If it hasn’t, then it is mapped to DEAD, and since DEAD is specially added and never corresponds to any NFA state, it follows that a mapping to DEAD implies the NFA state has no corresponding DFA state yet.

§stack: Vec<(StateID, Epsilons)>

A stack used to traverse the NFA states that make up a single DFA state. Traversal occurs until the stack is empty, and we only push to the stack when the state ID isn’t in ‘seen’. Actually, even more than that, if we try to push something on to this stack that is already in ‘seen’, then we bail out on construction completely, since it implies that the NFA is not one-pass.

§seen: SparseSet

The set of NFA states that we’ve visited via ‘stack’.

§matched: bool

Whether a match NFA state has been observed while constructing a one-pass DFA state. Once a match state is seen, assuming we are using leftmost-first match semantics, then we don’t add any more transitions to the DFA state we’re building.

§config: Config

The config passed to the builder.

This is duplicated in dfa.config.

§nfa: &'a NFA

The NFA we’re building a one-pass DFA from.

This is duplicated in dfa.nfa.

§classes: ByteClasses

The equivalence classes that make up the alphabet for this DFA>

This is duplicated in dfa.classes.

Struct regex_automata::dfa::onepass::InternalBuilderCopy item path

Fields§

Implementations§

impl<'a> InternalBuilder<'a>

fn new(config: Config, nfa: &'a NFA) -> InternalBuilder<'a>

fn build(self) -> Result<DFA, BuildError>

fn shuffle_states(&mut self)

fn compile_transition( &mut self, dfa_id: StateID, trans: &Transition, epsilons: Epsilons, ) -> Result<(), BuildError>

fn add_start_state( &mut self, pid: Option<PatternID>, nfa_id: StateID, ) -> Result<StateID, BuildError>

fn add_dfa_state_for_nfa_state( &mut self, nfa_id: StateID, ) -> Result<StateID, BuildError>

fn add_empty_state(&mut self) -> Result<StateID, BuildError>

fn stack_push( &mut self, nfa_id: StateID, epsilons: Epsilons, ) -> Result<(), BuildError>

Trait Implementations§

impl<'a> Debug for InternalBuilder<'a>

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Auto Trait Implementations§

impl<'a> Freeze for InternalBuilder<'a>

impl<'a> RefUnwindSafe for InternalBuilder<'a>

impl<'a> Send for InternalBuilder<'a>

impl<'a> Sync for InternalBuilder<'a>

impl<'a> Unpin for InternalBuilder<'a>

impl<'a> UnwindSafe for InternalBuilder<'a>

Blanket Implementations§

impl<T> Any for Twhere T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for Twhere T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for Twhere T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Into<U> for Twhere U: From<T>,

fn into(self) -> U

impl<T, U> TryFrom<U> for Twhere U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for Twhere U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Struct regex_automata::dfa::onepass::InternalBuilder

impl<T> Any for T
where T: 'static + ?Sized,

impl<T> Borrow<T> for T
where T: ?Sized,

impl<T> BorrowMut<T> for T
where T: ?Sized,

impl<T, U> Into<U> for T
where U: From<T>,

impl<T, U> TryFrom<U> for T
where U: Into<T>,

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,