Struct Regex

Source

pub struct Regex {
    forward: DFA,
    reverse: DFA,
}

Expand description

A regular expression that uses hybrid NFA/DFAs (also called “lazy DFAs”) for searching.

A regular expression is comprised of two lazy DFAs, a “forward” DFA and a “reverse” DFA. The forward DFA is responsible for detecting the end of a match while the reverse DFA is responsible for detecting the start of a match. Thus, in order to find the bounds of any given match, a forward search must first be run followed by a reverse search. A match found by the forward DFA guarantees that the reverse DFA will also find a match.

§Fallibility

Most of the search routines defined on this type will panic when the underlying search fails. This might be because the DFA gave up because it saw a quit byte, whether configured explicitly or via heuristic Unicode word boundary support, although neither are enabled by default. It might also fail if the underlying DFA determines it isn’t making effective use of the cache (which also never happens by default). Or it might fail because an invalid Input configuration is given, for example, with an unsupported Anchored mode.

If you need to handle these error cases instead of allowing them to trigger a panic, then the lower level Regex::try_search provides a fallible API that never panics.

§Example

This example shows how to cause a search to terminate if it sees a \n byte, and handle the error returned. This could be useful if, for example, you wanted to prevent a user supplied pattern from matching across a line boundary.

use regex_automata::{hybrid::{dfa, regex::Regex}, Input, MatchError};

let re = Regex::builder()
    .dfa(dfa::Config::new().quit(b'\n', true))
    .build(r"foo\p{any}+bar")?;
let mut cache = re.create_cache();

let input = Input::new("foo\nbar");
// Normally this would produce a match, since \p{any} contains '\n'.
// But since we instructed the automaton to enter a quit state if a
// '\n' is observed, this produces a match error instead.
let expected = MatchError::quit(b'\n', 3);
let got = re.try_search(&mut cache, &input).unwrap_err();
assert_eq!(expected, got);

Fields§

§forward: DFA

The forward lazy DFA. This can only find the end of a match.

§reverse: DFA

The reverse lazy DFA. This can only find the start of a match.

This is built with ‘all’ match semantics (instead of leftmost-first) so that it always finds the longest possible match (which corresponds to the leftmost starting position). It is also compiled as an anchored matcher and has ‘starts_for_each_pattern’ enabled. Including starting states for each pattern is necessary to ensure that we only look for matches of a pattern that matched in the forward direction. Otherwise, we might wind up finding the “leftmost” starting position of a totally different pattern!

Struct Regex Copy item path

§Fallibility

§Example

Fields§

Implementations§

impl Regex

pub fn new(pattern: &str) -> Result<Regex, BuildError>

§Example

pub fn new_many<P: AsRef<str>>(patterns: &[P]) -> Result<Regex, BuildError>

§Example

pub fn builder() -> Builder

§Example

pub fn create_cache(&self) -> Cache

pub fn reset_cache(&self, cache: &mut Cache)

§Example

impl Regex

pub fn is_match<'h, I: Into<Input<'h>>>( &self, cache: &mut Cache, input: I, ) -> bool

§Panics

§Example

pub fn find<'h, I: Into<Input<'h>>>( &self, cache: &mut Cache, input: I, ) -> Option<Match>

§Panics

§Example

pub fn find_iter<'r, 'c, 'h, I: Into<Input<'h>>>( &'r self, cache: &'c mut Cache, input: I, ) -> FindMatches<'r, 'c, 'h> ⓘ

§Panics

§Example

impl Regex

pub fn try_search( &self, cache: &mut Cache, input: &Input<'_>, ) -> Result<Option<Match>, MatchError>

§Errors

fn is_anchored(&self, input: &Input<'_>) -> bool

impl Regex

pub fn forward(&self) -> &DFA

pub fn reverse(&self) -> &DFA

pub fn pattern_len(&self) -> usize

§Example

Trait Implementations§

impl Debug for Regex

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Auto Trait Implementations§

impl Freeze for Regex

impl RefUnwindSafe for Regex

impl Send for Regex

impl Sync for Regex

impl Unpin for Regex

impl UnsafeUnpin for Regex

impl UnwindSafe for Regex

Blanket Implementations§

impl<T> Any for Twhere T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for Twhere T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for Twhere T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Into<U> for Twhere U: From<T>,

fn into(self) -> U

impl<T, U> TryFrom<U> for Twhere U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for Twhere U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Struct Regex

impl<T> Any for T
where T: 'static + ?Sized,

impl<T> Borrow<T> for T
where T: ?Sized,

impl<T> BorrowMut<T> for T
where T: ?Sized,

impl<T, U> Into<U> for T
where U: From<T>,

impl<T, U> TryFrom<U> for T
where U: Into<T>,

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,