Struct aho_corasick::automaton::StreamChunkIter

source ·

struct StreamChunkIter<'a, A, R> {
    aut: &'a A,
    rdr: R,
    buf: Buffer,
    start: StateID,
    sid: StateID,
    absolute_pos: usize,
    buffer_pos: usize,
    buffer_reported_pos: usize,
}

Expand description

An iterator that reports matches in a stream.

(This doesn’t actually implement the Iterator trait because it returns something with a lifetime attached to a buffer it owns, but that’s OK. It still has a next method and is iterator-like enough to be fine.)

This iterator yields elements of type io::Result<StreamChunk>, where an error is reported if there was a problem reading from the underlying stream. The iterator terminates only when the underlying stream reaches EOF.

The idea here is that each chunk represents either a match or a non-match, and if you concatenated all of the chunks together, you’d reproduce the entire contents of the stream, byte-for-byte.

This chunk machinery is a bit complicated and it isn’t strictly required for a stream searcher that just reports matches. But we do need something like this to deal with the “replacement” API, which needs to know which chunks it can copy and which it needs to replace.

Fields§

§aut: &'a A

The underlying automaton to do the search.

§rdr: R

The source of bytes we read from.

§buf: Buffer

A roll buffer for managing bytes from rdr. Basically, this is used to handle the case of a match that is split by two different calls to rdr.read(). This isn’t strictly needed if all we needed to do was report matches, but here we are reporting chunks of non-matches and matches and in order to do that, we really just cannot treat our stream as non-overlapping blocks of bytes. We need to permit some overlap while we retain bytes from a previous read call in memory.

§start: StateID

The unanchored starting state of this automaton.

§sid: StateID

The state of the automaton.

§absolute_pos: usize

The absolute position over the entire stream.

§buffer_pos: usize

The position we’re currently at within buf.

§buffer_reported_pos: usize

The buffer position of the end of the bytes that we last returned to the caller. Basically, whenever we find a match, we look to see if there is a difference between where the match started and the position of the last byte we returned to the caller. If there’s a difference, then we need to return a ‘NonMatch’ chunk.

Struct aho_corasick::automaton::StreamChunkIterCopy item path

Fields§

Implementations§

impl<'a, A: Automaton, R: Read> StreamChunkIter<'a, A, R>

fn new(aut: &'a A, rdr: R) -> Result<StreamChunkIter<'a, A, R>, MatchError>

fn next(&mut self) -> Option<Result<StreamChunk<'_>>>

fn get_match_chunk(&self, mat: Match) -> Range<usize>

fn get_non_match_chunk(&self, mat: Match) -> Option<Range<usize>>

fn get_pre_roll_non_match_chunk(&self) -> Option<Range<usize>>

fn get_eof_non_match_chunk(&self) -> Option<Range<usize>>

fn get_match(&self) -> Match

Trait Implementations§

impl<'a, A: Debug, R: Debug> Debug for StreamChunkIter<'a, A, R>

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Auto Trait Implementations§

impl<'a, A, R> Freeze for StreamChunkIter<'a, A, R>where R: Freeze,

impl<'a, A, R> RefUnwindSafe for StreamChunkIter<'a, A, R>where R: RefUnwindSafe, A: RefUnwindSafe,

impl<'a, A, R> Send for StreamChunkIter<'a, A, R>where R: Send, A: Sync,

impl<'a, A, R> Sync for StreamChunkIter<'a, A, R>where R: Sync, A: Sync,

impl<'a, A, R> Unpin for StreamChunkIter<'a, A, R>where R: Unpin,

impl<'a, A, R> UnwindSafe for StreamChunkIter<'a, A, R>where R: UnwindSafe, A: RefUnwindSafe,

Blanket Implementations§

impl<T> Any for Twhere T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for Twhere T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for Twhere T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Into<U> for Twhere U: From<T>,

fn into(self) -> U

impl<T, U> TryFrom<U> for Twhere U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for Twhere U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Struct aho_corasick::automaton::StreamChunkIter

impl<'a, A, R> Freeze for StreamChunkIter<'a, A, R>
where R: Freeze,

impl<'a, A, R> RefUnwindSafe for StreamChunkIter<'a, A, R>
where R: RefUnwindSafe, A: RefUnwindSafe,

impl<'a, A, R> Send for StreamChunkIter<'a, A, R>
where R: Send, A: Sync,

impl<'a, A, R> Sync for StreamChunkIter<'a, A, R>
where R: Sync, A: Sync,

impl<'a, A, R> Unpin for StreamChunkIter<'a, A, R>
where R: Unpin,

impl<'a, A, R> UnwindSafe for StreamChunkIter<'a, A, R>
where R: UnwindSafe, A: RefUnwindSafe,

impl<T> Any for T
where T: 'static + ?Sized,

impl<T> Borrow<T> for T
where T: ?Sized,

impl<T> BorrowMut<T> for T
where T: ?Sized,

impl<T, U> Into<U> for T
where U: From<T>,

impl<T, U> TryFrom<U> for T
where U: Into<T>,

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,