struct Prefilter {
call: unsafe fn(strat: &Prefilter, haystack: &[u8]) -> Option<usize>,
kind: PrefilterKind,
rarest_byte: u8,
rarest_offset: u8,
}
The implementation of a prefilter.
This type encapsulates dispatch to one of several possible choices for a prefilter. Generally speaking, all prefilters have the same approximate algorithm: they choose a couple of bytes from the needle that are believed to be rare, use a fast vector algorithm to look for those bytes and return positions as candidates for some substring search algorithm (currently only Two-Way) to confirm as a match or not.
The differences between the algorithms are actually at the vector implementation level. Namely, we need different routines based on both which target architecture we’re on and what CPU features are supported.
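For a concrete sense of that selection, here is a minimal, self-contained sketch of runtime dispatch on CPU features. It uses std’s is_x86_feature_detected! for brevity and stand-in names of its own; the crate performs its own detection because it targets core-only environments, and its actual constructors differ.

// Sketch only: pick a routine based on runtime CPU feature detection.
// `Routine` and `select_routine` are stand-ins, not this crate's API.
#[derive(Clone, Copy, Debug)]
enum Routine {
    Avx2,
    Sse2,
    Scalar,
}

#[cfg(target_arch = "x86_64")]
fn select_routine() -> Routine {
    if is_x86_feature_detected!("avx2") {
        Routine::Avx2
    } else if is_x86_feature_detected!("sse2") {
        // SSE2 is guaranteed on x86_64, so this is effectively the floor.
        Routine::Sse2
    } else {
        Routine::Scalar
    }
}

#[cfg(not(target_arch = "x86_64"))]
fn select_routine() -> Routine {
    Routine::Scalar
}

fn main() {
    println!("selected: {:?}", select_routine());
}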
The straightforwardly obvious approach here is to use an enum, and make Prefilter::find do case analysis to determine which algorithm was selected and invoke it. However, I’ve observed that this leads to poor codegen in some cases, especially in latency sensitive benchmarks. That is, this approach comes with overhead that I wasn’t able to eliminate.
The second obvious approach is to use dynamic dispatch with traits. Doing that in this context where Prefilter owns the selection generally requires heap allocation, and this code is designed to run in core-only environments.
So we settle on using a union (that’s PrefilterKind) and a function pointer (that’s PrefilterKindFn). We select the right function pointer based on which field in the union we set, and that function in turn knows which field of the union to access. The downside of this approach is that it forces us to think about safety, but the upside is that there are some nice latency improvements to benchmarks. (Especially the memmem/sliceslice/short benchmark.)
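To make the union + function pointer arrangement concrete, here is a minimal, self-contained sketch of the same pattern with made-up payload types. It is not this crate’s actual layout; it only illustrates how the function pointer and the union field are chosen together.

// Illustrative names only; not this crate's actual layout.
#[derive(Clone, Copy)]
struct OneByte { byte: u8 }

#[derive(Clone, Copy)]
struct TwoBytes { byte1: u8, byte2: u8 }

// Whichever field is initialized is decided at construction time.
union Kind {
    one: OneByte,
    two: TwoBytes,
}

struct Dispatcher {
    // The function pointer is chosen together with the union field, so each
    // function knows which field it may read.
    call: unsafe fn(&Dispatcher, &[u8]) -> Option<usize>,
    kind: Kind,
}

impl Dispatcher {
    fn one(byte: u8) -> Dispatcher {
        Dispatcher { call: find_one, kind: Kind { one: OneByte { byte } } }
    }

    fn two(byte1: u8, byte2: u8) -> Dispatcher {
        Dispatcher { call: find_two, kind: Kind { two: TwoBytes { byte1, byte2 } } }
    }

    fn find(&self, haystack: &[u8]) -> Option<usize> {
        // SAFETY: `call` was installed by the constructor that initialized
        // the union field it reads.
        unsafe { (self.call)(self, haystack) }
    }
}

unsafe fn find_one(d: &Dispatcher, haystack: &[u8]) -> Option<usize> {
    // SAFETY: only ever installed by `Dispatcher::one`.
    let byte = unsafe { d.kind.one.byte };
    haystack.iter().position(|&b| b == byte)
}

unsafe fn find_two(d: &Dispatcher, haystack: &[u8]) -> Option<usize> {
    // SAFETY: only ever installed by `Dispatcher::two`.
    let TwoBytes { byte1, byte2 } = unsafe { d.kind.two };
    haystack.iter().position(|&b| b == byte1 || b == byte2)
}

For example, Dispatcher::two(b'q', b'z').find(b"quartz") returns Some(0), and find_one never touches the two field.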
In cases where we’ve selected a vector algorithm and the haystack given is too short, we fall back to the scalar version of memchr on the rarest_byte. (The scalar version of memchr is still better than a naive byte-at-a-time loop because it will read in usize-sized chunks at a time.)
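A minimal sketch of that guard follows. The threshold and vector routine are passed in as parameters because the real minimum length depends on which routine was selected, and memchr::memchr stands in for the internal scalar single-byte search; none of these names are the crate’s own code.

// Sketch of the short-haystack guard described above.
fn find_candidate(
    rarest_byte: u8,
    min_haystack_len: usize,
    vector_find: impl Fn(&[u8]) -> Option<usize>,
    haystack: &[u8],
) -> Option<usize> {
    if haystack.len() < min_haystack_len {
        // Too short for the vector routine: fall back to searching for the
        // rarest byte, which still reads usize-sized chunks at a time.
        memchr::memchr(rarest_byte, haystack)
    } else {
        vector_find(haystack)
    }
}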
Fields
call: unsafe fn(strat: &Prefilter, haystack: &[u8]) -> Option<usize>
kind: PrefilterKind
rarest_byte: u8
rarest_offset: u8
Implementations
impl Prefilter
fn fallback<R: HeuristicFrequencyRank>(ranker: R, pair: Pair, needle: &[u8]) -> Option<Prefilter>
Return a “fallback” prefilter, but only if it is believed to be effective.
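As one way to picture what the ranker is for, here is a hedged sketch of choosing the two rarest needle positions, assuming a rank function where a lower value means a rarer byte; the actual Pair and HeuristicFrequencyRank machinery differs in its details.

// Sketch only: return the offsets of the two rarest bytes in the needle.
fn rarest_pair(rank: impl Fn(u8) -> u8, needle: &[u8]) -> Option<(usize, usize)> {
    if needle.len() < 2 {
        return None;
    }
    // Track the offsets of the two rarest bytes seen so far.
    let (mut idx1, mut idx2) = (0, 1);
    if rank(needle[idx2]) < rank(needle[idx1]) {
        core::mem::swap(&mut idx1, &mut idx2);
    }
    for (i, &b) in needle.iter().enumerate().skip(2) {
        if rank(b) < rank(needle[idx1]) {
            idx2 = idx1;
            idx1 = i;
        } else if rank(b) < rank(needle[idx2]) {
            idx2 = i;
        }
    }
    // idx1 is the rarest position, idx2 the second rarest.
    Some((idx1, idx2))
}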
fn sse2(finder: Finder, needle: &[u8]) -> Prefilter
Return a prefilter using an x86_64 SSE2 vector algorithm.
fn avx2(finder: Finder, needle: &[u8]) -> Prefilter
Return a prefilter using an x86_64 AVX2 vector algorithm.
fn find(&self, haystack: &[u8]) -> Option<usize>
Return a candidate position for a match.
When this returns an offset, it implies that a match could begin at that offset, but it may not. That is, it is possible for a false positive to be returned.
When None is returned, then it is guaranteed that there are no matches for the needle in the given haystack. That is, it is impossible for a false negative to be returned.
The purpose of this routine is to look for candidate matching positions as quickly as possible before running a (likely) slower confirmation step.
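The following is a minimal sketch of how a caller might drive such a routine, with a closure standing in for Prefilter::find and a plain slice comparison standing in for the Two-Way confirmation; it assumes the prefilter reports the first candidate in the slice it is given.

// Sketch only: `prefilter_find` returns the first candidate offset within
// the slice it is given, or None if no match can exist in that slice.
fn search(
    prefilter_find: impl Fn(&[u8]) -> Option<usize>,
    needle: &[u8],
    haystack: &[u8],
) -> Option<usize> {
    let mut at = 0;
    while at + needle.len() <= haystack.len() {
        // A candidate may be a false positive, but None means no match
        // exists at or after `at`, so we can stop immediately.
        let candidate = at + prefilter_find(&haystack[at..])?;
        if candidate + needle.len() > haystack.len() {
            // No room left for a full match at or beyond the candidate.
            return None;
        }
        if &haystack[candidate..candidate + needle.len()] == needle {
            return Some(candidate);
        }
        // False positive: resume the scan just past the candidate.
        at = candidate + 1;
    }
    None
}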
fn find_simple(&self, haystack: &[u8]) -> Option<usize>
A “simple” prefilter that just looks for the occurrence of the rarest byte from the needle. This is generally only used for very small haystacks.
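Conceptually, this amounts to something like the following hedged sketch, which assumes rarest_offset is the rarest byte’s position within the needle and uses memchr::memchr as a stand-in; the actual implementation differs.

// Sketch only: find the rarest byte, then report a candidate start that is
// rarest_offset bytes earlier, since that is where a match containing the
// byte at that needle position would begin.
fn find_simple_sketch(
    rarest_byte: u8,
    rarest_offset: u8,
    haystack: &[u8],
) -> Option<usize> {
    memchr::memchr(rarest_byte, haystack)
        .map(|i| i.saturating_sub(usize::from(rarest_offset)))
}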