Struct aho_corasick::packed::Searcher
source · pub struct Searcher { /* private fields */ }
Expand description
A packed searcher for quickly finding occurrences of multiple patterns.
If callers need more flexible construction, or if one wants to change the
match semantics (either leftmost-first or leftmost-longest), then one can
use the Config
and/or Builder
types for more fine grained control.
Example
This example shows how to create a searcher from an iterator of patterns. By default, leftmost-first match semantics are used.
use aho_corasick::{packed::{MatchKind, Searcher}, PatternID};
let searcher = Searcher::new(["foobar", "foo"].iter().cloned())?;
let matches: Vec<PatternID> = searcher
.find_iter("foobar")
.map(|mat| mat.pattern())
.collect();
assert_eq!(vec![PatternID::ZERO], matches);
Implementations§
source§impl Searcher
impl Searcher
sourcepub fn new<I, P>(patterns: I) -> Option<Searcher>
pub fn new<I, P>(patterns: I) -> Option<Searcher>
A convenience function for constructing a searcher from an iterator
of things that can be converted to a &[u8]
.
If a searcher could not be constructed (either because of an
unsupported CPU or because there are too many patterns), then None
is returned.
Example
Basic usage:
use aho_corasick::{packed::{MatchKind, Searcher}, PatternID};
let searcher = Searcher::new(["foobar", "foo"].iter().cloned())?;
let matches: Vec<PatternID> = searcher
.find_iter("foobar")
.map(|mat| mat.pattern())
.collect();
assert_eq!(vec![PatternID::ZERO], matches);
sourcepub fn config() -> Config
pub fn config() -> Config
A convenience function for calling Config::new()
.
This is useful for avoiding an additional import.
sourcepub fn builder() -> Builder
pub fn builder() -> Builder
A convenience function for calling Builder::new()
.
This is useful for avoiding an additional import.
sourcepub fn find<B: AsRef<[u8]>>(&self, haystack: B) -> Option<Match>
pub fn find<B: AsRef<[u8]>>(&self, haystack: B) -> Option<Match>
Return the first occurrence of any of the patterns in this searcher,
according to its match semantics, in the given haystack. The Match
returned will include the identifier of the pattern that matched, which
corresponds to the index of the pattern (starting from 0
) in which it
was added.
Example
Basic usage:
use aho_corasick::{packed::{MatchKind, Searcher}, PatternID};
let searcher = Searcher::new(["foobar", "foo"].iter().cloned())?;
let mat = searcher.find("foobar")?;
assert_eq!(PatternID::ZERO, mat.pattern());
assert_eq!(0, mat.start());
assert_eq!(6, mat.end());
sourcepub fn find_in<B: AsRef<[u8]>>(&self, haystack: B, span: Span) -> Option<Match>
pub fn find_in<B: AsRef<[u8]>>(&self, haystack: B, span: Span) -> Option<Match>
Return the first occurrence of any of the patterns in this searcher, according to its match semantics, in the given haystack starting from the given position.
The Match
returned will include the identifier of the pattern that
matched, which corresponds to the index of the pattern (starting from
0
) in which it was added. The offsets in the Match
will be relative
to the start of haystack
(and not at
).
Example
Basic usage:
use aho_corasick::{packed::{MatchKind, Searcher}, PatternID, Span};
let haystack = "foofoobar";
let searcher = Searcher::new(["foobar", "foo"].iter().cloned())?;
let mat = searcher.find_in(haystack, Span::from(3..haystack.len()))?;
assert_eq!(PatternID::ZERO, mat.pattern());
assert_eq!(3, mat.start());
assert_eq!(9, mat.end());
sourcepub fn find_iter<'a, 'b, B: ?Sized + AsRef<[u8]>>(
&'a self,
haystack: &'b B
) -> FindIter<'a, 'b> ⓘ
pub fn find_iter<'a, 'b, B: ?Sized + AsRef<[u8]>>( &'a self, haystack: &'b B ) -> FindIter<'a, 'b> ⓘ
Return an iterator of non-overlapping occurrences of the patterns in this searcher, according to its match semantics, in the given haystack.
Example
Basic usage:
use aho_corasick::{packed::{MatchKind, Searcher}, PatternID};
let searcher = Searcher::new(["foobar", "foo"].iter().cloned())?;
let matches: Vec<PatternID> = searcher
.find_iter("foobar fooba foofoo")
.map(|mat| mat.pattern())
.collect();
assert_eq!(vec![
PatternID::must(0),
PatternID::must(1),
PatternID::must(1),
PatternID::must(1),
], matches);
sourcepub fn match_kind(&self) -> &MatchKind
pub fn match_kind(&self) -> &MatchKind
Returns the match kind used by this packed searcher.
Examples
Basic usage:
use aho_corasick::packed::{MatchKind, Searcher};
let searcher = Searcher::new(["foobar", "foo"].iter().cloned())?;
// leftmost-first is the default.
assert_eq!(&MatchKind::LeftmostFirst, searcher.match_kind());
sourcepub fn minimum_len(&self) -> usize
pub fn minimum_len(&self) -> usize
Returns the minimum length of a haystack that is required in order for packed searching to be effective.
In some cases, the underlying packed searcher may not be able to search very short haystacks. When that occurs, the implementation will defer to a slower non-packed searcher (which is still generally faster than Aho-Corasick for a small number of patterns). However, callers may want to avoid ever using the slower variant, which one can do by never passing a haystack shorter than the minimum length returned by this method.
sourcepub fn memory_usage(&self) -> usize
pub fn memory_usage(&self) -> usize
Returns the approximate total amount of heap used by this searcher, in units of bytes.