Struct regex::Regex

source ·

pub struct Regex { /* private fields */ }

Expand description

A compiled regular expression for searching Unicode haystacks.

A Regex can be used to search haystacks, split haystacks into substrings or replace substrings in a haystack with a different substring. All searching is done with an implicit (?s:.)*? at the beginning and end of an pattern. To force an expression to match the whole string (or a prefix or a suffix), you must use an anchor like ^ or $ (or \A and \z).

While this crate will handle Unicode strings (whether in the regular expression or in the haystack), all positions returned are byte offsets. Every byte offset is guaranteed to be at a Unicode code point boundary. That is, all offsets returned by the Regex API are guaranteed to be ranges that can slice a &str without panicking. If you want to relax this requirement, then you must search &[u8] haystacks with a bytes::Regex.

The only methods that allocate new strings are the string replacement methods. All other methods (searching and splitting) return borrowed references into the haystack given.

Example

Find the offsets of a US phone number:

use regex::Regex;

let re = Regex::new("[0-9]{3}-[0-9]{3}-[0-9]{4}").unwrap();
let m = re.find("phone: 111-222-3333").unwrap();
assert_eq!(7..19, m.range());

Example: extracting capture groups

A common way to use regexes is with capture groups. That is, instead of just looking for matches of an entire regex, parentheses are used to create groups that represent part of the match.

For example, consider a haystack with multiple lines, and each line has three whitespace delimited fields where the second field is expected to be a number and the third field a boolean. To make this convenient, we use the Captures::extract API to put the strings that match each group into a fixed size array:

use regex::Regex;

let hay = "
rabbit         54 true
groundhog 2 true
does not match
fox   109    false
";
let re = Regex::new(r"(?m)^\s*(\S+)\s+([0-9]+)\s+(true|false)\s*$").unwrap();
let mut fields: Vec<(&str, i64, bool)> = vec![];
for (_, [f1, f2, f3]) in re.captures_iter(hay).map(|caps| caps.extract()) {
    fields.push((f1, f2.parse()?, f3.parse()?));
}
assert_eq!(fields, vec![
    ("rabbit", 54, true),
    ("groundhog", 2, true),
    ("fox", 109, false),
]);

Example: searching with the `Pattern` trait

Note: This section requires that this crate is compiled with the pattern Cargo feature enabled, which requires nightly Rust.

Since Regex implements Pattern from the standard library, one can use regexes with methods defined on &str. For example, is_match, find, find_iter and split can, in some cases, be replaced with str::contains, str::find, str::match_indices and str::split.

Here are some examples:

use regex::Regex;

let re = Regex::new(r"\d+").unwrap();
let hay = "a111b222c";

assert!(hay.contains(&re));
assert_eq!(hay.find(&re), Some(1));
assert_eq!(hay.match_indices(&re).collect::<Vec<_>>(), vec![
    (1, "111"),
    (5, "222"),
]);
assert_eq!(hay.split(&re).collect::<Vec<_>>(), vec!["a", "b", "c"]);

Struct regex::Regex

Implementations§

impl Regex

pub fn new(re: &str) -> Result<Regex, Error>

pub fn is_match(&self, haystack: &str) -> bool

pub fn find<'h>(&self, haystack: &'h str) -> Option<Match<'h>>

pub fn find_iter<'r, 'h>(&'r self, haystack: &'h str) -> Matches<'r, 'h> ⓘ

pub fn captures<'h>(&self, haystack: &'h str) -> Option<Captures<'h>>

pub fn captures_iter<'r, 'h>( &'r self, haystack: &'h str ) -> CaptureMatches<'r, 'h> ⓘ

pub fn split<'r, 'h>(&'r self, haystack: &'h str) -> Split<'r, 'h> ⓘ

pub fn splitn<'r, 'h>( &'r self, haystack: &'h str, limit: usize ) -> SplitN<'r, 'h> ⓘ

pub fn replace<'h, R: Replacer>( &self, haystack: &'h str, rep: R ) -> Cow<'h, str>

pub fn replace_all<'h, R: Replacer>( &self, haystack: &'h str, rep: R ) -> Cow<'h, str>

pub fn replacen<'h, R: Replacer>( &self, haystack: &'h str, limit: usize, rep: R ) -> Cow<'h, str>

impl Regex

pub fn shortest_match(&self, haystack: &str) -> Option<usize>

pub fn shortest_match_at(&self, haystack: &str, start: usize) -> Option<usize>

pub fn is_match_at(&self, haystack: &str, start: usize) -> bool

pub fn find_at<'h>(&self, haystack: &'h str, start: usize) -> Option<Match<'h>>

pub fn captures_at<'h>( &self, haystack: &'h str, start: usize ) -> Option<Captures<'h>>

pub fn captures_read<'h>( &self, locs: &mut CaptureLocations, haystack: &'h str ) -> Option<Match<'h>>

pub fn captures_read_at<'h>( &self, locs: &mut CaptureLocations, haystack: &'h str, start: usize ) -> Option<Match<'h>>

impl Regex

pub fn as_str(&self) -> &str

pub fn capture_names(&self) -> CaptureNames<'_> ⓘ

pub fn captures_len(&self) -> usize

pub fn static_captures_len(&self) -> Option<usize>

pub fn capture_locations(&self) -> CaptureLocations

Trait Implementations§

impl Clone for Regex

fn clone(&self) -> Regex

fn clone_from(&mut self, source: &Self)

impl Debug for Regex

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl Display for Regex

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl FromStr for Regex

fn from_str(s: &str) -> Result<Regex, Error>

type Err = Error

impl TryFrom<&str> for Regex

fn try_from(s: &str) -> Result<Regex, Error>

type Error = Error

impl TryFrom<String> for Regex

fn try_from(s: String) -> Result<Regex, Error>

type Error = Error

Auto Trait Implementations§

impl RefUnwindSafe for Regex

impl Send for Regex

impl Sync for Regex

impl Unpin for Regex

impl UnwindSafe for Regex

Blanket Implementations§

impl<T> Any for Twhere T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for Twhere T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for Twhere T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Into<U> for Twhere U: From<T>,

fn into(self) -> U

impl<T> ToOwned for Twhere T: Clone,

type Owned = T

fn to_owned(&self) -> T

fn clone_into(&self, target: &mut T)

impl<T> ToString for Twhere T: Display + ?Sized,

default fn to_string(&self) -> String

impl<T, U> TryFrom<U> for Twhere U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for Twhere U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

impl<T> Any for T
where T: 'static + ?Sized,

impl<T> Borrow<T> for T
where T: ?Sized,

impl<T> BorrowMut<T> for T
where T: ?Sized,

impl<T, U> Into<U> for T
where U: From<T>,

impl<T> ToOwned for T
where T: Clone,

impl<T> ToString for T
where T: Display + ?Sized,

impl<T, U> TryFrom<U> for T
where U: Into<T>,

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,