nom_locate

Struct LocatedSpan

Source
pub struct LocatedSpan<T, X = ()> {
    pub extra: X,
    /* private fields */
}
Expand description

A LocatedSpan is a set of meta information about the location of a token, including extra information.

The LocatedSpan structure can be used as an input of the nom parsers. It implements all the necessary traits for LocatedSpan<&str,X> and LocatedSpan<&[u8],X>

Fields§

§extra: X

Extra information that can be embedded by the user. Example: the parsed file name

Implementations§

Source§

impl<T> LocatedSpan<T, ()>

Source

pub fn new(program: T) -> LocatedSpan<T, ()>

Create a span for a particular input with default offset and line values and empty extra data. You can compute the column through the get_column or get_utf8_column methods.

offset starts at 0, line starts at 1, and column starts at 1.

Do not use this constructor in parser functions; nom and nom_locate assume span offsets are relative to the beginning of the same input. In these cases, you probably want to use the nom::traits::Slice trait instead.

§Example of use
use nom_locate::LocatedSpan;

let span = LocatedSpan::new(b"foobar");

assert_eq!(span.location_offset(), 0);
assert_eq!(span.location_line(),   1);
assert_eq!(span.get_column(),      1);
assert_eq!(span.fragment(),        &&b"foobar"[..]);
Source§

impl<T, X> LocatedSpan<T, X>

Source

pub fn new_extra(program: T, extra: X) -> LocatedSpan<T, X>

Create a span for a particular input with default offset and line values. You can compute the column through the get_column or get_utf8_column methods.

offset starts at 0, line starts at 1, and column starts at 1.

Do not use this constructor in parser functions; nom and nom_locate assume span offsets are relative to the beginning of the same input. In these cases, you probably want to use the nom::traits::Slice trait instead.

§Example of use
use nom_locate::LocatedSpan;

let span = LocatedSpan::new_extra(b"foobar", "extra");

assert_eq!(span.location_offset(), 0);
assert_eq!(span.location_line(),   1);
assert_eq!(span.get_column(),      1);
assert_eq!(span.fragment(),        &&b"foobar"[..]);
assert_eq!(span.extra,             "extra");
Source

pub unsafe fn new_from_raw_offset( offset: usize, line: u32, fragment: T, extra: X, ) -> LocatedSpan<T, X>

Similar to new_extra, but allows overriding offset and line. This is unsafe, because giving an offset too large may result in undefined behavior, as some methods move back along the fragment assuming any negative index within the offset is valid.

Source

pub fn location_offset(&self) -> usize

The offset represents the position of the fragment relatively to the input of the parser. It starts at offset 0.

Source

pub fn location_line(&self) -> u32

The line number of the fragment relatively to the input of the parser. It starts at line 1.

Source

pub fn fragment(&self) -> &T

The fragment that is spanned. The fragment represents a part of the input of the parser.

Source

pub fn map_extra<U, F: FnOnce(X) -> U>(self, f: F) -> LocatedSpan<T, U>

Transform the extra inside into another type

§Example of use
use nom::{
  IResult,
  combinator::{recognize, map_res},
  sequence::{terminated, tuple},
  character::{complete::{char, one_of}, is_digit},
  bytes::complete::{tag, take_while1}
};

fn decimal(input: LocatedSpan<&str>) -> IResult<LocatedSpan<&str>, LocatedSpan<&str>> {
  recognize(
       take_while1(|c| is_digit(c as u8) || c == '_')
  )(input)
}

fn main() {
    let span = LocatedSpan::new("$10");
    // matches the $ and then matches the decimal number afterwards,
    // converting it into a `u8` and putting that value in the span
    let (_, (_, n)) = tuple((
                        tag("$"),
                        map_res(
                            decimal,
                            |x| x.fragment().parse::<u8>().map(|n| x.map_extra(|_| n))
                        )
                      ))(span).unwrap();
    assert_eq!(n.extra, 10);
}
Source

pub fn into_fragment(self) -> T

Takes ownership of the fragment without (re)borrowing it.

§Example of use
use nom::{
    IResult,
    bytes::complete::{take_till, tag},
    combinator::rest,
};

fn parse_pair<'a>(input: LocatedSpan<&'a str>) -> IResult<LocatedSpan<&'a str>, (&'a str, &'a str)> {
    let (input, key) = take_till(|c| c == '=')(input)?;
    let (input, _) = tag("=")(input)?;
    let (input, value) = rest(input)?;

    Ok((input, (key.into_fragment(), value.into_fragment())))
}

fn main() {
    let span = LocatedSpan::new("key=value");
    let (_, pair) = parse_pair(span).unwrap();
    assert_eq!(pair, ("key", "value"));
}
Source

pub fn into_fragment_and_extra(self) -> (T, X)

Takes ownership of the fragment and extra data without (re)borrowing them.

Source§

impl<T: AsBytes, X> LocatedSpan<T, X>

Source

pub fn get_line_beginning(&self) -> &[u8]

Return the line that contains this LocatedSpan.

The get_column and get_utf8_column functions returns indexes that corresponds to the line returned by this function.

Note that if this LocatedSpan ends before the end of the original data, the result of calling get_line_beginning() will not include any data from after the LocatedSpan.

let program = LocatedSpan::new(
    "Hello World!\
    \nThis is a multi-line input\
    \nthat ends after this line.\n");
let multi = program.find_substring("multi").unwrap();

assert_eq!(
    program.slice(multi..).get_line_beginning(),
    "This is a multi-line input".as_bytes(),
);
Source

pub fn get_column(&self) -> usize

Return the column index, assuming 1 byte = 1 column.

Use it for ascii text, or use get_utf8_column for UTF8.

§Example of use

let span = LocatedSpan::new("foobar");

assert_eq!(span.slice(3..).get_column(), 4);
Source

pub fn get_utf8_column(&self) -> usize

Return the column index for UTF8 text. Return value is unspecified for non-utf8 text.

This version uses bytecount’s hyper algorithm to count characters. This is much faster for long lines, but is non-negligibly slower for short slices (below around 100 bytes). This is also sped up significantly more depending on architecture and enabling the simd feature gates. If you expect primarily short lines, you may get a noticeable speedup in parsing by using naive_get_utf8_column instead. Benchmark your specific use case!

§Example of use

let span = LocatedSpan::new("メカジキ");
let indexOf3dKanji = span.find_substring("ジ").unwrap();

assert_eq!(span.slice(indexOf3dKanji..).get_column(), 7);
assert_eq!(span.slice(indexOf3dKanji..).get_utf8_column(), 3);
Source

pub fn naive_get_utf8_column(&self) -> usize

Return the column index for UTF8 text. Return value is unspecified for non-utf8 text.

A simpler implementation of get_utf8_column that may be faster on shorter lines. If benchmarking shows that this is faster, you can use it instead of get_utf8_column. Prefer defaulting to get_utf8_column unless this legitimately is a performance bottleneck.

§Example of use

let span = LocatedSpan::new("メカジキ");
let indexOf3dKanji = span.find_substring("ジ").unwrap();

assert_eq!(span.slice(indexOf3dKanji..).get_column(), 7);
assert_eq!(span.slice(indexOf3dKanji..).naive_get_utf8_column(), 3);

Trait Implementations§

Source§

impl<T: AsBytes, X> AsBytes for LocatedSpan<T, X>

Source§

fn as_bytes(&self) -> &[u8]

Casts the input type to a byte slice
Source§

impl<T, U, X> AsRef<U> for LocatedSpan<&T, X>
where T: ?Sized + AsRef<U>, U: ?Sized,

Source§

fn as_ref(&self) -> &U

Converts this type into a shared reference of the (usually inferred) input type.
Source§

impl<T: Clone, X: Clone> Clone for LocatedSpan<T, X>

Source§

fn clone(&self) -> LocatedSpan<T, X>

Returns a copy of the value. Read more
1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl<A: Compare<B>, B: Into<LocatedSpan<B>>, X> Compare<B> for LocatedSpan<A, X>

Source§

fn compare(&self, t: B) -> CompareResult

Compares self to another value for equality
Source§

fn compare_no_case(&self, t: B) -> CompareResult

Compares self to another value for equality independently of the case. Read more
Source§

impl<T: Debug, X: Debug> Debug for LocatedSpan<T, X>

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl<T, X> Deref for LocatedSpan<T, X>

Source§

type Target = T

The resulting type after dereferencing.
Source§

fn deref(&self) -> &Self::Target

Dereferences the value.
Source§

impl<T: ToString, X> Display for LocatedSpan<T, X>

Source§

fn fmt(&self, fmt: &mut Formatter<'_>) -> FmtResult

Formats the value using the given formatter. Read more
Source§

impl<'a, T, X> ExtendInto for LocatedSpan<T, X>
where T: ExtendInto,

Source§

type Item = <T as ExtendInto>::Item

The current input type is a sequence of that Item type. Read more
Source§

type Extender = <T as ExtendInto>::Extender

The type that will be produced
Source§

fn new_builder(&self) -> Self::Extender

Create a new Extend of the correct type
Source§

fn extend_into(&self, acc: &mut Self::Extender)

Accumulate the input into an accumulator
Source§

impl<T, U, X> FindSubstring<U> for LocatedSpan<T, X>
where T: FindSubstring<U>,

Source§

fn find_substring(&self, substr: U) -> Option<usize>

Returns the byte position of the substring if it is found
Source§

impl<Fragment: FindToken<Token>, Token, X> FindToken<Token> for LocatedSpan<Fragment, X>

Source§

fn find_token(&self, token: Token) -> bool

Returns true if self contains the token
Source§

impl<T: AsBytes, X: Default> From<T> for LocatedSpan<T, X>

Source§

fn from(i: T) -> Self

Converts to this type from the input type.
Source§

impl<T: Hash, X> Hash for LocatedSpan<T, X>

Source§

fn hash<H: Hasher>(&self, state: &mut H)

Feeds this value into the given Hasher. Read more
1.3.0 · Source§

fn hash_slice<H>(data: &[Self], state: &mut H)
where H: Hasher, Self: Sized,

Feeds a slice of this type into the given Hasher. Read more
Source§

impl<'a, T, X> InputIter for LocatedSpan<T, X>
where T: InputIter,

Source§

type Item = <T as InputIter>::Item

The current input type is a sequence of that Item type. Read more
Source§

type Iter = <T as InputIter>::Iter

An iterator over the input type, producing the item and its position for use with Slice. If we’re iterating over &str, the position corresponds to the byte index of the character
Source§

type IterElem = <T as InputIter>::IterElem

An iterator over the input type, producing the item
Source§

fn iter_indices(&self) -> Self::Iter

Returns an iterator over the elements and their byte offsets
Source§

fn iter_elements(&self) -> Self::IterElem

Returns an iterator over the elements
Source§

fn position<P>(&self, predicate: P) -> Option<usize>
where P: Fn(Self::Item) -> bool,

Finds the byte position of the element
Source§

fn slice_index(&self, count: usize) -> Result<usize, Needed>

Get the byte offset from the element’s position in the stream
Source§

impl<T: InputLength, X> InputLength for LocatedSpan<T, X>

Source§

fn input_len(&self) -> usize

Calculates the input length, as indicated by its name, and the name of the trait itself
Source§

impl<T, X> InputTake for LocatedSpan<T, X>
where Self: Slice<RangeFrom<usize>> + Slice<RangeTo<usize>>,

Source§

fn take(&self, count: usize) -> Self

Returns a slice of count bytes. panics if count > length
Source§

fn take_split(&self, count: usize) -> (Self, Self)

Split the stream at the count byte offset. panics if count > length
Source§

impl<T, X> InputTakeAtPosition for LocatedSpan<T, X>

Source§

type Item = <T as InputIter>::Item

The current input type is a sequence of that Item type. Read more
Source§

fn split_at_position_complete<P, E: ParseError<Self>>( &self, predicate: P, ) -> IResult<Self, Self, E>
where P: Fn(Self::Item) -> bool,

Looks for the first element of the input type for which the condition returns true, and returns the input up to this position. Read more
Source§

fn split_at_position<P, E: ParseError<Self>>( &self, predicate: P, ) -> IResult<Self, Self, E>
where P: Fn(Self::Item) -> bool,

Looks for the first element of the input type for which the condition returns true, and returns the input up to this position. Read more
Source§

fn split_at_position1<P, E: ParseError<Self>>( &self, predicate: P, e: ErrorKind, ) -> IResult<Self, Self, E>
where P: Fn(Self::Item) -> bool,

Looks for the first element of the input type for which the condition returns true and returns the input up to this position. Read more
Source§

fn split_at_position1_complete<P, E: ParseError<Self>>( &self, predicate: P, e: ErrorKind, ) -> IResult<Self, Self, E>
where P: Fn(Self::Item) -> bool,

Looks for the first element of the input type for which the condition returns true and returns the input up to this position. Read more
Source§

impl<T, X> Offset for LocatedSpan<T, X>

Source§

fn offset(&self, second: &Self) -> usize

Offset between the first byte of self and the first byte of the argument
Source§

impl<R: FromStr, T, X> ParseTo<R> for LocatedSpan<T, X>
where T: ParseTo<R>,

Source§

fn parse_to(&self) -> Option<R>

Succeeds if parse() succeeded. The byte slice implementation will first convert it to a &str, then apply the parse() function
Source§

impl<T: AsBytes + PartialEq, X> PartialEq for LocatedSpan<T, X>

Source§

fn eq(&self, other: &Self) -> bool

Tests for self and other values to be equal, and is used by ==.
1.0.0 · Source§

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.
Source§

impl<'a, T, R, X: Clone> Slice<R> for LocatedSpan<T, X>
where T: Slice<R> + Offset + AsBytes + Slice<RangeTo<usize>>,

Source§

fn slice(&self, range: R) -> Self

Slices self according to the range argument
Source§

impl<T: Copy, X: Copy> Copy for LocatedSpan<T, X>

Source§

impl<T: AsBytes + Eq, X> Eq for LocatedSpan<T, X>

Auto Trait Implementations§

§

impl<T, X> Freeze for LocatedSpan<T, X>
where T: Freeze, X: Freeze,

§

impl<T, X> RefUnwindSafe for LocatedSpan<T, X>

§

impl<T, X> Send for LocatedSpan<T, X>
where T: Send, X: Send,

§

impl<T, X> Sync for LocatedSpan<T, X>
where T: Sync, X: Sync,

§

impl<T, X> Unpin for LocatedSpan<T, X>
where T: Unpin, X: Unpin,

§

impl<T, X> UnwindSafe for LocatedSpan<T, X>
where T: UnwindSafe, X: UnwindSafe,

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dst: *mut T)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dst. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<P, T> Receiver for P
where P: Deref<Target = T> + ?Sized, T: ?Sized,

Source§

type Target = T

🔬This is a nightly-only experimental API. (arbitrary_self_types)
The target type on which the method may be called.
Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T> ToString for T
where T: Display + ?Sized,

Source§

default fn to_string(&self) -> String

Converts the given value to a String. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.