Struct csv::Reader
A CSV reader.
This reader parses CSV data and exposes records via iterators.
Example
This example shows how to do type-based decoding for each record in the CSV data.
let data = "
sticker,mortals,7
bribed,personae,7
wobbling,poncing,4
interposed,emmett,9
chocolate,refile,7";

let mut rdr = csv::Reader::from_string(data).has_headers(false);
for row in rdr.decode() {
    let (n1, n2, dist): (String, String, u32) = row.unwrap();
    println!("{}, {}: {}", n1, n2, dist);
}
Here's another example that parses tab-delimited values with records of varying length:
let data = "
sticker\tmortals\t7
bribed\tpersonae\t7
wobbling
interposed\temmett\t9
chocolate\trefile\t7";

let mut rdr = csv::Reader::from_string(data)
    .has_headers(false)
    .delimiter(b'\t')
    .flexible(true);
for row in rdr.records() {
    let row = row.unwrap();
    println!("{:?}", row);
}
Methods
impl<R: Read> Reader<R>
pub fn from_reader(rdr: R) -> Reader<R>
Creates a new CSV reader from an arbitrary io::Read.
The reader is buffered for you automatically.
impl Reader<File>
pub fn from_file<P: AsRef<Path>>(path: P) -> Result<Reader<File>>
Creates a new CSV reader for the data at the file path given.
impl Reader<Cursor<Vec<u8>>>
pub fn from_string<'a, S>(s: S) -> Reader<Cursor<Vec<u8>>> where
S: Into<String>,
Creates a CSV reader for an in-memory string buffer.
pub fn from_bytes<'a, V>(bytes: V) -> Reader<Cursor<Vec<u8>>> where
V: Into<Vec<u8>>,
Creates a CSV reader for an in-memory buffer of bytes.
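These constructors work because Cursor<Vec<u8>> implements io::Read, so an owned String or Vec<u8> can back the same generic reader as a file. A std-only illustration (no csv crate involved) of wrapping an in-memory buffer in a Cursor and reading it back:

```rust
use std::io::{Cursor, Read};

fn main() {
    // A Cursor wraps an in-memory buffer and implements io::Read,
    // which is why an owned String or Vec<u8> can back a generic reader.
    let mut cur = Cursor::new(String::from("a,b,c").into_bytes());
    let mut buf = String::new();
    cur.read_to_string(&mut buf).unwrap();
    assert_eq!(buf, "a,b,c");
}
```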
impl<R: Read> Reader<R>
pub fn decode<'a, D: Decodable>(&'a mut self) -> DecodedRecords<'a, R, D>
Uses type-based decoding to read a single record from CSV data.
The type that is being decoded into should correspond to one full CSV record. A tuple, struct or Vec fits this category. A tuple, struct or Vec should consist of primitive types like integers, floats, characters and strings which map to single fields. If a field cannot be decoded into the type requested, an error is returned.
Enums are also supported in a limited way. Namely, their variants must have exactly 1 parameter each. Each variant decodes based on its constituent type, and variants are tried in the order that they appear in their enum definition. See below for examples.
Examples
This example shows how to decode records into a struct. (Note that currently, the names of the struct members are irrelevant.)
extern crate rustc_serialize;

#[derive(RustcDecodable)]
struct Pair {
    name1: String,
    name2: String,
    dist: u32,
}

let mut rdr = csv::Reader::from_string("foo,bar,1\nfoo,baz,2")
                          .has_headers(false);
// Instantiating a specific type when decoding is usually necessary.
let rows = rdr.decode().collect::<csv::Result<Vec<Pair>>>().unwrap();

assert_eq!(rows[0].dist, 1);
assert_eq!(rows[1].dist, 2);
We can get a little crazier with custom enum types or Option types. An Option type in particular is useful when a column doesn't contain valid data in every record (whether it be empty or malformed).
extern crate rustc_serialize;

#[derive(RustcDecodable, PartialEq, Debug)]
struct MyUint(u32);

#[derive(RustcDecodable, PartialEq, Debug)]
enum Number { Integer(i64), Float(f64) }

#[derive(RustcDecodable)]
struct Row {
    name1: String,
    name2: String,
    dist: Option<MyUint>,
    dist2: Number,
}

let mut rdr = csv::Reader::from_string("foo,bar,1,1\nfoo,baz,,1.5")
                          .has_headers(false);
let rows = rdr.decode().collect::<csv::Result<Vec<Row>>>().unwrap();

assert_eq!(rows[0].dist, Some(MyUint(1)));
assert_eq!(rows[1].dist, None);
assert_eq!(rows[0].dist2, Number::Integer(1));
assert_eq!(rows[1].dist2, Number::Float(1.5));
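The order-based variant fallback shown above can be mimicked without any crates. A minimal std-only sketch (the Number enum and decode_number function here are illustrative, not this crate's API): try the first variant's type, and fall back to the next on failure.

```rust
// Hypothetical sketch of "try variants in order": attempt i64 first,
// then fall back to f64, mirroring how `Number { Integer, Float }` decodes.
#[derive(Debug, PartialEq)]
enum Number {
    Integer(i64),
    Float(f64),
}

fn decode_number(field: &str) -> Option<Number> {
    field
        .parse::<i64>()
        .map(Number::Integer)
        .or_else(|_| field.parse::<f64>().map(Number::Float))
        .ok()
}

fn main() {
    assert_eq!(decode_number("1"), Some(Number::Integer(1)));
    assert_eq!(decode_number("1.5"), Some(Number::Float(1.5)));
    assert_eq!(decode_number("abc"), None);
}
```

Because i64 is tried first, a field like "1" becomes Integer(1) rather than Float(1.0); swapping the variant order would change the result.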
Finally, as a special case, a tuple/struct/Vec can be used as the "tail" of another tuple/struct/Vec to capture all remaining fields:
extern crate rustc_serialize;

#[derive(RustcDecodable)]
struct Pair {
    name1: String,
    name2: String,
    attrs: Vec<u32>,
}

let mut rdr = csv::Reader::from_string("a,b,1,2,3,4\ny,z,5,6,7,8")
                          .has_headers(false);
let rows = rdr.decode().collect::<csv::Result<Vec<Pair>>>().unwrap();

assert_eq!(rows[0].attrs, vec![1,2,3,4]);
assert_eq!(rows[1].attrs, vec![5,6,7,8]);
If a tuple/struct/Vec appears anywhere other than the "tail" of a record, then the behavior is undefined. (You'll likely get a runtime error. I believe this is a limitation of the current decoding machinery in the serialize crate.)
pub fn records<'a>(&'a mut self) -> StringRecords<'a, R>
Returns an iterator of records in the CSV data where each field is a String.
Example
This is your standard CSV interface with no type decoding magic.
let data = "
sticker,mortals,7
bribed,personae,7
wobbling,poncing,4
interposed,emmett,9
chocolate,refile,7";

let mut rdr = csv::Reader::from_string(data).has_headers(false);
for row in rdr.records() {
    let row = row.unwrap();
    println!("{:?}", row);
}
pub fn headers(&mut self) -> Result<Vec<String>>
Returns a copy of the first record in the CSV data as strings.

This method may be called at any time, regardless of whether has_headers is set or not.
Example
let mut rdr = csv::Reader::from_string("a,b,c\n1,2,3");

let headers1 = rdr.headers().unwrap();
let rows = rdr.records().collect::<csv::Result<Vec<_>>>().unwrap();
let headers2 = rdr.headers().unwrap();

let s = |s: &'static str| s.to_string();
assert_eq!(headers1, headers2);
assert_eq!(headers1, vec![s("a"), s("b"), s("c")]);
assert_eq!(rows.len(), 1);
assert_eq!(rows[0], vec![s("1"), s("2"), s("3")]);
Note that if has_headers(false) is called on the CSV reader, the rows returned in this example include the first record:
let mut rdr = csv::Reader::from_string("a,b,c\n1,2,3")
                          .has_headers(false);

let headers1 = rdr.headers().unwrap();
let rows = rdr.records().collect::<csv::Result<Vec<_>>>().unwrap();
let headers2 = rdr.headers().unwrap();

let s = |s: &'static str| s.to_string();
assert_eq!(headers1, headers2);
assert_eq!(headers1, vec![s("a"), s("b"), s("c")]);

// The header rows are now part of the record iterators.
assert_eq!(rows.len(), 2);
assert_eq!(rows[0], headers1);
assert_eq!(rows[1], vec![s("1"), s("2"), s("3")]);
impl<R: Read> Reader<R>
pub fn delimiter(self, delimiter: u8) -> Reader<R>
The delimiter to use when reading CSV data.
Since the CSV reader is meant to be mostly encoding agnostic, you must specify the delimiter as a single ASCII byte. For example, to read tab-delimited data, you would use b'\t'.

The default value is b','.
pub fn has_headers(self, yes: bool) -> Reader<R>
Whether to treat the first row as a special header row.
By default, the first row is treated as a special header row, which means it is excluded from iterators returned by the decode, records or byte_records methods. When yes is set to false, the first row is included in those iterators.

Note that the headers method is unaffected by whether this is set.
pub fn flexible(self, yes: bool) -> Reader<R>
Whether to allow flexible length records when reading CSV data.

When this is set to true, records in the CSV data can have different lengths. By default, this is disabled, which will cause the CSV reader to return an error if it tries to read a record that has a different length than the first record that it read.
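The default strict behavior can be illustrated without the csv crate; a std-only sketch where check_lengths is a hypothetical stand-in for the reader's internal check, not part of this crate:

```rust
// Hypothetical sketch of the default (non-flexible) length check:
// every record must match the length of the first record read.
fn check_lengths(records: &[Vec<&str>]) -> Result<(), String> {
    let expected = match records.first() {
        Some(r) => r.len(),
        None => return Ok(()),
    };
    for (i, rec) in records.iter().enumerate() {
        if rec.len() != expected {
            return Err(format!(
                "record {} has {} fields, expected {}",
                i, rec.len(), expected
            ));
        }
    }
    Ok(())
}

fn main() {
    assert!(check_lengths(&[vec!["a", "b"], vec!["c", "d"]]).is_ok());
    // A short record is rejected, as the non-flexible reader would do.
    assert!(check_lengths(&[vec!["a", "b"], vec!["c"]]).is_err());
}
```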
pub fn record_terminator(self, term: RecordTerminator) -> Reader<R>
Set the record terminator to use when reading CSV data.
In the vast majority of situations, you'll want to use the default value, RecordTerminator::CRLF, which automatically handles \r, \n or \r\n as record terminators. (Notably, this is a special case since two characters can correspond to a single terminator token.)

However, you may use RecordTerminator::Any to specify any ASCII character to use as the record terminator. For example, you could use RecordTerminator::Any(b'\n') to only accept line feeds as record terminators, or b'\x1e' for the ASCII record separator.
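The two-byte \r\n special case can be sketched std-only (terminator_len is illustrative, not this crate's internals): \r\n must be consumed as a single terminator, or an empty record would appear between the \r and the \n.

```rust
// Hypothetical sketch of CRLF terminator handling: \r, \n, and \r\n
// each count as exactly one record terminator; \r\n is the two-byte
// special case, so it must be checked before the single-byte cases.
fn terminator_len(bytes: &[u8]) -> usize {
    match bytes {
        [b'\r', b'\n', ..] => 2,
        [b'\r', ..] | [b'\n', ..] => 1,
        _ => 0,
    }
}

fn main() {
    assert_eq!(terminator_len(b"\r\nnext"), 2);
    assert_eq!(terminator_len(b"\nnext"), 1);
    assert_eq!(terminator_len(b"\rnext"), 1);
    assert_eq!(terminator_len(b"next"), 0);
}
```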
pub fn quote(self, quote: u8) -> Reader<R>
Set the quote character to use when reading CSV data.
Since the CSV reader is meant to be mostly encoding agnostic, you must specify the quote as a single ASCII byte. For example, to read single-quoted data, you would use b'\''.

The default value is b'"'.

If quote is None, then no quoting will be used.
pub fn escape(self, escape: Option<u8>) -> Reader<R>
Set the escape character to use when reading CSV data.
Since the CSV reader is meant to be mostly encoding agnostic, you must specify the escape as a single ASCII byte.
When set to None (which is the default), the "doubling" escape is used for the quote character.

When set to something other than None, it is used as the escape character for quotes (e.g., b'\\').
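The two escape styles for the contents of a quoted field can be contrasted std-only (both helper functions are hypothetical, not this crate's internals):

```rust
// Hypothetical sketch of the two quote-escape styles for the contents
// of a quoted field: the default "doubling" style ("" -> ") versus an
// explicit escape byte such as b'\\' (\" -> ").
fn unescape_doubled(field: &str) -> String {
    field.replace("\"\"", "\"")
}

fn unescape_with(field: &str, esc: char) -> String {
    let mut out = String::new();
    let mut chars = field.chars();
    while let Some(c) = chars.next() {
        if c == esc {
            // The character after the escape byte is taken literally.
            if let Some(next) = chars.next() {
                out.push(next);
            }
        } else {
            out.push(c);
        }
    }
    out
}

fn main() {
    assert_eq!(unescape_doubled(r#"say ""hi"""#), r#"say "hi""#);
    assert_eq!(unescape_with(r#"say \"hi\""#, '\\'), r#"say "hi""#);
}
```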
pub fn double_quote(self, yes: bool) -> Reader<R>
Enable double quote escapes.
When disabled, doubled quotes are not interpreted as escapes.
pub fn ascii(self) -> Reader<R>
A convenience method for reading ASCII delimited text.
This sets the delimiter and record terminator to the ASCII unit separator (\x1f) and record separator (\x1e), respectively.

Since ASCII delimited text is meant to be unquoted, this also sets quote to None.
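For intuition, ASCII delimited text can be parsed with plain string splitting; a std-only sketch (parse_ascii_delimited is illustrative, not this crate's API):

```rust
// Hypothetical sketch of ASCII delimited text: records end with the
// record separator \x1e and fields are split on the unit separator \x1f.
fn parse_ascii_delimited(data: &str) -> Vec<Vec<&str>> {
    data.split('\x1e')
        // A trailing record separator leaves an empty final chunk.
        .filter(|rec| !rec.is_empty())
        .map(|rec| rec.split('\x1f').collect())
        .collect()
}

fn main() {
    let data = "a\x1fb\x1ec\x1fd\x1e";
    assert_eq!(
        parse_ascii_delimited(data),
        vec![vec!["a", "b"], vec!["c", "d"]]
    );
}
```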
impl<R: Read> Reader<R>
These are low level methods for dealing with the raw bytes of CSV records. You should only need to use these when you need the performance or if your CSV data isn't UTF-8 encoded.
pub fn byte_headers(&mut self) -> Result<Vec<ByteString>>
This is just like headers, except fields are ByteStrings instead of Strings.
pub fn byte_records<'a>(&'a mut self) -> ByteRecords<'a, R>
This is just like records, except fields are ByteStrings instead of Strings.
pub fn done(&self) -> bool
Returns true if the CSV parser has reached its final state. When this method returns true, all iterators will always return None.
This is not needed in typical usage since the record iterators will stop for you when the parser completes. This method is useful when you're accessing the parser's lowest-level iterator.
Example
This is the fastest way to compute the number of records in CSV data using this crate. (It is fast because it does not allocate space for every field.)
let data = "
sticker,mortals,7
bribed,personae,7
wobbling,poncing,4
interposed,emmett,9
chocolate,refile,7";

let mut rdr = csv::Reader::from_string(data);
let mut count = 0u64;
while !rdr.done() {
    loop {
        // This case analysis is necessary because we only want to
        // increment the count when `EndOfRecord` is seen. (If the
        // CSV data is empty, then it will never be emitted.)
        match rdr.next_bytes() {
            csv::NextField::EndOfCsv => break,
            csv::NextField::EndOfRecord => { count += 1; break; },
            csv::NextField::Error(err) => panic!(err),
            csv::NextField::Data(_) => {}
        }
    }
}
assert_eq!(count, 5);
pub fn next_bytes(&mut self) -> NextField<[u8]>
An iterator over fields in the current record.

This provides low-level access to CSV records as raw byte slices. Namely, no allocation is performed. Unlike other iterators in this crate, this yields fields instead of records. Notably, this cannot implement the Iterator trait safely. As such, it cannot be used with a for loop.

See the documentation for the NextField type on how the iterator works.
This iterator always returns all records (i.e., it won't skip the header row).
Example
This method is most useful when used in conjunction with the done method:
let data = "
sticker,mortals,7
bribed,personae,7
wobbling,poncing,4
interposed,emmett,9
chocolate,refile,7";

let mut rdr = csv::Reader::from_string(data);
while !rdr.done() {
    while let Some(r) = rdr.next_bytes().into_iter_result() {
        print!("{:?} ", r.unwrap());
    }
    println!("");
}
pub fn next_str(&mut self) -> NextField<str>
This is just like next_bytes, except it converts each field to a Unicode string in place.
pub fn byte_offset(&self) -> u64
Returns the byte offset at which the current record started.
impl<R: Read + Seek> Reader<R>
pub fn seek(&mut self, pos: u64) -> Result<()>
Seeks the underlying reader to the file cursor specified.
This comes with several caveats:
- The existing buffer is dropped and a new one is created.
- If you seek to a position other than the start of a record, you'll probably get an incorrect parse. (This is not unsafe.)
Mostly, this is intended for use with the index sub-module.

Note that if pos is equivalent to the current parsed byte offset, then no seeking is performed. (In this case, seek is a no-op.)
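The record-boundary caveat can be seen with a plain std Cursor (no csv crate involved): seeking to the byte offset where a record starts leaves well-formed data ahead of the cursor, while seeking mid-record would leave a partial field.

```rust
use std::io::{Cursor, Read, Seek, SeekFrom};

fn main() {
    let mut cur = Cursor::new("a,b\nc,d\n".as_bytes());
    // Offset 4 is the start of the second record ("c,d\n");
    // seeking to, say, offset 5 would land mid-record.
    cur.seek(SeekFrom::Start(4)).unwrap();
    let mut rest = String::new();
    cur.read_to_string(&mut rest).unwrap();
    assert_eq!(rest, "c,d\n");
}
```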
Auto Trait Implementations

Blanket Implementations

impl<T> From<T> for T

impl<T, U> Into<U> for T where
    U: From<T>,

impl<T, U> TryFrom<U> for T where
    U: Into<T>,

    type Error = Infallible

    The type returned in the event of a conversion error.

    fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T> Borrow<T> for T where
    T: ?Sized,

impl<T> Any for T where
    T: 'static + ?Sized,

impl<T> BorrowMut<T> for T where
    T: ?Sized,

    fn borrow_mut(&mut self) -> &mut T

impl<T, U> TryInto<U> for T where
    U: TryFrom<T>,