Class: CSV
- Inherits:
-
Object
- Object
- CSV
- Extended by:
- Forwardable
- Includes:
- Enumerable
- Defined in:
- lib/csv.rb,
lib/csv/row.rb,
lib/csv/table.rb,
lib/csv/parser.rb,
lib/csv/writer.rb,
lib/csv/version.rb,
lib/csv/fields_converter.rb,
lib/csv/input_record_separator.rb
Overview
CSV
CSV Data
CSV (comma-separated values) data is a text representation of a table:
-
A row separator delimits table rows. A common row separator is the newline character
"\n"
. -
A column separator delimits fields in a row. A common column separator is the comma character
","
.
This CSV String, with row separator "\n"
and column separator ","
, has three rows and two columns:
"foo,0\nbar,1\nbaz,2\n"
Despite the name CSV, a CSV representation can use different separators.
For more about tables, see the Wikipedia article “Table (information)”, especially its section “Simple table”
Class CSV
Class CSV provides methods for:
-
Parsing CSV data from a String object, a File (via its file path), or an IO object.
-
Generating CSV data to a String object.
To make CSV available:
require 'csv'
All examples here assume that this has been done.
Keeping It Simple
A CSV object has dozens of instance methods that offer fine-grained control of parsing and generating CSV data. For many needs, though, simpler approaches will do.
This section summarizes the singleton methods in CSV that allow you to parse and generate without explicitly creating CSV objects. For details, follow the links.
Simple Parsing
Parsing methods commonly return either of:
-
An Array of Arrays of Strings:
-
The outer Array is the entire “table”.
-
Each inner Array is a row.
-
Each String is a field.
-
-
A CSV::Table object. For details, see CSV with Headers.
Parsing a String
The input to be parsed can be a string:
string = "foo,0\nbar,1\nbaz,2\n"
Method CSV.parse returns the entire CSV data:
CSV.parse(string) # => [["foo", "0"], ["bar", "1"], ["baz", "2"]]
Method CSV.parse_line returns only the first row:
CSV.parse_line(string) # => ["foo", "0"]
CSV extends class String with instance method String#parse_csv, which also returns only the first row:
string.parse_csv # => ["foo", "0"]
Parsing Via a File Path
The input to be parsed can be in a file:
string = "foo,0\nbar,1\nbaz,2\n"
path = 't.csv'
File.write(path, string)
Method CSV.read returns the entire CSV data:
CSV.read(path) # => [["foo", "0"], ["bar", "1"], ["baz", "2"]]
Method CSV.foreach iterates, passing each row to the given block:
CSV.foreach(path) do |row|
p row
end
Output:
["foo", "0"]
["bar", "1"]
["baz", "2"]
Method CSV.table returns the entire CSV data as a CSV::Table object:
CSV.table(path) # => #<CSV::Table mode:col_or_row row_count:3>
Parsing from an Open IO Stream
The input to be parsed can be in an open IO stream:
Method CSV.read returns the entire CSV data:
File.open(path) do |file|
CSV.read(file)
end # => [["foo", "0"], ["bar", "1"], ["baz", "2"]]
As does method CSV.parse:
File.open(path) do |file|
CSV.parse(file)
end # => [["foo", "0"], ["bar", "1"], ["baz", "2"]]
Method CSV.parse_line returns only the first row:
File.open(path) do |file|
CSV.parse_line(file)
end # => ["foo", "0"]
Method CSV.foreach iterates, passing each row to the given block:
File.open(path) do |file|
CSV.foreach(file) do |row|
p row
end
end
Output:
["foo", "0"]
["bar", "1"]
["baz", "2"]
Method CSV.table returns the entire CSV data as a CSV::Table object:
File.open(path) do |file|
CSV.table(file)
end # => #<CSV::Table mode:col_or_row row_count:3>
Simple Generating
Method CSV.generate returns a String; this example uses method CSV#<< to append the rows that are to be generated:
output_string = CSV.generate do |csv|
csv << ['foo', 0]
csv << ['bar', 1]
csv << ['baz', 2]
end
output_string # => "foo,0\nbar,1\nbaz,2\n"
Method CSV.generate_line returns a String containing the single row constructed from an Array:
CSV.generate_line(['foo', '0']) # => "foo,0\n"
CSV extends class Array with instance method Array#to_csv
, which forms an Array into a String:
['foo', '0'].to_csv # => "foo,0\n"
“Filtering” CSV
Method CSV.filter provides a Unix-style filter for CSV data. The input data is processed to form the output data:
in_string = "foo,0\nbar,1\nbaz,2\n"
out_string = ''
CSV.filter(in_string, out_string) do |row|
row[0] = row[0].upcase
row[1] *= 4
end
out_string # => "FOO,0000\nBAR,1111\nBAZ,2222\n"
CSV Objects
There are three ways to create a CSV object:
-
Method CSV.new returns a new CSV object.
-
Method CSV.instance returns a new or cached CSV object.
-
Method CSV() also returns a new or cached CSV object.
Instance Methods
CSV has three groups of instance methods:
-
Its own internally defined instance methods.
-
Methods included by module Enumerable.
-
Methods delegated to class IO. See below.
Delegated Methods
For convenience, a CSV object will delegate to many methods in class IO. (A few have wrapper “guard code” in CSV.) You may call:
-
IO#binmode
-
#binmode?
-
IO#close
-
IO#close_read
-
IO#close_write
-
IO#closed?
-
#eof
-
#eof?
-
IO#external_encoding
-
IO#fcntl
-
IO#fileno
-
#flock
-
IO#flush
-
IO#fsync
-
IO#internal_encoding
-
#ioctl
-
IO#isatty
-
#path
-
IO#pid
-
IO#pos
-
IO#pos=
-
IO#reopen
-
#rewind
-
IO#seek
-
#stat
-
IO#string
-
IO#sync
-
IO#sync=
-
IO#tell
-
#to_i
-
#to_io
-
IO#truncate
-
IO#tty?
Options
The default values for options are:
DEFAULT_OPTIONS = {
# For both parsing and generating.
col_sep: ",",
row_sep: :auto,
quote_char: '"',
# For parsing.
field_size_limit: nil,
converters: nil,
unconverted_fields: nil,
headers: false,
return_headers: false,
header_converters: nil,
skip_blanks: false,
skip_lines: nil,
liberal_parsing: false,
nil_value: nil,
empty_value: "",
strip: false,
# For generating.
write_headers: nil,
quote_empty: true,
force_quotes: false,
write_converters: nil,
write_nil_value: nil,
write_empty_value: "",
}
Options for Parsing
Options for parsing, described in detail below, include:
-
row_sep
: Specifies the row separator; used to delimit rows. -
col_sep
: Specifies the column separator; used to delimit fields. -
quote_char
: Specifies the quote character; used to quote fields. -
field_size_limit
: Specifies the maximum field size + 1 allowed. Deprecated since 3.2.3. Usemax_field_size
instead. -
max_field_size
: Specifies the maximum field size allowed. -
converters
: Specifies the field converters to be used. -
unconverted_fields
: Specifies whether unconverted fields are to be available. -
headers
: Specifies whether data contains headers, or specifies the headers themselves. -
return_headers
: Specifies whether headers are to be returned. -
header_converters
: Specifies the header converters to be used. -
skip_blanks
: Specifies whether blanks lines are to be ignored. -
skip_lines
: Specifies how comments lines are to be recognized. -
strip
: Specifies whether leading and trailing whitespace are to be stripped from fields. This must be compatible withcol_sep
; if it is not, then anArgumentError
exception will be raised. -
liberal_parsing
: Specifies whether CSV should attempt to parse non-compliant data. -
nil_value
: Specifies the object that is to be substituted for each null (no-text) field. -
empty_value
: Specifies the object that is to be substituted for each empty field.
:include: ../doc/csv/options/common/row_sep.rdoc
:include: ../doc/csv/options/common/col_sep.rdoc
:include: ../doc/csv/options/common/quote_char.rdoc
:include: ../doc/csv/options/parsing/field_size_limit.rdoc
:include: ../doc/csv/options/parsing/converters.rdoc
:include: ../doc/csv/options/parsing/unconverted_fields.rdoc
:include: ../doc/csv/options/parsing/headers.rdoc
:include: ../doc/csv/options/parsing/return_headers.rdoc
:include: ../doc/csv/options/parsing/header_converters.rdoc
:include: ../doc/csv/options/parsing/skip_blanks.rdoc
:include: ../doc/csv/options/parsing/skip_lines.rdoc
:include: ../doc/csv/options/parsing/strip.rdoc
:include: ../doc/csv/options/parsing/liberal_parsing.rdoc
:include: ../doc/csv/options/parsing/nil_value.rdoc
:include: ../doc/csv/options/parsing/empty_value.rdoc
Options for Generating
Options for generating, described in detail below, include:
-
row_sep
: Specifies the row separator; used to delimit rows. -
col_sep
: Specifies the column separator; used to delimit fields. -
quote_char
: Specifies the quote character; used to quote fields. -
write_headers
: Specifies whether headers are to be written. -
force_quotes
: Specifies whether each output field is to be quoted. -
quote_empty
: Specifies whether each empty output field is to be quoted. -
write_converters
: Specifies the field converters to be used in writing. -
write_nil_value
: Specifies the object that is to be substituted for eachnil
-valued field. -
write_empty_value
: Specifies the object that is to be substituted for each empty field.
:include: ../doc/csv/options/common/row_sep.rdoc
:include: ../doc/csv/options/common/col_sep.rdoc
:include: ../doc/csv/options/common/quote_char.rdoc
:include: ../doc/csv/options/generating/write_headers.rdoc
:include: ../doc/csv/options/generating/force_quotes.rdoc
:include: ../doc/csv/options/generating/quote_empty.rdoc
:include: ../doc/csv/options/generating/write_converters.rdoc
:include: ../doc/csv/options/generating/write_nil_value.rdoc
:include: ../doc/csv/options/generating/write_empty_value.rdoc
CSV with Headers
CSV allows to specify column names of CSV file, whether they are in data, or provided separately. If headers are specified, reading methods return an instance of CSV::Table, consisting of CSV::Row.
# Headers are part of data
data = CSV.parse(<<~ROWS, headers: true)
Name,Department,Salary
Bob,Engineering,1000
Jane,Sales,2000
John,Management,5000
ROWS
data.class #=> CSV::Table
data.first #=> #<CSV::Row "Name":"Bob" "Department":"Engineering" "Salary":"1000">
data.first.to_h #=> {"Name"=>"Bob", "Department"=>"Engineering", "Salary"=>"1000"}
# Headers provided by developer
data = CSV.parse('Bob,Engineering,1000', headers: %i[name department salary])
data.first #=> #<CSV::Row name:"Bob" department:"Engineering" salary:"1000">
Converters
By default, each value (field or header) parsed by CSV is formed into a String. You can use a field converter or header converter to intercept and modify the parsed values:
-
See Field Converters.
-
See Header Converters.
Also by default, each value to be written during generation is written ‘as-is’. You can use a write converter to modify values before writing.
-
See Write Converters.
Specifying Converters
You can specify converters for parsing or generating in the options
argument to various CSV methods:
-
Option
converters
for converting parsed field values. -
Option
header_converters
for converting parsed header values. -
Option
write_converters
for converting values to be written (generated).
There are three forms for specifying converters:
-
A converter proc: executable code to be used for conversion.
-
A converter name: the name of a stored converter.
-
A converter list: an array of converter procs, converter names, and converter lists.
Converter Procs
This converter proc, strip_converter
, accepts a value field
and returns field.strip
:
strip_converter = proc {|field| field.strip }
In this call to CSV.parse
, the keyword argument converters: string_converter
specifies that:
-
Proc
string_converter
is to be called for each parsed field. -
The converter’s return value is to replace the
field
value.
Example:
string = " foo , 0 \n bar , 1 \n baz , 2 \n"
array = CSV.parse(string, converters: strip_converter)
array # => [["foo", "0"], ["bar", "1"], ["baz", "2"]]
A converter proc can receive a second argument, field_info
, that contains details about the field. This modified strip_converter
displays its arguments:
strip_converter = proc do |field, field_info|
p [field, field_info]
field.strip
end
string = " foo , 0 \n bar , 1 \n baz , 2 \n"
array = CSV.parse(string, converters: strip_converter)
array # => [["foo", "0"], ["bar", "1"], ["baz", "2"]]
Output:
[" foo ", #<struct CSV::FieldInfo index=0, line=1, header=nil>]
[" 0 ", #<struct CSV::FieldInfo index=1, line=1, header=nil>]
[" bar ", #<struct CSV::FieldInfo index=0, line=2, header=nil>]
[" 1 ", #<struct CSV::FieldInfo index=1, line=2, header=nil>]
[" baz ", #<struct CSV::FieldInfo index=0, line=3, header=nil>]
[" 2 ", #<struct CSV::FieldInfo index=1, line=3, header=nil>]
Each CSV::FieldInfo object shows:
-
The 0-based field index.
-
The 1-based line index.
-
The field header, if any.
Stored Converters
A converter may be given a name and stored in a structure where the parsing methods can find it by name.
The storage structure for field converters is the Hash CSV::Converters. It has several built-in converter procs:
-
:integer
: converts each String-embedded integer into a true Integer. -
:float
: converts each String-embedded float into a true Float. -
:date
: converts each String-embedded date into a true Date. -
:date_time
: converts each String-embedded date-time into a true DateTime
. This example creates a converter proc, then stores it:
strip_converter = proc {|field| field.strip }
CSV::Converters[:strip] = strip_converter
Then the parsing method call can refer to the converter by its name, :strip
:
string = " foo , 0 \n bar , 1 \n baz , 2 \n"
array = CSV.parse(string, converters: :strip)
array # => [["foo", "0"], ["bar", "1"], ["baz", "2"]]
The storage structure for header converters is the Hash CSV::HeaderConverters, which works in the same way. It also has built-in converter procs:
-
:downcase
: Downcases each header. -
:symbol
: Converts each header to a Symbol.
There is no such storage structure for write headers.
In order for the parsing methods to access stored converters in non-main-Ractors, the storage structure must be made shareable first. Therefore, Ractor.make_shareable(CSV::Converters)
and Ractor.make_shareable(CSV::HeaderConverters)
must be called before the creation of Ractors that use the converters stored in these structures. (Since making the storage structures shareable involves freezing them, any custom converters that are to be used must be added first.)
Converter Lists
A converter list is an Array that may include any assortment of:
-
Converter procs.
-
Names of stored converters.
-
Nested converter lists.
Examples:
numeric_converters = [:integer, :float]
date_converters = [:date, :date_time]
[numeric_converters, strip_converter]
[strip_converter, date_converters, :float]
Like a converter proc, a converter list may be named and stored in either CSV::Converters or CSV::HeaderConverters:
CSV::Converters[:custom] = [strip_converter, date_converters, :float]
CSV::HeaderConverters[:custom] = [:downcase, :symbol]
There are two built-in converter lists:
CSV::Converters[:numeric] # => [:integer, :float]
CSV::Converters[:all] # => [:date_time, :numeric]
Field Converters
With no conversion, all parsed fields in all rows become Strings:
string = "foo,0\nbar,1\nbaz,2\n"
ary = CSV.parse(string)
ary # => # => [["foo", "0"], ["bar", "1"], ["baz", "2"]]
When you specify a field converter, each parsed field is passed to the converter; its return value becomes the stored value for the field. A converter might, for example, convert an integer embedded in a String into a true Integer. (In fact, that’s what built-in field converter :integer
does.)
There are three ways to use field converters.
-
Using option converters with a parsing method:
ary = CSV.parse(string, converters: :integer) ary # => [0, 1, 2] # => [["foo", 0], ["bar", 1], ["baz", 2]]
-
Using option converters with a new CSV instance:
csv = CSV.new(string, converters: :integer) # Field converters in effect: csv.converters # => [:integer] csv.read # => [["foo", 0], ["bar", 1], ["baz", 2]]
-
Using method #convert to add a field converter to a CSV instance:
csv = CSV.new(string) # Add a converter. csv.convert(:integer) csv.converters # => [:integer] csv.read # => [["foo", 0], ["bar", 1], ["baz", 2]]
Installing a field converter does not affect already-read rows:
csv = CSV.new(string)
csv.shift # => ["foo", "0"]
# Add a converter.
csv.convert(:integer)
csv.converters # => [:integer]
csv.read # => [["bar", 1], ["baz", 2]]
There are additional built-in converters, and custom converters are also supported.
Built-In Field Converters
The built-in field converters are in Hash CSV::Converters:
-
Each key is a field converter name.
-
Each value is one of:
-
A Proc field converter.
-
An Array of field converter names.
-
Display:
CSV::Converters.each_pair do |name, value|
if value.kind_of?(Proc)
p [name, value.class]
else
p [name, value]
end
end
Output:
[:integer, Proc]
[:float, Proc]
[:numeric, [:integer, :float]]
[:date, Proc]
[:date_time, Proc]
[:all, [:date_time, :numeric]]
Each of these converters transcodes values to UTF-8 before attempting conversion. If a value cannot be transcoded to UTF-8 the conversion will fail and the value will remain unconverted.
Converter :integer
converts each field that Integer() accepts:
data = '0,1,2,x'
# Without the converter
csv = CSV.parse_line(data)
csv # => ["0", "1", "2", "x"]
# With the converter
csv = CSV.parse_line(data, converters: :integer)
csv # => [0, 1, 2, "x"]
Converter :float
converts each field that Float() accepts:
data = '1.0,3.14159,x'
# Without the converter
csv = CSV.parse_line(data)
csv # => ["1.0", "3.14159", "x"]
# With the converter
csv = CSV.parse_line(data, converters: :float)
csv # => [1.0, 3.14159, "x"]
Converter :numeric
converts with both :integer
and :float
..
Converter :date
converts each field that Date::parse accepts:
data = '2001-02-03,x'
# Without the converter
csv = CSV.parse_line(data)
csv # => ["2001-02-03", "x"]
# With the converter
csv = CSV.parse_line(data, converters: :date)
csv # => [#<Date: 2001-02-03 ((2451944j,0s,0n),+0s,2299161j)>, "x"]
Converter :date_time
converts each field that DateTime::parse accepts:
data = '2020-05-07T14:59:00-05:00,x'
# Without the converter
csv = CSV.parse_line(data)
csv # => ["2020-05-07T14:59:00-05:00", "x"]
# With the converter
csv = CSV.parse_line(data, converters: :date_time)
csv # => [#<DateTime: 2020-05-07T14:59:00-05:00 ((2458977j,71940s,0n),-18000s,2299161j)>, "x"]
Converter :numeric
converts with both :date_time
and :numeric
..
As seen above, method #convert adds converters to a CSV instance, and method #converters returns an Array of the converters in effect:
csv = CSV.new('0,1,2')
csv.converters # => []
csv.convert(:integer)
csv.converters # => [:integer]
csv.convert(:date)
csv.converters # => [:integer, :date]
Custom Field Converters
You can define a custom field converter:
strip_converter = proc {|field| field.strip }
string = " foo , 0 \n bar , 1 \n baz , 2 \n"
array = CSV.parse(string, converters: strip_converter)
array # => [["foo", "0"], ["bar", "1"], ["baz", "2"]]
You can register the converter in Converters Hash, which allows you to refer to it by name:
CSV::Converters[:strip] = strip_converter
string = " foo , 0 \n bar , 1 \n baz , 2 \n"
array = CSV.parse(string, converters: :strip)
array # => [["foo", "0"], ["bar", "1"], ["baz", "2"]]
Header Converters
Header converters operate only on headers (and not on other rows).
There are three ways to use header converters; these examples use built-in header converter :downcase
, which downcases each parsed header.
-
Option
header_converters
with a singleton parsing method:string = "Name,Count\nFoo,0\n,Bar,1\nBaz,2" tbl = CSV.parse(string, headers: true, header_converters: :downcase) tbl.class # => CSV::Table tbl.headers # => ["name", "count"]
-
Option
header_converters
with a new CSV instance:csv = CSV.new(string, header_converters: :downcase) # Header converters in effect: csv.header_converters # => [:downcase] tbl = CSV.parse(string, headers: true) tbl.headers # => ["Name", "Count"]
-
Method #header_convert adds a header converter to a CSV instance:
csv = CSV.new(string) # Add a header converter. csv.header_convert(:downcase) csv.header_converters # => [:downcase] tbl = CSV.parse(string, headers: true) tbl.headers # => ["Name", "Count"]
Built-In Header Converters
The built-in header converters are in Hash CSV::HeaderConverters. The keys there are the names of the converters:
CSV::HeaderConverters.keys # => [:downcase, :symbol]
Converter :downcase
converts each header by downcasing it:
string = "Name,Count\nFoo,0\n,Bar,1\nBaz,2"
tbl = CSV.parse(string, headers: true, header_converters: :downcase)
tbl.class # => CSV::Table
tbl.headers # => ["name", "count"]
Converter :symbol
converts each header by making it into a Symbol:
string = "Name,Count\nFoo,0\n,Bar,1\nBaz,2"
tbl = CSV.parse(string, headers: true, header_converters: :symbol)
tbl.headers # => [:name, :count]
Details:
-
Strips leading and trailing whitespace.
-
Downcases the header.
-
Replaces embedded spaces with underscores.
-
Removes non-word characters.
-
Makes the string into a Symbol.
Custom Header Converters
You can define a custom header converter:
upcase_converter = proc {|header| header.upcase }
string = "Name,Value\nfoo,0\nbar,1\nbaz,2\n"
table = CSV.parse(string, headers: true, header_converters: upcase_converter)
table # => #<CSV::Table mode:col_or_row row_count:4>
table.headers # => ["NAME", "VALUE"]
You can register the converter in HeaderConverters Hash, which allows you to refer to it by name:
CSV::HeaderConverters[:upcase] = upcase_converter
table = CSV.parse(string, headers: true, header_converters: :upcase)
table # => #<CSV::Table mode:col_or_row row_count:4>
table.headers # => ["NAME", "VALUE"]
Write Converters
When you specify a write converter for generating CSV, each field to be written is passed to the converter; its return value becomes the new value for the field. A converter might, for example, strip whitespace from a field.
Using no write converter (all fields unmodified):
output_string = CSV.generate do |csv|
csv << [' foo ', 0]
csv << [' bar ', 1]
csv << [' baz ', 2]
end
output_string # => " foo ,0\n bar ,1\n baz ,2\n"
Using option write_converters
with two custom write converters:
strip_converter = proc {|field| field.respond_to?(:strip) ? field.strip : field }
upcase_converter = proc {|field| field.respond_to?(:upcase) ? field.upcase : field }
write_converters = [strip_converter, upcase_converter]
output_string = CSV.generate(write_converters: write_converters) do |csv|
csv << [' foo ', 0]
csv << [' bar ', 1]
csv << [' baz ', 2]
end
output_string # => "FOO,0\nBAR,1\nBAZ,2\n"
Character Encodings (M17n or Multilingualization)
This new CSV parser is m17n savvy. The parser works in the Encoding of the IO or String object being read from or written to. Your data is never transcoded (unless you ask Ruby to transcode it for you) and will literally be parsed in the Encoding it is in. Thus CSV will return Arrays or Rows of Strings in the Encoding of your data. This is accomplished by transcoding the parser itself into your Encoding.
Some transcoding must take place, of course, to accomplish this multiencoding support. For example, :col_sep
, :row_sep
, and :quote_char
must be transcoded to match your data. Hopefully this makes the entire process feel transparent, since CSV’s defaults should just magically work for your data. However, you can set these values manually in the target Encoding to avoid the translation.
It’s also important to note that while all of CSV’s core parser is now Encoding agnostic, some features are not. For example, the built-in converters will try to transcode data to UTF-8 before making conversions. Again, you can provide custom converters that are aware of your Encodings to avoid this translation. It’s just too hard for me to support native conversions in all of Ruby’s Encodings.
Anyway, the practical side of this is simple: make sure IO and String objects passed into CSV have the proper Encoding set and everything should just work. CSV methods that allow you to open IO objects (CSV::foreach(), CSV::open(), CSV::read(), and CSV::readlines()) do allow you to specify the Encoding.
One minor exception comes when generating CSV into a String with an Encoding that is not ASCII compatible. There’s no existing data for CSV to use to prepare itself and thus you will probably need to manually specify the desired Encoding for most of those cases. It will try to guess using the fields in a row of output though, when using CSV::generate_line() or Array#to_csv().
I try to point out any other Encoding issues in the documentation of methods as they come up.
This has been tested to the best of my ability with all non-“dummy” Encodings Ruby ships with. However, it is brave new code and may have some bugs. Please feel free to report any issues you find with it.
Defined Under Namespace
Modules: InputRecordSeparator Classes: FieldInfo, FieldsConverter, InvalidEncodingError, MalformedCSVError, Parser, Row, Table, Writer
Constant Summary collapse
- DateMatcher =
A Regexp used to find and convert some common Date formats.
/ \A(?: (\w+,?\s+)?\w+\s+\d{1,2},?\s+\d{2,4} | \d{4}-\d{2}-\d{2} )\z /x
- DateTimeMatcher =
A Regexp used to find and convert some common DateTime formats.
/ \A(?: (\w+,?\s+)?\w+\s+\d{1,2}\s+\d{1,2}:\d{1,2}:\d{1,2},?\s+\d{2,4} | # ISO-8601 and RFC-3339 (space instead of T) recognized by DateTime.parse \d{4}-\d{2}-\d{2} (?:[T\s]\d{2}:\d{2}(?::\d{2}(?:\.\d+)?(?:[+-]\d{2}(?::\d{2})|Z)?)?)? )\z /x
- ConverterEncoding =
The encoding used by all converters.
Encoding.find("UTF-8")
- Converters =
A Hash containing the names and Procs for the built-in field converters. See Built-In Field Converters.
This Hash is intentionally left unfrozen, and may be extended with custom field converters. See Custom Field Converters.
{ integer: lambda { |f| Integer(f.encode(ConverterEncoding)) rescue f }, float: lambda { |f| Float(f.encode(ConverterEncoding)) rescue f }, numeric: [:integer, :float], date: lambda { |f| begin e = f.encode(ConverterEncoding) e.match?(DateMatcher) ? Date.parse(e) : f rescue # encoding conversion or date parse errors f end }, date_time: lambda { |f| begin e = f.encode(ConverterEncoding) e.match?(DateTimeMatcher) ? DateTime.parse(e) : f rescue # encoding conversion or date parse errors f end }, all: [:date_time, :numeric], }
- HeaderConverters =
A Hash containing the names and Procs for the built-in header converters. See Built-In Header Converters.
This Hash is intentionally left unfrozen, and may be extended with custom field converters. See Custom Header Converters.
{ downcase: lambda { |h| h.encode(ConverterEncoding).downcase }, symbol: lambda { |h| h.encode(ConverterEncoding).downcase.gsub(/[^\s\w]+/, "").strip. gsub(/\s+/, "_").to_sym }, symbol_raw: lambda { |h| h.encode(ConverterEncoding).to_sym } }
- DEFAULT_OPTIONS =
Default values for method options.
{ # For both parsing and generating. col_sep: ",", row_sep: :auto, quote_char: '"', # For parsing. field_size_limit: nil, max_field_size: nil, converters: nil, unconverted_fields: nil, headers: false, return_headers: false, header_converters: nil, skip_blanks: false, skip_lines: nil, liberal_parsing: false, nil_value: nil, empty_value: "", strip: false, # For generating. write_headers: nil, quote_empty: true, force_quotes: false, write_converters: nil, write_nil_value: nil, write_empty_value: "", }.freeze
- VERSION =
The version of the installed library.
"3.3.0"
Instance Attribute Summary collapse
-
#encoding ⇒ Object
readonly
:call-seq: csv.encoding -> encoding.
Class Method Summary collapse
-
.filter(input = nil, output = nil, **options) ⇒ Object
:call-seq: filter(in_string_or_io, **options) {|row| … } -> array_of_arrays or csv_table filter(in_string_or_io, out_string_or_io, **options) {|row| … } -> array_of_arrays or csv_table filter(**options) {|row| … } -> array_of_arrays or csv_table.
-
.foreach(path, mode = "r", **options, &block) ⇒ Object
:call-seq: foreach(path_or_io, mode=‘r’, **options) {|row| … ) foreach(path_or_io, mode=‘r’, **options) -> new_enumerator.
-
.generate(str = nil, **options) {|csv| ... } ⇒ Object
:call-seq: generate(csv_string, **options) {|csv| … } generate(**options) {|csv| … }.
-
.generate_line(row, **options) ⇒ Object
:call-seq: CSV.generate_line(ary) CSV.generate_line(ary, **options).
-
.generate_lines(rows, **options) ⇒ Object
:call-seq: CSV.generate_lines(rows) CSV.generate_lines(rows, **options).
-
.instance(data = $stdout, **options) ⇒ Object
:call-seq: instance(string, **options) instance(io = $stdout, **options) instance(string, **options) {|csv| … } instance(io = $stdout, **options) {|csv| … }.
-
.open(filename, mode = "r", **options) ⇒ Object
:call-seq: open(file_path, mode = “rb”, **options ) -> new_csv open(io, mode = “rb”, **options ) -> new_csv open(file_path, mode = “rb”, **options ) { |csv| … } -> object open(io, mode = “rb”, **options ) { |csv| … } -> object.
-
.parse(str, **options, &block) ⇒ Object
:call-seq: parse(string) -> array_of_arrays parse(io) -> array_of_arrays parse(string, headers: …, **options) -> csv_table parse(io, headers: …, **options) -> csv_table parse(string, **options) {|row| … } parse(io, **options) {|row| … }.
-
.parse_line(line, **options) ⇒ Object
:call-seq: CSV.parse_line(string) -> new_array or nil CSV.parse_line(io) -> new_array or nil CSV.parse_line(string, **options) -> new_array or nil CSV.parse_line(io, **options) -> new_array or nil CSV.parse_line(string, headers: true, **options) -> csv_row or nil CSV.parse_line(io, headers: true, **options) -> csv_row or nil.
-
.read(path, **options) ⇒ Object
:call-seq: read(source, **options) -> array_of_arrays read(source, headers: true, **options) -> csv_table.
-
.readlines(path, **options) ⇒ Object
:call-seq: CSV.readlines(source, **options).
-
.table(path, **options) ⇒ Object
:call-seq: CSV.table(source, **options).
Instance Method Summary collapse
-
#<<(row) ⇒ Object
(also: #add_row, #puts)
:call-seq: csv << row -> self.
- #binmode? ⇒ Boolean
-
#col_sep ⇒ Object
:call-seq: csv.col_sep -> string.
-
#convert(name = nil, &converter) ⇒ Object
:call-seq: convert(converter_name) -> array_of_procs convert {|field, field_info| … } -> array_of_procs.
-
#converters ⇒ Object
:call-seq: csv.converters -> array.
-
#each(&block) ⇒ Object
:call-seq: csv.each -> enumerator csv.each {|row| …}.
- #eof? ⇒ Boolean (also: #eof)
-
#field_size_limit ⇒ Object
:call-seq: csv.field_size_limit -> integer or nil.
- #flock(*args) ⇒ Object
-
#force_quotes? ⇒ Boolean
:call-seq: csv.force_quotes? -> true or false.
-
#header_convert(name = nil, &converter) ⇒ Object
The block need not return a String object: csv = CSV.open(path, headers: true) csv.header_convert {|header, field_info| header.to_sym } table = csv.read table.headers # => [:Name, :Value].
-
#header_converters ⇒ Object
:call-seq: csv.header_converters -> array.
-
#header_row? ⇒ Boolean
:call-seq: csv.header_row? -> true or false.
-
#headers ⇒ Object
:call-seq: csv.headers -> object.
-
#initialize(data, col_sep: ",", row_sep: :auto, quote_char: '"', field_size_limit: nil, max_field_size: nil, converters: nil, unconverted_fields: nil, headers: false, return_headers: false, write_headers: nil, header_converters: nil, skip_blanks: false, force_quotes: false, skip_lines: nil, liberal_parsing: false, internal_encoding: nil, external_encoding: nil, encoding: nil, nil_value: nil, empty_value: "", strip: false, quote_empty: true, write_converters: nil, write_nil_value: nil, write_empty_value: "") ⇒ CSV
constructor
:call-seq: CSV.new(string) CSV.new(io) CSV.new(string, **options) CSV.new(io, **options).
-
#inspect ⇒ Object
:call-seq: csv.inspect -> string.
- #ioctl(*args) ⇒ Object
-
#liberal_parsing? ⇒ Boolean
:call-seq: csv.liberal_parsing? -> true or false.
-
#line ⇒ Object
:call-seq: csv.line -> array.
-
#lineno ⇒ Object
:call-seq: csv.line_no -> integer.
-
#max_field_size ⇒ Object
:call-seq: csv.max_field_size -> integer or nil.
- #path ⇒ Object
-
#quote_char ⇒ Object
:call-seq: csv.quote_char -> character.
-
#read ⇒ Object
(also: #readlines)
:call-seq: csv.read -> array or csv_table.
-
#return_headers? ⇒ Boolean
:call-seq: csv.return_headers? -> true or false.
-
#rewind ⇒ Object
Rewinds the underlying IO object and resets CSV’s lineno() counter.
-
#row_sep ⇒ Object
:call-seq: csv.row_sep -> string.
-
#shift ⇒ Object
(also: #gets, #readline)
:call-seq: csv.shift -> array, csv_row, or nil.
-
#skip_blanks? ⇒ Boolean
:call-seq: csv.skip_blanks? -> true or false.
-
#skip_lines ⇒ Object
:call-seq: csv.skip_lines -> regexp or nil.
- #stat(*args) ⇒ Object
- #to_i ⇒ Object
- #to_io ⇒ Object
-
#unconverted_fields? ⇒ Boolean
:call-seq: csv.unconverted_fields? -> object.
-
#write_headers? ⇒ Boolean
:call-seq: csv.write_headers? -> true or false.
Constructor Details
#initialize(data, col_sep: ",", row_sep: :auto, quote_char: '"', field_size_limit: nil, max_field_size: nil, converters: nil, unconverted_fields: nil, headers: false, return_headers: false, write_headers: nil, header_converters: nil, skip_blanks: false, force_quotes: false, skip_lines: nil, liberal_parsing: false, internal_encoding: nil, external_encoding: nil, encoding: nil, nil_value: nil, empty_value: "", strip: false, quote_empty: true, write_converters: nil, write_nil_value: nil, write_empty_value: "") ⇒ CSV
:call-seq:
CSV.new(string)
CSV.new(io)
CSV.new(string, **)
CSV.new(io, **)
Returns the new CSV object created using string
or io
and the specified options
.
-
Argument
string
should be a String object; it will be put into a new StringIO object positioned at the beginning.
:include: ../doc/csv/arguments/io.rdoc
-
Argument
options
: See:For performance reasons, the options cannot be overridden in a CSV object, so those specified here will endure.
In addition to the CSV instance methods, several IO methods are delegated. See Delegated Methods.
Create a CSV object from a String object:
csv = CSV.new('foo,0')
csv # => #<CSV io_type:StringIO encoding:UTF-8 lineno:0 col_sep:"," row_sep:"\n" quote_char:"\"">
Create a CSV object from a File object:
File.write('t.csv', 'foo,0')
csv = CSV.new(File.open('t.csv'))
csv # => #<CSV io_type:File io_path:"t.csv" encoding:UTF-8 lineno:0 col_sep:"," row_sep:"\n" quote_char:"\"">
Raises an exception if the argument is nil
:
# Raises ArgumentError (Cannot parse nil as CSV):
CSV.new(nil)
1905 1906 1907 1908 1909 1910 1911 1912 1913 1914 1915 1916 1917 1918 1919 1920 1921 1922 1923 1924 1925 1926 1927 1928 1929 1930 1931 1932 1933 1934 1935 1936 1937 1938 1939 1940 1941 1942 1943 1944 1945 1946 1947 1948 1949 1950 1951 1952 1953 1954 1955 1956 1957 1958 1959 1960 1961 1962 1963 1964 1965 1966 1967 1968 1969 1970 1971 1972 1973 1974 1975 1976 1977 1978 1979 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 |
# File 'lib/csv.rb', line 1905 def initialize(data, col_sep: ",", row_sep: :auto, quote_char: '"', field_size_limit: nil, max_field_size: nil, converters: nil, unconverted_fields: nil, headers: false, return_headers: false, write_headers: nil, header_converters: nil, skip_blanks: false, force_quotes: false, skip_lines: nil, liberal_parsing: false, internal_encoding: nil, external_encoding: nil, encoding: nil, nil_value: nil, empty_value: "", strip: false, quote_empty: true, write_converters: nil, write_nil_value: nil, write_empty_value: "") raise ArgumentError.new("Cannot parse nil as CSV") if data.nil? if data.is_a?(String) if encoding if encoding.is_a?(String) data_external_encoding, data_internal_encoding = encoding.split(":", 2) if data_internal_encoding data = data.encode(data_internal_encoding, data_external_encoding) else data = data.dup.force_encoding(data_external_encoding) end else data = data.dup.force_encoding(encoding) end end @io = StringIO.new(data) else @io = data end @encoding = determine_encoding(encoding, internal_encoding) @base_fields_converter_options = { nil_value: nil_value, empty_value: empty_value, } @write_fields_converter_options = { nil_value: write_nil_value, empty_value: write_empty_value, } @initial_converters = converters @initial_header_converters = header_converters @initial_write_converters = write_converters if max_field_size.nil? and field_size_limit max_field_size = field_size_limit - 1 end @parser_options = { column_separator: col_sep, row_separator: row_sep, quote_character: quote_char, max_field_size: max_field_size, unconverted_fields: unconverted_fields, headers: headers, return_headers: return_headers, skip_blanks: skip_blanks, skip_lines: skip_lines, liberal_parsing: liberal_parsing, encoding: @encoding, nil_value: nil_value, empty_value: empty_value, strip: strip, } @parser = nil @parser_enumerator = nil @eof_error = nil @writer_options = { encoding: @encoding, force_encoding: (not encoding.nil?), force_quotes: force_quotes, headers: headers, write_headers: write_headers, column_separator: col_sep, row_separator: row_sep, quote_character: quote_char, quote_empty: quote_empty, } @writer = nil writer if @writer_options[:write_headers] end |
Instance Attribute Details
#encoding ⇒ Object (readonly)
:call-seq:
csv.encoding -> encoding
Returns the encoding used for parsing and generating; see Character Encodings (M17n or Multilingualization):
CSV.new('').encoding # => #<Encoding:UTF-8>
2192 2193 2194 |
# File 'lib/csv.rb', line 2192 def encoding @encoding end |
Class Method Details
.filter(input = nil, output = nil, **options) ⇒ Object
:call-seq:
filter(in_string_or_io, **options) {|row| ... } -> array_of_arrays or csv_table
filter(in_string_or_io, out_string_or_io, **options) {|row| ... } -> array_of_arrays or csv_table
filter(**options) {|row| ... } -> array_of_arrays or csv_table
-
Parses CSV from a source (String, IO stream, or ARGF).
-
Calls the given block with each parsed row:
-
Without headers, each row is an Array.
-
With headers, each row is a CSV::Row.
-
-
Generates CSV to an output (String, IO stream, or STDOUT).
-
Returns the parsed source:
-
Without headers, an Array of Arrays.
-
With headers, a CSV::Table.
-
When in_string_or_io
is given, but not out_string_or_io
, parses from the given in_string_or_io
and generates to STDOUT.
String input without headers:
in_string = "foo,0\nbar,1\nbaz,2"
CSV.filter(in_string) do |row|
row[0].upcase!
row[1] = - row[1].to_i
end # => [["FOO", 0], ["BAR", -1], ["BAZ", -2]]
Output (to STDOUT):
FOO,0
BAR,-1
BAZ,-2
String input with headers:
in_string = "Name,Value\nfoo,0\nbar,1\nbaz,2"
CSV.filter(in_string, headers: true) do |row|
row[0].upcase!
row[1] = - row[1].to_i
end # => #<CSV::Table mode:col_or_row row_count:4>
Output (to STDOUT):
Name,Value
FOO,0
BAR,-1
BAZ,-2
IO stream input without headers:
File.write('t.csv', "foo,0\nbar,1\nbaz,2")
File.open('t.csv') do |in_io|
CSV.filter(in_io) do |row|
row[0].upcase!
row[1] = - row[1].to_i
end
end # => [["FOO", 0], ["BAR", -1], ["BAZ", -2]]
Output (to STDOUT):
FOO,0
BAR,-1
BAZ,-2
IO stream input with headers:
File.write('t.csv', "Name,Value\nfoo,0\nbar,1\nbaz,2")
File.open('t.csv') do |in_io|
CSV.filter(in_io, headers: true) do |row|
row[0].upcase!
row[1] = - row[1].to_i
end
end # => #<CSV::Table mode:col_or_row row_count:4>
Output (to STDOUT):
Name,Value
FOO,0
BAR,-1
BAZ,-2
When both in_string_or_io
and out_string_or_io
are given, parses from in_string_or_io
and generates to out_string_or_io
.
String output without headers:
in_string = "foo,0\nbar,1\nbaz,2"
out_string = ''
CSV.filter(in_string, out_string) do |row|
row[0].upcase!
row[1] = - row[1].to_i
end # => [["FOO", 0], ["BAR", -1], ["BAZ", -2]]
out_string # => "FOO,0\nBAR,-1\nBAZ,-2\n"
String output with headers:
in_string = "Name,Value\nfoo,0\nbar,1\nbaz,2"
out_string = ''
CSV.filter(in_string, out_string, headers: true) do |row|
row[0].upcase!
row[1] = - row[1].to_i
end # => #<CSV::Table mode:col_or_row row_count:4>
out_string # => "Name,Value\nFOO,0\nBAR,-1\nBAZ,-2\n"
IO stream output without headers:
in_string = "foo,0\nbar,1\nbaz,2"
File.open('t.csv', 'w') do |out_io|
CSV.filter(in_string, out_io) do |row|
row[0].upcase!
row[1] = - row[1].to_i
end
end # => [["FOO", 0], ["BAR", -1], ["BAZ", -2]]
File.read('t.csv') # => "FOO,0\nBAR,-1\nBAZ,-2\n"
IO stream output with headers:
in_string = "Name,Value\nfoo,0\nbar,1\nbaz,2"
File.open('t.csv', 'w') do |out_io|
CSV.filter(in_string, out_io, headers: true) do |row|
row[0].upcase!
row[1] = - row[1].to_i
end
end # => #<CSV::Table mode:col_or_row row_count:4>
File.read('t.csv') # => "Name,Value\nFOO,0\nBAR,-1\nBAZ,-2\n"
When neither in_string_or_io
nor out_string_or_io
given, parses from ARGF and generates to STDOUT.
Without headers:
# Put Ruby code into a file.
ruby = <<-EOT
require 'csv'
CSV.filter do |row|
row[0].upcase!
row[1] = - row[1].to_i
end
EOT
File.write('t.rb', ruby)
# Put some CSV into a file.
File.write('t.csv', "foo,0\nbar,1\nbaz,2")
# Run the Ruby code with CSV filename as argument.
system(Gem.ruby, "t.rb", "t.csv")
Output (to STDOUT):
FOO,0
BAR,-1
BAZ,-2
With headers:
# Put Ruby code into a file.
ruby = <<-EOT
require 'csv'
CSV.filter(headers: true) do |row|
row[0].upcase!
row[1] = - row[1].to_i
end
EOT
File.write('t.rb', ruby)
# Put some CSV into a file.
File.write('t.csv', "Name,Value\nfoo,0\nbar,1\nbaz,2")
# Run the Ruby code with CSV filename as argument.
system(Gem.ruby, "t.rb", "t.csv")
Output (to STDOUT):
Name,Value
FOO,0
BAR,-1
BAZ,-2
Arguments:
-
Argument
in_string_or_io
must be a String or an IO stream. -
Argument
out_string_or_io
must be a String or an IO stream. -
Arguments
**options
must be keyword options. See Options for Parsing.
1202 1203 1204 1205 1206 1207 1208 1209 1210 1211 1212 1213 1214 1215 1216 1217 1218 1219 1220 1221 1222 1223 1224 1225 1226 1227 1228 1229 1230 1231 1232 1233 1234 1235 1236 1237 1238 1239 1240 1241 1242 1243 1244 |
# File 'lib/csv.rb', line 1202 def filter(input=nil, output=nil, **) # parse options for input, output, or both , = Hash.new, {row_sep: InputRecordSeparator.value} .each do |key, value| case key when /\Ain(?:put)?_(.+)\Z/ [$1.to_sym] = value when /\Aout(?:put)?_(.+)\Z/ [$1.to_sym] = value else [key] = value [key] = value end end # build input and output wrappers input = new(input || ARGF, **) output = new(output || $stdout, **) # process headers need_manual_header_output = ([:headers] and [:headers] == true and [:write_headers]) if need_manual_header_output first_row = input.shift if first_row if first_row.is_a?(Row) headers = first_row.headers yield headers output << headers end yield first_row output << first_row end end # read, yield, write input.each do |row| yield row output << row end end |
.foreach(path, mode = "r", **options, &block) ⇒ Object
:call-seq:
foreach(path_or_io, mode='r', **options) {|row| ... )
foreach(path_or_io, mode='r', **options) -> new_enumerator
Calls the block with each row read from source path_or_io
.
Path input without headers:
string = "foo,0\nbar,1\nbaz,2\n"
in_path = 't.csv'
File.write(in_path, string)
CSV.foreach(in_path) {|row| p row }
Output:
["foo", "0"]
["bar", "1"]
["baz", "2"]
Path input with headers:
string = "Name,Value\nfoo,0\nbar,1\nbaz,2\n"
in_path = 't.csv'
File.write(in_path, string)
CSV.foreach(in_path, headers: true) {|row| p row }
Output:
<CSV::Row "Name":"foo" "Value":"0">
<CSV::Row "Name":"bar" "Value":"1">
<CSV::Row "Name":"baz" "Value":"2">
IO stream input without headers:
string = "foo,0\nbar,1\nbaz,2\n"
path = 't.csv'
File.write(path, string)
File.open('t.csv') do |in_io|
CSV.foreach(in_io) {|row| p row }
end
Output:
["foo", "0"]
["bar", "1"]
["baz", "2"]
IO stream input with headers:
string = "Name,Value\nfoo,0\nbar,1\nbaz,2\n"
path = 't.csv'
File.write(path, string)
File.open('t.csv') do |in_io|
CSV.foreach(in_io, headers: true) {|row| p row }
end
Output:
<CSV::Row "Name":"foo" "Value":"0">
<CSV::Row "Name":"bar" "Value":"1">
<CSV::Row "Name":"baz" "Value":"2">
With no block given, returns an Enumerator:
string = "foo,0\nbar,1\nbaz,2\n"
path = 't.csv'
File.write(path, string)
CSV.foreach(path) # => #<Enumerator: CSV:foreach("t.csv", "r")>
Arguments:
-
Argument
path_or_io
must be a file path or an IO stream. -
Argument
mode
, if given, must be a File mode. See Access Modes. -
Arguments
**options
must be keyword options. See Options for Parsing. -
This method optionally accepts an additional
:encoding
option that you can use to specify the Encoding of the data read frompath
orio
. You must provide this unless your data is in the encoding given byEncoding::default_external
. Parsing will use this to determine how to parse the data. You may provide a second Encoding to have the data transcoded as it is read. For example,encoding: 'UTF-32BE:UTF-8'
would read
UTF-32BE
data from the file but transcode it toUTF-8
before parsing.
1332 1333 1334 1335 1336 1337 |
# File 'lib/csv.rb', line 1332 def foreach(path, mode="r", **, &block) return to_enum(__method__, path, mode, **) unless block_given? open(path, mode, **) do |csv| csv.each(&block) end end |
.generate(str = nil, **options) {|csv| ... } ⇒ Object
:call-seq:
generate(csv_string, **options) {|csv| ... }
generate(**options) {|csv| ... }
-
Argument
csv_string
, if given, must be a String object; defaults to a new empty String. -
Arguments
options
, if given, should be generating options. See Options for Generating.
Creates a new CSV object via CSV.new(csv_string, **options)
; calls the block with the CSV object, which the block may modify; returns the String generated from the CSV object.
Note that a passed String is modified by this method. Pass csv_string
.dup if the String must be preserved.
This method has one additional option: :encoding
, which sets the base Encoding for the output if no no str
is specified. CSV needs this hint if you plan to output non-ASCII compatible data.
Add lines:
input_string = "foo,0\nbar,1\nbaz,2\n"
output_string = CSV.generate(input_string) do |csv|
csv << ['bat', 3]
csv << ['bam', 4]
end
output_string # => "foo,0\nbar,1\nbaz,2\nbat,3\nbam,4\n"
input_string # => "foo,0\nbar,1\nbaz,2\nbat,3\nbam,4\n"
output_string.equal?(input_string) # => true # Same string, modified
Add lines into new string, preserving old string:
input_string = "foo,0\nbar,1\nbaz,2\n"
output_string = CSV.generate(input_string.dup) do |csv|
csv << ['bat', 3]
csv << ['bam', 4]
end
output_string # => "foo,0\nbar,1\nbaz,2\nbat,3\nbam,4\n"
input_string # => "foo,0\nbar,1\nbaz,2\n"
output_string.equal?(input_string) # => false # Different strings
Create lines from nothing:
output_string = CSV.generate do |csv|
csv << ['foo', 0]
csv << ['bar', 1]
csv << ['baz', 2]
end
output_string # => "foo,0\nbar,1\nbaz,2\n"
Raises an exception if csv_string
is not a String object:
# Raises TypeError (no implicit conversion of Integer into String)
CSV.generate(0)
1398 1399 1400 1401 1402 1403 1404 1405 1406 1407 1408 1409 1410 1411 1412 |
# File 'lib/csv.rb', line 1398 def generate(str=nil, **) encoding = [:encoding] # add a default empty String, if none was given if str str = StringIO.new(str) str.seek(0, IO::SEEK_END) str.set_encoding(encoding) if encoding else str = +"" str.force_encoding(encoding) if encoding end csv = new(str, **) # wrap yield csv # yield for appending csv.string # return final String end |
.generate_line(row, **options) ⇒ Object
:call-seq:
CSV.generate_line(ary)
CSV.generate_line(ary, **)
Returns the String created by generating CSV from ary
using the specified options
.
Argument ary
must be an Array.
Special options:
-
Option
:row_sep
defaults to"\n"> on Ruby 3.0 or later and <tt>$INPUT_RECORD_SEPARATOR
($/
) otherwise.:$INPUT_RECORD_SEPARATOR # => "\n"
-
This method accepts an additional option,
:encoding
, which sets the base Encoding for the output. This method will try to guess your Encoding from the first non-nil
field inrow
, if possible, but you may need to use this parameter as a backup plan.
For other options
, see Options for Generating.
Returns the String generated from an Array:
CSV.generate_line(['foo', '0']) # => "foo,0\n"
Raises an exception if ary
is not an Array:
# Raises NoMethodError (undefined method `find' for :foo:Symbol)
CSV.generate_line(:foo)
1446 1447 1448 1449 1450 1451 1452 1453 1454 1455 1456 1457 1458 1459 1460 1461 1462 1463 1464 1465 1466 1467 |
# File 'lib/csv.rb', line 1446 def generate_line(row, **) = {row_sep: InputRecordSeparator.value}.merge() str = +"" if [:encoding] str.force_encoding([:encoding]) else fallback_encoding = nil output_encoding = nil row.each do |field| next unless field.is_a?(String) fallback_encoding ||= field.encoding next if field.ascii_only? output_encoding = field.encoding break end output_encoding ||= fallback_encoding if output_encoding str.force_encoding(output_encoding) end end (new(str, **) << row).string end |
.generate_lines(rows, **options) ⇒ Object
:call-seq:
CSV.generate_lines(rows)
CSV.generate_lines(rows, **)
Returns the String created by generating CSV from using the specified options
.
Argument rows
must be an Array of row. Row is Array of String or CSV::Row.
Special options:
-
Option
:row_sep
defaults to"\n"
on Ruby 3.0 or later and$INPUT_RECORD_SEPARATOR
($/
) otherwise.:$INPUT_RECORD_SEPARATOR # => "\n"
-
This method accepts an additional option,
:encoding
, which sets the base Encoding for the output. This method will try to guess your Encoding from the first non-nil
field inrow
, if possible, but you may need to use this parameter as a backup plan.
For other options
, see Options for Generating.
Returns the String generated from an
CSV.generate_lines([['foo', '0'], ['bar', '1'], ['baz', '2']]) # => "foo,0\nbar,1\nbaz,2\n"
Raises an exception
# Raises NoMethodError (undefined method `each' for :foo:Symbol)
CSV.generate_lines(:foo)
1501 1502 1503 1504 1505 1506 1507 |
# File 'lib/csv.rb', line 1501 def generate_lines(rows, **) self.generate(**) do |csv| rows.each do |row| csv << row end end end |
.instance(data = $stdout, **options) ⇒ Object
:call-seq:
instance(string, **options)
instance(io = $stdout, **options)
instance(string, **options) {|csv| ... }
instance(io = $stdout, **options) {|csv| ... }
Creates or retrieves cached CSV objects. For arguments and options, see CSV.new.
This API is not Ractor-safe.
With no block given, returns a CSV object.
The first call to instance
creates and caches a CSV object:
s0 = 's0'
csv0 = CSV.instance(s0)
csv0.class # => CSV
Subsequent calls to instance
with that same string
or io
retrieve that same cached object:
csv1 = CSV.instance(s0)
csv1.class # => CSV
csv1.equal?(csv0) # => true # Same CSV object
A subsequent call to instance
with a different string
or io
creates and caches a different CSV object.
s1 = 's1'
csv2 = CSV.instance(s1)
csv2.equal?(csv0) # => false # Different CSV object
All the cached objects remains available:
csv3 = CSV.instance(s0)
csv3.equal?(csv0) # true # Same CSV object
csv4 = CSV.instance(s1)
csv4.equal?(csv2) # true # Same CSV object
When a block is given, calls the block with the created or retrieved CSV object; returns the block’s return value:
CSV.instance(s0) {|csv| :foo } # => :foo
1006 1007 1008 1009 1010 1011 1012 1013 1014 1015 1016 1017 1018 1019 1020 |
# File 'lib/csv.rb', line 1006 def instance(data = $stdout, **) # create a _signature_ for this method call, data object and options sig = [data.object_id] + .values_at(*DEFAULT_OPTIONS.keys) # fetch or create the instance for this signature @@instances ||= Hash.new instance = (@@instances[sig] ||= new(data, **)) if block_given? yield instance # run block, if given, returning result else instance # or return the instance end end |
.open(filename, mode = "r", **options) ⇒ Object
:call-seq:
open(file_path, mode = "rb", **options ) -> new_csv
open(io, mode = "rb", **options ) -> new_csv
open(file_path, mode = "rb", **options ) { |csv| ... } -> object
open(io, mode = "rb", **options ) { |csv| ... } -> object
possible options elements:
keyword form:
:invalid => nil # raise error on invalid byte sequence (default)
:invalid => :replace # replace invalid byte sequence
:undef => :replace # replace undefined conversion
:replace => string # replacement string ("?" or "\uFFFD" if not specified)
-
Argument
path
, if given, must be the path to a file.
:include: ../doc/csv/arguments/io.rdoc
-
Argument
mode
, if given, must be a File mode. See Access Modes. -
Arguments
**options
must be keyword options. See Options for Generating. -
This method optionally accepts an additional
:encoding
option that you can use to specify the Encoding of the data read frompath
orio
. You must provide this unless your data is in the encoding given byEncoding::default_external
. Parsing will use this to determine how to parse the data. You may provide a second Encoding to have the data transcoded as it is read. For example,encoding: 'UTF-32BE:UTF-8'
would read
UTF-32BE
data from the file but transcode it toUTF-8
before parsing.
These examples assume prior execution of:
string = "foo,0\nbar,1\nbaz,2\n"
path = 't.csv'
File.write(path, string)
With no block given, returns a new CSV object.
Create a CSV object using a file path:
csv = CSV.open(path)
csv # => #<CSV io_type:File io_path:"t.csv" encoding:UTF-8 lineno:0 col_sep:"," row_sep:"\n" quote_char:"\"">
Create a CSV object using an open File:
csv = CSV.open(File.open(path))
csv # => #<CSV io_type:File io_path:"t.csv" encoding:UTF-8 lineno:0 col_sep:"," row_sep:"\n" quote_char:"\"">
With a block given, calls the block with the created CSV object; returns the block’s return value:
Using a file path:
csv = CSV.open(path) {|csv| p csv}
csv # => #<CSV io_type:File io_path:"t.csv" encoding:UTF-8 lineno:0 col_sep:"," row_sep:"\n" quote_char:"\"">
Output:
#<CSV io_type:File io_path:"t.csv" encoding:UTF-8 lineno:0 col_sep:"," row_sep:"\n" quote_char:"\"">
Using an open File:
csv = CSV.open(File.open(path)) {|csv| p csv}
csv # => #<CSV io_type:File io_path:"t.csv" encoding:UTF-8 lineno:0 col_sep:"," row_sep:"\n" quote_char:"\"">
Output:
#<CSV io_type:File io_path:"t.csv" encoding:UTF-8 lineno:0 col_sep:"," row_sep:"\n" quote_char:"\"">
Raises an exception if the argument is not a String object or IO object:
# Raises TypeError (no implicit conversion of Symbol into String)
CSV.open(:foo)
1581 1582 1583 1584 1585 1586 1587 1588 1589 1590 1591 1592 1593 1594 1595 1596 1597 1598 1599 1600 1601 1602 1603 1604 1605 1606 1607 1608 1609 1610 1611 1612 1613 1614 1615 1616 1617 1618 |
# File 'lib/csv.rb', line 1581 def open(filename, mode="r", **) # wrap a File opened with the remaining +args+ with no newline # decorator file_opts = .dup unless file_opts.key?(:newline) file_opts[:universal_newline] ||= false end .delete(:invalid) .delete(:undef) .delete(:replace) .delete_if {|k, _| /newline\z/.match?(k)} begin f = File.open(filename, mode, **file_opts) rescue ArgumentError => e raise unless /needs binmode/.match?(e.) and mode == "r" mode = "rb" file_opts = {encoding: Encoding.default_external}.merge(file_opts) retry end begin csv = new(f, **) rescue Exception f.close raise end # handle blocks like Ruby's open(), not like the CSV library if block_given? begin yield csv ensure csv.close end else csv end end |
.parse(str, **options, &block) ⇒ Object
:call-seq:
parse(string) -> array_of_arrays
parse(io) -> array_of_arrays
parse(string, headers: ..., **options) -> csv_table
parse(io, headers: ..., **options) -> csv_table
parse(string, **options) {|row| ... }
parse(io, **options) {|row| ... }
Parses string
or io
using the specified options
.
-
Argument
string
should be a String object; it will be put into a new StringIO object positioned at the beginning.
:include: ../doc/csv/arguments/io.rdoc
-
Argument
options
: see Options for Parsing
Without Option headers
These examples assume prior execution of:
string = "foo,0\nbar,1\nbaz,2\n"
path = 't.csv'
File.write(path, string)
With no block given, returns an Array of Arrays formed from the source.
Parse a String:
a_of_a = CSV.parse(string)
a_of_a # => [["foo", "0"], ["bar", "1"], ["baz", "2"]]
Parse an open File:
a_of_a = File.open(path) do |file|
CSV.parse(file)
end
a_of_a # => [["foo", "0"], ["bar", "1"], ["baz", "2"]]
With a block given, calls the block with each parsed row:
Parse a String:
CSV.parse(string) {|row| p row }
Output:
["foo", "0"]
["bar", "1"]
["baz", "2"]
Parse an open File:
File.open(path) do |file|
CSV.parse(file) {|row| p row }
end
Output:
["foo", "0"]
["bar", "1"]
["baz", "2"]
With Option headers
These examples assume prior execution of:
string = "Name,Count\nfoo,0\nbar,1\nbaz,2\n"
path = 't.csv'
File.write(path, string)
With no block given, returns a CSV::Table object formed from the source.
Parse a String:
csv_table = CSV.parse(string, headers: ['Name', 'Count'])
csv_table # => #<CSV::Table mode:col_or_row row_count:5>
Parse an open File:
csv_table = File.open(path) do |file|
CSV.parse(file, headers: ['Name', 'Count'])
end
csv_table # => #<CSV::Table mode:col_or_row row_count:4>
With a block given, calls the block with each parsed row, which has been formed into a CSV::Row object:
Parse a String:
CSV.parse(string, headers: ['Name', 'Count']) {|row| p row }
Output:
# <CSV::Row "Name":"foo" "Count":"0">
# <CSV::Row "Name":"bar" "Count":"1">
# <CSV::Row "Name":"baz" "Count":"2">
Parse an open File:
File.open(path) do |file|
CSV.parse(file, headers: ['Name', 'Count']) {|row| p row }
end
Output:
# <CSV::Row "Name":"foo" "Count":"0">
# <CSV::Row "Name":"bar" "Count":"1">
# <CSV::Row "Name":"baz" "Count":"2">
Raises an exception if the argument is not a String object or IO object:
# Raises NoMethodError (undefined method `close' for :foo:Symbol)
CSV.parse(:foo)
1732 1733 1734 1735 1736 1737 1738 1739 1740 1741 1742 1743 |
# File 'lib/csv.rb', line 1732 def parse(str, **, &block) csv = new(str, **) return csv.each(&block) if block_given? # slurp contents, if no block is given begin csv.read ensure csv.close end end |
.parse_line(line, **options) ⇒ Object
:call-seq:
CSV.parse_line(string) -> new_array or nil
CSV.parse_line(io) -> new_array or nil
CSV.parse_line(string, **options) -> new_array or nil
CSV.parse_line(io, **options) -> new_array or nil
CSV.parse_line(string, headers: true, **options) -> csv_row or nil
CSV.parse_line(io, headers: true, **options) -> csv_row or nil
Returns the data created by parsing the first line of string
or io
using the specified options
.
-
Argument
string
should be a String object; it will be put into a new StringIO object positioned at the beginning.
:include: ../doc/csv/arguments/io.rdoc
-
Argument
options
: see Options for Parsing
Without Option headers
Without option headers
, returns the first row as a new Array.
These examples assume prior execution of:
string = "foo,0\nbar,1\nbaz,2\n"
path = 't.csv'
File.write(path, string)
Parse the first line from a String object:
CSV.parse_line(string) # => ["foo", "0"]
Parse the first line from a File object:
File.open(path) do |file|
CSV.parse_line(file) # => ["foo", "0"]
end # => ["foo", "0"]
Returns nil
if the argument is an empty String:
CSV.parse_line('') # => nil
With Option headers
With headers
, returns the first row as a CSV::Row object.
These examples assume prior execution of:
string = "Name,Count\nfoo,0\nbar,1\nbaz,2\n"
path = 't.csv'
File.write(path, string)
Parse the first line from a String object:
CSV.parse_line(string, headers: true) # => #<CSV::Row "Name":"foo" "Count":"0">
Parse the first line from a File object:
File.open(path) do |file|
CSV.parse_line(file, headers: true)
end # => #<CSV::Row "Name":"foo" "Count":"0">
Raises an exception if the argument is nil
:
# Raises ArgumentError (Cannot parse nil as CSV):
CSV.parse_line(nil)
1805 1806 1807 |
# File 'lib/csv.rb', line 1805 def parse_line(line, **) new(line, **).each.first end |
.read(path, **options) ⇒ Object
:call-seq:
read(source, **options) -> array_of_arrays
read(source, headers: true, **options) -> csv_table
Opens the given source
with the given options
(see CSV.open), reads the source (see CSV#read), and returns the result, which will be either an Array of Arrays or a CSV::Table.
Without headers:
string = "foo,0\nbar,1\nbaz,2\n"
path = 't.csv'
File.write(path, string)
CSV.read(path) # => [["foo", "0"], ["bar", "1"], ["baz", "2"]]
With headers:
string = "Name,Value\nfoo,0\nbar,1\nbaz,2\n"
path = 't.csv'
File.write(path, string)
CSV.read(path, headers: true) # => #<CSV::Table mode:col_or_row row_count:4>
1829 1830 1831 |
# File 'lib/csv.rb', line 1829 def read(path, **) open(path, **) { |csv| csv.read } end |
.readlines(path, **options) ⇒ Object
1837 1838 1839 |
# File 'lib/csv.rb', line 1837 def readlines(path, **) read(path, **) end |
.table(path, **options) ⇒ Object
:call-seq:
CSV.table(source, **)
Calls CSV.read with source
, options
, and certain default options:
-
headers
:true
-
converters
::numeric
-
header_converters
::symbol
Returns a CSV::Table object.
Example:
string = "Name,Value\nfoo,0\nbar,1\nbaz,2\n"
path = 't.csv'
File.write(path, string)
CSV.table(path) # => #<CSV::Table mode:col_or_row row_count:4>
1856 1857 1858 1859 1860 1861 1862 1863 1864 |
# File 'lib/csv.rb', line 1856 def table(path, **) = { headers: true, converters: :numeric, header_converters: :symbol, } = .merge() read(path, **) end |
Instance Method Details
#<<(row) ⇒ Object Also known as: add_row, puts
:call-seq:
csv << row -> self
Appends a row to self
.
-
Argument
row
must be an Array object or a CSV::Row object. -
The output stream must be open for writing.
Append Arrays:
CSV.generate do |csv|
csv << ['foo', 0]
csv << ['bar', 1]
csv << ['baz', 2]
end # => "foo,0\nbar,1\nbaz,2\n"
Append CSV::Rows:
headers = []
CSV.generate do |csv|
csv << CSV::Row.new(headers, ['foo', 0])
csv << CSV::Row.new(headers, ['bar', 1])
csv << CSV::Row.new(headers, ['baz', 2])
end # => "foo,0\nbar,1\nbaz,2\n"
Headers in CSV::Row objects are not appended:
headers = ['Name', 'Count']
CSV.generate do |csv|
csv << CSV::Row.new(headers, ['foo', 0])
csv << CSV::Row.new(headers, ['bar', 1])
csv << CSV::Row.new(headers, ['baz', 2])
end # => "foo,0\nbar,1\nbaz,2\n"
Raises an exception if row
is not an Array or CSV::Row:
CSV.generate do |csv|
# Raises NoMethodError (undefined method `collect' for :foo:Symbol)
csv << :foo
end
Raises an exception if the output stream is not opened for writing:
path = 't.csv'
File.write(path, '')
File.open(path) do |file|
CSV.open(file) do |csv|
# Raises IOError (not opened for writing)
csv << ['foo', 0]
end
end
2372 2373 2374 2375 |
# File 'lib/csv.rb', line 2372 def <<(row) writer << row self end |
#binmode? ⇒ Boolean
2261 2262 2263 2264 2265 2266 2267 |
# File 'lib/csv.rb', line 2261 def binmode? if @io.respond_to?(:binmode?) @io.binmode? else false end end |
#col_sep ⇒ Object
2009 2010 2011 |
# File 'lib/csv.rb', line 2009 def col_sep parser.column_separator end |
#convert(name = nil, &converter) ⇒ Object
:call-seq:
convert(converter_name) -> array_of_procs
convert {|field, field_info| ... } -> array_of_procs
-
With no block, installs a field converter (a Proc).
-
With a block, defines and installs a custom field converter.
-
Returns the Array of installed field converters.
-
Argument
converter_name
, if given, should be the name of an existing field converter.
See Field Converters.
With no block, installs a field converter:
csv = CSV.new('')
csv.convert(:integer)
csv.convert(:float)
csv.convert(:date)
csv.converters # => [:integer, :float, :date]
The block, if given, is called for each field:
-
Argument
field
is the field value. -
Argument
field_info
is a CSV::FieldInfo object containing details about the field.
The examples here assume the prior execution of:
string = "foo,0\nbar,1\nbaz,2\n"
path = 't.csv'
File.write(path, string)
Example giving a block:
csv = CSV.open(path)
csv.convert {|field, field_info| p [field, field_info]; field.upcase }
csv.read # => [["FOO", "0"], ["BAR", "1"], ["BAZ", "2"]]
Output:
["foo", #<struct CSV::FieldInfo index=0, line=1, header=nil>]
["0", #<struct CSV::FieldInfo index=1, line=1, header=nil>]
["bar", #<struct CSV::FieldInfo index=0, line=2, header=nil>]
["1", #<struct CSV::FieldInfo index=1, line=2, header=nil>]
["baz", #<struct CSV::FieldInfo index=0, line=3, header=nil>]
["2", #<struct CSV::FieldInfo index=1, line=3, header=nil>]
The block need not return a String object:
csv = CSV.open(path)
csv.convert {|field, field_info| field.to_sym }
csv.read # => [[:foo, :"0"], [:bar, :"1"], [:baz, :"2"]]
If converter_name
is given, the block is not called:
csv = CSV.open(path)
csv.convert(:integer) {|field, field_info| fail 'Cannot happen' }
csv.read # => [["foo", 0], ["bar", 1], ["baz", 2]]
Raises a parse-time exception if converter_name
is not the name of a built-in field converter:
csv = CSV.open(path)
csv.convert(:nosuch) => [nil]
# Raises NoMethodError (undefined method `arity' for nil:NilClass)
csv.read
2443 2444 2445 |
# File 'lib/csv.rb', line 2443 def convert(name = nil, &converter) parser_fields_converter.add_converter(name, &converter) end |
#converters ⇒ Object
:call-seq:
csv.converters -> array
Returns an Array containing field converters; see Field Converters:
csv = CSV.new('')
csv.converters # => []
csv.convert(:integer)
csv.converters # => [:integer]
csv.convert(proc {|x| x.to_s })
csv.converters
Notes that you need to call Ractor.make_shareable(CSV::Converters) on the main Ractor to use this method.
2082 2083 2084 2085 2086 2087 |
# File 'lib/csv.rb', line 2082 def converters parser_fields_converter.map do |converter| name = Converters.rassoc(converter) name ? name.first : converter end end |
#each(&block) ⇒ Object
:call-seq:
csv.each -> enumerator
csv.each {|row| ...}
Calls the block with each successive row. The data source must be opened for reading.
Without headers:
string = "foo,0\nbar,1\nbaz,2\n"
csv = CSV.new(string)
csv.each do |row|
p row
end
Output:
["foo", "0"]
["bar", "1"]
["baz", "2"]
With headers:
string = "Name,Value\nfoo,0\nbar,1\nbaz,2\n"
csv = CSV.new(string, headers: true)
csv.each do |row|
p row
end
Output:
<CSV::Row "Name":"foo" "Value":"0">
<CSV::Row "Name":"bar" "Value":"1">
<CSV::Row "Name":"baz" "Value":"2">
Raises an exception if the source is not opened for reading:
string = "foo,0\nbar,1\nbaz,2\n"
csv = CSV.new(string)
csv.close
# Raises IOError (not opened for reading)
csv.each do |row|
p row
end
2554 2555 2556 2557 2558 2559 2560 2561 2562 |
# File 'lib/csv.rb', line 2554 def each(&block) return to_enum(__method__) unless block_given? begin while true yield(parser_enumerator.next) end rescue StopIteration end end |
#eof? ⇒ Boolean Also known as: eof
2297 2298 2299 2300 2301 2302 2303 2304 2305 2306 2307 2308 |
# File 'lib/csv.rb', line 2297 def eof? return false if @eof_error begin parser_enumerator.peek false rescue MalformedCSVError => error @eof_error = error false rescue StopIteration true end end |
#field_size_limit ⇒ Object
2041 2042 2043 |
# File 'lib/csv.rb', line 2041 def field_size_limit parser.field_size_limit end |
#flock(*args) ⇒ Object
2269 2270 2271 2272 |
# File 'lib/csv.rb', line 2269 def flock(*args) raise NotImplementedError unless @io.respond_to?(:flock) @io.flock(*args) end |
#force_quotes? ⇒ Boolean
2172 2173 2174 |
# File 'lib/csv.rb', line 2172 def force_quotes? @writer_options[:force_quotes] end |
#header_convert(name = nil, &converter) ⇒ Object
The block need not return a String object:
csv = CSV.open(path, headers: true)
csv.header_convert {|header, field_info| header.to_sym }
table = csv.read
table.headers # => [:Name, :Value]
If converter_name
is given, the block is not called:
csv = CSV.open(path, headers: true)
csv.header_convert(:downcase) {|header, field_info| fail 'Cannot happen' }
table = csv.read
table.headers # => ["name", "value"]
Raises a parse-time exception if converter_name
is not the name of a built-in field converter:
csv = CSV.open(path, headers: true)
csv.header_convert(:nosuch)
# Raises NoMethodError (undefined method `arity' for nil:NilClass)
csv.read
2509 2510 2511 |
# File 'lib/csv.rb', line 2509 def header_convert(name = nil, &converter) header_fields_converter.add_converter(name, &converter) end |
#header_converters ⇒ Object
:call-seq:
csv.header_converters -> array
Returns an Array containing header converters; used for parsing; see Header Converters:
CSV.new('').header_converters # => []
Notes that you need to call Ractor.make_shareable(CSV::HeaderConverters) on the main Ractor to use this method.
2148 2149 2150 2151 2152 2153 |
# File 'lib/csv.rb', line 2148 def header_converters header_fields_converter.map do |converter| name = HeaderConverters.rassoc(converter) name ? name.first : converter end end |
#header_row? ⇒ Boolean
:call-seq:
csv.header_row? -> true or false
Returns true
if the next row to be read is a header row; false
otherwise.
Without headers:
string = "foo,0\nbar,1\nbaz,2\n"
csv = CSV.new(string)
csv.header_row? # => false
With headers:
string = "Name,Value\nfoo,0\nbar,1\nbaz,2\n"
csv = CSV.new(string, headers: true)
csv.header_row? # => true
csv.shift # => #<CSV::Row "Name":"foo" "Value":"0">
csv.header_row? # => false
Raises an exception if the source is not opened for reading:
string = "foo,0\nbar,1\nbaz,2\n"
csv = CSV.new(string)
csv.close
# Raises IOError (not opened for reading)
csv.header_row?
2631 2632 2633 |
# File 'lib/csv.rb', line 2631 def header_row? parser.header_row? end |
#headers ⇒ Object
2106 2107 2108 2109 2110 2111 2112 2113 2114 2115 2116 |
# File 'lib/csv.rb', line 2106 def headers if @writer @writer.headers else parsed_headers = parser.headers return parsed_headers if parsed_headers raw_headers = @parser_options[:headers] raw_headers = nil if raw_headers == false raw_headers end end |
#inspect ⇒ Object
:call-seq:
csv.inspect -> string
Returns a String showing certain properties of self
:
string = "Name,Value\nfoo,0\nbar,1\nbaz,2\n"
csv = CSV.new(string, headers: true)
s = csv.inspect
s # => "#<CSV io_type:StringIO encoding:UTF-8 lineno:0 col_sep:\",\" row_sep:\"\\n\" quote_char:\"\\\"\" headers:true>"
2690 2691 2692 2693 2694 2695 2696 2697 2698 2699 2700 2701 2702 2703 2704 2705 2706 2707 2708 2709 2710 2711 2712 2713 2714 2715 2716 2717 2718 2719 2720 2721 2722 2723 2724 2725 2726 |
# File 'lib/csv.rb', line 2690 def inspect str = ["#<", self.class.to_s, " io_type:"] # show type of wrapped IO if @io == $stdout then str << "$stdout" elsif @io == $stdin then str << "$stdin" elsif @io == $stderr then str << "$stderr" else str << @io.class.to_s end # show IO.path(), if available if @io.respond_to?(:path) and (p = @io.path) str << " io_path:" << p.inspect end # show encoding str << " encoding:" << @encoding.name # show other attributes ["lineno", "col_sep", "row_sep", "quote_char"].each do |attr_name| if a = __send__(attr_name) str << " " << attr_name << ":" << a.inspect end end ["skip_blanks", "liberal_parsing"].each do |attr_name| if a = __send__("#{attr_name}?") str << " " << attr_name << ":" << a.inspect end end _headers = headers str << " headers:" << _headers.inspect if _headers str << ">" begin str.join('') rescue # any encoding error str.map do |s| e = Encoding::Converter.asciicompat_encoding(s.encoding) e ? s.encode(e) : s.force_encoding("ASCII-8BIT") end.join('') end end |
#ioctl(*args) ⇒ Object
2274 2275 2276 2277 |
# File 'lib/csv.rb', line 2274 def ioctl(*args) raise NotImplementedError unless @io.respond_to?(:ioctl) @io.ioctl(*args) end |
#liberal_parsing? ⇒ Boolean
2182 2183 2184 |
# File 'lib/csv.rb', line 2182 def liberal_parsing? parser.liberal_parsing? end |
#line ⇒ Object
2247 2248 2249 |
# File 'lib/csv.rb', line 2247 def line parser.line end |
#lineno ⇒ Object
:call-seq:
csv.line_no -> integer
Returns the count of the rows parsed or generated.
Parsing:
string = "foo,0\nbar,1\nbaz,2\n"
path = 't.csv'
File.write(path, string)
CSV.open(path) do |csv|
csv.each do |row|
p [csv.lineno, row]
end
end
Output:
[1, ["foo", "0"]]
[2, ["bar", "1"]]
[3, ["baz", "2"]]
Generating:
CSV.generate do |csv|
p csv.lineno; csv << ['foo', 0]
p csv.lineno; csv << ['bar', 1]
p csv.lineno; csv << ['baz', 2]
end
Output:
0
1
2
2223 2224 2225 2226 2227 2228 2229 |
# File 'lib/csv.rb', line 2223 def lineno if @writer @writer.lineno else parser.lineno end end |
#max_field_size ⇒ Object
2053 2054 2055 |
# File 'lib/csv.rb', line 2053 def max_field_size parser.max_field_size end |
#path ⇒ Object
2279 2280 2281 |
# File 'lib/csv.rb', line 2279 def path @io.path if @io.respond_to?(:path) end |
#quote_char ⇒ Object
2029 2030 2031 |
# File 'lib/csv.rb', line 2029 def quote_char parser.quote_character end |
#read ⇒ Object Also known as: readlines
:call-seq:
csv.read -> array or csv_table
Forms the remaining rows from self
into:
-
A CSV::Table object, if headers are in use.
-
An Array of Arrays, otherwise.
The data source must be opened for reading.
Without headers:
string = "foo,0\nbar,1\nbaz,2\n"
path = 't.csv'
File.write(path, string)
csv = CSV.open(path)
csv.read # => [["foo", "0"], ["bar", "1"], ["baz", "2"]]
With headers:
string = "Name,Value\nfoo,0\nbar,1\nbaz,2\n"
path = 't.csv'
File.write(path, string)
csv = CSV.open(path, headers: true)
csv.read # => #<CSV::Table mode:col_or_row row_count:4>
Raises an exception if the source is not opened for reading:
string = "foo,0\nbar,1\nbaz,2\n"
csv = CSV.new(string)
csv.close
# Raises IOError (not opened for reading)
csv.read
2595 2596 2597 2598 2599 2600 2601 2602 |
# File 'lib/csv.rb', line 2595 def read rows = to_a if parser.use_headers? Table.new(rows, headers: parser.headers) else rows end end |
#return_headers? ⇒ Boolean
2124 2125 2126 |
# File 'lib/csv.rb', line 2124 def return_headers? parser.return_headers? end |
#rewind ⇒ Object
Rewinds the underlying IO object and resets CSV’s lineno() counter.
2312 2313 2314 2315 2316 2317 2318 |
# File 'lib/csv.rb', line 2312 def rewind @parser = nil @parser_enumerator = nil @eof_error = nil @writer.rewind if @writer @io.rewind end |
#row_sep ⇒ Object
2019 2020 2021 |
# File 'lib/csv.rb', line 2019 def row_sep parser.row_separator end |
#shift ⇒ Object Also known as: gets, readline
:call-seq:
csv.shift -> array, csv_row, or nil
Returns the next row of data as:
-
An Array if no headers are used.
-
A CSV::Row object if headers are used.
The data source must be opened for reading.
Without headers:
string = "foo,0\nbar,1\nbaz,2\n"
csv = CSV.new(string)
csv.shift # => ["foo", "0"]
csv.shift # => ["bar", "1"]
csv.shift # => ["baz", "2"]
csv.shift # => nil
With headers:
string = "Name,Value\nfoo,0\nbar,1\nbaz,2\n"
csv = CSV.new(string, headers: true)
csv.shift # => #<CSV::Row "Name":"foo" "Value":"0">
csv.shift # => #<CSV::Row "Name":"bar" "Value":"1">
csv.shift # => #<CSV::Row "Name":"baz" "Value":"2">
csv.shift # => nil
Raises an exception if the source is not opened for reading:
string = "foo,0\nbar,1\nbaz,2\n"
csv = CSV.new(string)
csv.close
# Raises IOError (not opened for reading)
csv.shift
2668 2669 2670 2671 2672 2673 2674 2675 2676 2677 2678 |
# File 'lib/csv.rb', line 2668 def shift if @eof_error eof_error, @eof_error = @eof_error, nil raise eof_error end begin parser_enumerator.next rescue StopIteration nil end end |
#skip_blanks? ⇒ Boolean
2161 2162 2163 |
# File 'lib/csv.rb', line 2161 def skip_blanks? parser.skip_blanks? end |
#skip_lines ⇒ Object
2063 2064 2065 |
# File 'lib/csv.rb', line 2063 def skip_lines parser.skip_lines end |
#stat(*args) ⇒ Object
2283 2284 2285 2286 |
# File 'lib/csv.rb', line 2283 def stat(*args) raise NotImplementedError unless @io.respond_to?(:stat) @io.stat(*args) end |
#to_i ⇒ Object
2288 2289 2290 2291 |
# File 'lib/csv.rb', line 2288 def to_i raise NotImplementedError unless @io.respond_to?(:to_i) @io.to_i end |
#to_io ⇒ Object
2293 2294 2295 |
# File 'lib/csv.rb', line 2293 def to_io @io.respond_to?(:to_io) ? @io.to_io : @io end |
#unconverted_fields? ⇒ Boolean
2096 2097 2098 |
# File 'lib/csv.rb', line 2096 def unconverted_fields? parser.unconverted_fields? end |
#write_headers? ⇒ Boolean
2134 2135 2136 |
# File 'lib/csv.rb', line 2134 def write_headers? @writer_options[:write_headers] end |