Description

This is a implementation of the JSON specification according to RFC 4627 www.ietf.org/rfc/rfc4627.txt . Starting from version 1.0.0 on there will be two variants available:

  • A pure ruby variant, that relies on the iconv and the stringscan extensions, which are both part of the ruby standard library.

  • The quite a bit faster C extension variant, which is in parts implemented in C and comes with its own unicode conversion functions and a parser generated by the ragel state machine compiler www.cs.queensu.ca/~thurston/ragel .

Both variants of the JSON generator generate UTF-8 character sequences by default. If an :ascii_only option with a true value is given, they escape all non-ASCII and control characters with uXXXX escape sequences, and support UTF-16 surrogate pairs in order to be able to generate the whole range of unicode code points.

All strings, that are to be encoded as JSON strings, should be UTF-8 byte sequences on the Ruby side. To encode raw binary strings, that aren’t UTF-8 encoded, please use the to_json_raw_object method of String (which produces an object, that contains a byte array) and decode the result on the receiving endpoint.

The JSON parsers can parse UTF-8, UTF-16BE, UTF-16LE, UTF-32BE, and UTF-32LE JSON documents under Ruby 1.8. Under Ruby 1.9 they take advantage of Ruby’s M17n features and can parse all documents which have the correct String#encoding set. If a document string has ASCII-8BIT as an encoding the parser attempts to figure out which of the UTF encodings from above it is and trys to parse it.

Installation

It’s recommended to use the extension variant of JSON, because it’s faster than the pure ruby variant. If you cannot build it on your system, you can settle for the latter.

Just type into the command line as root:

# rake install

The above command will build the extensions and install them on your system.

# rake install_pure

or

# ruby install.rb

will just install the pure ruby implementation of JSON.

If you use Rubygems you can type

# gem install json

instead, to install the newest JSON version.

There is also a pure ruby json only variant of the gem, that can be installed with:

# gem install json_pure

Compiling the extensions yourself

If you want to build the extensions yourself you need rake:

You can get it from rubyforge:
  http://rubyforge.org/projects/rake

or just type

# gem install rake

for the installation via rubygems.

If you want to create the parser.c file from its parser.rl file or draw nice graphviz images of the state machines, you need ragel from: www.cs.queensu.ca/~thurston/ragel

Usage

To use JSON you can

require 'json'

to load the installed variant (either the extension ‘json’ or the pure variant ‘json_pure’). If you have installed the extension variant, you can pick either the extension variant or the pure variant by typing

require 'json/ext'

or

require 'json/pure'

Now you can parse a JSON document into a ruby data structure by calling

JSON.parse(document)

If you want to generate a JSON document from a ruby data structure call

JSON.generate(data)

You can also use the pretty_generate method (which formats the output more verbosely and nicely) or fast_generate (which doesn’t do any of the security checks generate performs, e. g. nesting deepness checks).

To create a valid JSON document you have to make sure, that the output is embedded in either a JSON array [] or a JSON object {}. The easiest way to do this, is by putting your values in a Ruby Array or Hash instance.

There are also the JSON and JSON[] methods which use parse on a String or generate a JSON document from an array or hash:

document = JSON 'test'  => 23 # => "{\"test\":23}"
document = JSON['test'] => 23 # => "{\"test\":23}"

and

data = JSON '{"test":23}'  # => {"test"=>23}
data = JSON['{"test":23}'] # => {"test"=>23}

You can choose to load a set of common additions to ruby core’s objects if you

require 'json/add/core'

After requiring this you can, e. g., serialise/deserialise Ruby ranges:

JSON JSON(1..10) # => 1..10

To find out how to add JSON support to other or your own classes, read the section “More Examples” below.

To get the best compatibility to rails’ JSON implementation, you can

require 'json/add/rails'

Both of the additions attempt to require ‘json’ (like above) first, if it has not been required yet.

More Examples

To create a JSON document from a ruby data structure, you can call JSON.generate like that:

json = JSON.generate [1, 2, {"a"=>3.141}, false, true, nil, 4..10]
# => "[1,2,{\"a\":3.141},false,true,null,\"4..10\"]"

To get back a ruby data structure from a JSON document, you have to call JSON.parse on it:

JSON.parse json
# => [1, 2, {"a"=>3.141}, false, true, nil, "4..10"]

Note, that the range from the original data structure is a simple string now. The reason for this is, that JSON doesn’t support ranges or arbitrary classes. In this case the json library falls back to call Object#to_json, which is the same as #to_s.to_json.

It’s possible to add JSON support serialization to arbitrary classes by simply implementing a more specialized version of the #to_json method, that should return a JSON object (a hash converted to JSON with #to_json) like this (don’t forget the *a for all the arguments):

class Range
  def to_json(*a)
    {
      'json_class'   => self.class.name, # = 'Range'
      'data'         => [ first, last, exclude_end? ]
    }.to_json(*a)
  end
end

The hash key ‘json_class’ is the class, that will be asked to deserialise the JSON representation later. In this case it’s ‘Range’, but any namespace of the form ‘A::B’ or ‘::A::B’ will do. All other keys are arbitrary and can be used to store the necessary data to configure the object to be deserialised.

If a the key ‘json_class’ is found in a JSON object, the JSON parser checks if the given class responds to the json_create class method. If so, it is called with the JSON object converted to a Ruby hash. So a range can be deserialised by implementing Range.json_create like this:

class Range
  def self.json_create(o)
    new(*o['data'])
  end
end

Now it possible to serialise/deserialise ranges as well:

json = JSON.generate [1, 2, {"a"=>3.141}, false, true, nil, 4..10]
# => "[1,2,{\"a\":3.141},false,true,null,{\"json_class\":\"Range\",\"data\":[4,10,false]}]"
JSON.parse json
# => [1, 2, {"a"=>3.141}, false, true, nil, 4..10]

JSON.generate always creates the shortest possible string representation of a ruby data structure in one line. This is good for data storage or network protocols, but not so good for humans to read. Fortunately there’s also JSON.pretty_generate (or JSON.pretty_generate) that creates a more readable output:

puts JSON.pretty_generate([1, 2, {"a"=>3.141}, false, true, nil, 4..10])
[
  1,
  2,
  {
    "a": 3.141
  },
  false,
  true,
  null,
  {
    "json_class": "Range",
    "data": [
      4,
      10,
      false
    ]
  }
]

There are also the methods Kernel#j for generate, and Kernel#jj for pretty_generate output to the console, that work analogous to Core Ruby’s p and the pp library’s pp methods.

The script tools/server.rb contains a small example if you want to test, how receiving a JSON object from a webrick server in your browser with the javasript prototype library www.prototypejs.org works.

Speed Comparisons

I have created some benchmark results (see the benchmarks/data-p4-3Ghz subdir of the package) for the JSON-parser to estimate the speed up in the C extension:

Comparing times (call_time_mean):
 1 ParserBenchmarkExt#parser   900 repeats:
       553.922304770 (  real) ->   21.500x 
         0.001805307
 2 ParserBenchmarkYAML#parser  1000 repeats:
       224.513358139 (  real) ->    8.714x 
         0.004454078
 3 ParserBenchmarkPure#parser  1000 repeats:
        26.755020642 (  real) ->    1.038x 
         0.037376163
 4 ParserBenchmarkRails#parser 1000 repeats:
        25.763381731 (  real) ->    1.000x 
         0.038814780
           calls/sec (  time) ->    speed  covers
           secs/call

In the table above 1 is JSON::Ext::Parser, 2 is YAML.load with YAML compatbile JSON document, 3 is is JSON::Pure::Parser, and 4 is ActiveSupport::JSON.decode. The ActiveSupport JSON-decoder converts the input first to YAML and then uses the YAML-parser, the conversion seems to slow it down so much that it is only as fast as the JSON::Pure::Parser!

If you look at the benchmark data you can see that this is mostly caused by the frequent high outliers - the median of the Rails-parser runs is still overall smaller than the median of the JSON::Pure::Parser runs:

Comparing times (call_time_median):
 1 ParserBenchmarkExt#parser   900 repeats:
       800.592479481 (  real) ->   26.936x 
         0.001249075
 2 ParserBenchmarkYAML#parser  1000 repeats:
       271.002390644 (  real) ->    9.118x 
         0.003690004
 3 ParserBenchmarkRails#parser 1000 repeats:
        30.227910865 (  real) ->    1.017x 
         0.033082008
 4 ParserBenchmarkPure#parser  1000 repeats:
        29.722384421 (  real) ->    1.000x 
         0.033644676
           calls/sec (  time) ->    speed  covers
           secs/call

I have benchmarked the JSON-Generator as well. This generated a few more values, because there are different modes that also influence the achieved speed:

Comparing times (call_time_mean):
 1 GeneratorBenchmarkExt#generator_fast    1000 repeats:
       547.354332608 (  real) ->   15.090x 
         0.001826970
 2 GeneratorBenchmarkExt#generator_safe    1000 repeats:
       443.968212317 (  real) ->   12.240x 
         0.002252414
 3 GeneratorBenchmarkExt#generator_pretty  900 repeats:
       375.104545883 (  real) ->   10.341x 
         0.002665923
 4 GeneratorBenchmarkPure#generator_fast   1000 repeats:
        49.978706968 (  real) ->    1.378x 
         0.020008521
 5 GeneratorBenchmarkRails#generator       1000 repeats:
        38.531868759 (  real) ->    1.062x 
         0.025952543
 6 GeneratorBenchmarkPure#generator_safe   1000 repeats:
        36.927649925 (  real) ->    1.018x 7 (>=3859)
         0.027079979
 7 GeneratorBenchmarkPure#generator_pretty 1000 repeats:
        36.272134441 (  real) ->    1.000x 6 (>=3859)
         0.027569373
           calls/sec (  time) ->    speed  covers
           secs/call

In the table above 1-3 are JSON::Ext::Generator methods. 4, 6, and 7 are JSON::Pure::Generator methods and 5 is the Rails JSON generator. It is now a bit faster than the generator_safe and generator_pretty methods of the pure variant but slower than the others.

To achieve the fastest JSON document output, you can use the fast_generate method. Beware, that this will disable the checking for circular Ruby data structures, which may cause JSON to go into an infinite loop.

Here are the median comparisons for completeness’ sake:

Comparing times (call_time_median):
 1 GeneratorBenchmarkExt#generator_fast    1000 repeats:
       708.258020939 (  real) ->   16.547x 
         0.001411915
 2 GeneratorBenchmarkExt#generator_safe    1000 repeats:
       569.105020353 (  real) ->   13.296x 
         0.001757145
 3 GeneratorBenchmarkExt#generator_pretty  900 repeats:
       482.825371244 (  real) ->   11.280x 
         0.002071142
 4 GeneratorBenchmarkPure#generator_fast   1000 repeats:
        62.717626652 (  real) ->    1.465x 
         0.015944481
 5 GeneratorBenchmarkRails#generator       1000 repeats:
        43.965681162 (  real) ->    1.027x 
         0.022745013
 6 GeneratorBenchmarkPure#generator_safe   1000 repeats:
        43.929073409 (  real) ->    1.026x 7 (>=3859)
         0.022763968
 7 GeneratorBenchmarkPure#generator_pretty 1000 repeats:
        42.802514491 (  real) ->    1.000x 6 (>=3859)
         0.023363113
           calls/sec (  time) ->    speed  covers
           secs/call

Author

Florian Frank <[email protected]>

License

Ruby License, see the COPYING file included in the source distribution. The Ruby License includes the GNU General Public License (GPL), Version 2, so see the file GPL as well.

Download

The latest version of this library can be downloaded at

Online Documentation should be located at