Class: CodeRay::Scanners::Scanner

Inherits:
StringScanner
  • Object
show all
Extended by:
Plugin
Includes:
Enumerable
Defined in:
lib/coderay/scanner.rb

Overview

Scanner

The base class for all Scanners.

It is a subclass of Ruby’s great StringScanner, which makes it easy to access the scanning methods inside.

It is also Enumerable, so you can use it like an Array of Tokens:

require 'coderay'

c_scanner = CodeRay::Scanners[:c].new "if (*p == '{') nest++;"

for text, kind in c_scanner
  puts text if kind == :operator
end

# prints: (*==)++;

OK, this is a very simple example :) You can also use map, any?, find and even sort_by, if you want.

Constant Summary collapse

ScanError =

Raised if a Scanner fails while scanning

Class.new(Exception)
DEFAULT_OPTIONS =

The default options for all scanner classes.

Define @default_options for subclasses.

{ :stream => false }
KINDS_NOT_LOC =
[:comment, :doctype]

Class Method Summary collapse

Instance Method Summary collapse

Methods included from Plugin

helper, included, plugin_host, plugin_id, register_for, title

Constructor Details

#initialize(code = '', options = {}, &block) ⇒ Scanner

Create a new Scanner.

  • code is the input String and is handled by the superclass StringScanner.

  • options is a Hash with Symbols as keys. It is merged with the default options of the class (you can overwrite default options here.)

  • block is the callback for streamed highlighting.

If you set :stream to true in the options, the Scanner uses a TokenStream with the block as callback to handle the tokens.

Else, a Tokens object is used.



116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
# File 'lib/coderay/scanner.rb', line 116

def initialize code='', options = {}, &block
  raise "I am only the basic Scanner class. I can't scan "\
    "anything. :( Use my subclasses." if self.class == Scanner
  
  @options = self.class::DEFAULT_OPTIONS.merge options

  super Scanner.normify(code)

  @tokens = options[:tokens]
  if @options[:stream]
    warn "warning in CodeRay::Scanner.new: :stream is set, "\
      "but no block was given" unless block_given?
    raise NotStreamableError, self unless kind_of? Streamable
    @tokens ||= TokenStream.new(&block)
  else
    warn "warning in CodeRay::Scanner.new: Block given, "\
      "but :stream is #{@options[:stream]}" if block_given?
    @tokens ||= Tokens.new
  end
  @tokens.scanner = self

  setup
end

Class Method Details

.file_extension(extension = nil) ⇒ Object



82
83
84
85
86
87
88
# File 'lib/coderay/scanner.rb', line 82

def file_extension extension = nil
  if extension
    @file_extension = extension.to_s
  else
    @file_extension ||= plugin_id.to_s
  end
end

.normify(code) ⇒ Object



69
70
71
72
73
74
75
76
77
78
79
80
# File 'lib/coderay/scanner.rb', line 69

def normify code
  code = code.to_s
  if code.respond_to? :force_encoding
    begin
      code.force_encoding 'utf-8'
      code[/\z/]  # raises an ArgumentError when code contains a non-UTF-8 char
    rescue ArgumentError
      code.force_encoding 'binary'
    end
  end
  code.to_unix
end

.streamable?Boolean

Returns if the Scanner can be used in streaming mode.

Returns:

  • (Boolean)


65
66
67
# File 'lib/coderay/scanner.rb', line 65

def streamable?
  is_a? Streamable
end

Instance Method Details

#column(pos = self.pos) ⇒ Object



194
195
196
197
198
199
200
201
202
# File 'lib/coderay/scanner.rb', line 194

def column pos = self.pos
  return 0 if pos <= 0
  string = string()
  if string.respond_to?(:bytesize) && (defined?(@bin_string) || string.bytesize != string.size)
    @bin_string ||= string.dup.force_encoding(:binary)
    string = @bin_string
  end
  pos - (string.rindex(?\n, pos) || 0)
end

#each(&block) ⇒ Object

Traverses the tokens.

Raises:

  • (ArgumentError)


179
180
181
182
183
# File 'lib/coderay/scanner.rb', line 179

def each &block
  raise ArgumentError,
    'Cannot traverse TokenStream.' if @options[:stream]
  tokens.each(&block)
end

#lineObject

The current line position of the scanner.

Beware, this is implemented inefficiently. It should be used for debugging only.



190
191
192
# File 'lib/coderay/scanner.rb', line 190

def line
  string[0..pos].count("\n") + 1
end

#marshal_dumpObject



204
205
206
# File 'lib/coderay/scanner.rb', line 204

def marshal_dump
  @options
end

#marshal_load(options) ⇒ Object



208
209
210
# File 'lib/coderay/scanner.rb', line 208

def marshal_load options
  @options = options
end

#resetObject



140
141
142
143
# File 'lib/coderay/scanner.rb', line 140

def reset
  super
  reset_instance
end

#streaming?Boolean

Whether the scanner is in streaming mode.

Returns:

  • (Boolean)


174
175
176
# File 'lib/coderay/scanner.rb', line 174

def streaming?
  !!@options[:stream]
end

#string=(code) ⇒ Object Also known as: code=



145
146
147
148
149
# File 'lib/coderay/scanner.rb', line 145

def string= code
  code = Scanner.normify(code)
  super code
  reset_instance
end

#tokenize(new_string = nil, options = {}) ⇒ Object

Scans the code and returns all tokens in a Tokens object.



156
157
158
159
160
161
162
163
164
165
166
167
# File 'lib/coderay/scanner.rb', line 156

def tokenize new_string=nil, options = {}
  options = @options.merge(options)
  self.string = new_string if new_string
  @cached_tokens =
    if @options[:stream]  # :stream must have been set already
      reset unless new_string
      scan_tokens @tokens, options
      @tokens
    else
      scan_tokens @tokens, options
    end
end

#tokensObject



169
170
171
# File 'lib/coderay/scanner.rb', line 169

def tokens
  @cached_tokens ||= tokenize
end