Class: CodeRay::Scanners::Scanner
- Inherits:
-
StringScanner
- Object
- StringScanner
- CodeRay::Scanners::Scanner
- Extended by:
- Plugin
- Includes:
- Enumerable
- Defined in:
- lib/coderay/scanner.rb
Overview
Scanner
The base class for all Scanners.
It is a subclass of Ruby’s great StringScanner
, which makes it easy to access the scanning methods inside.
It is also Enumerable
, so you can use it like an Array of Tokens:
require 'coderay'
c_scanner = CodeRay::Scanners[:c].new "if (*p == '{') nest++;"
for text, kind in c_scanner
puts text if kind == :operator
end
# prints: (*==)++;
OK, this is a very simple example :) You can also use map
, any?
, find
and even sort_by
, if you want.
Direct Known Subclasses
C, Debug, Delphi, Diff, HTML, JSON, Java, JavaScript, NitroXHTML, Plaintext, Python, RHTML, Ruby, SQL, Scheme, YAML
Constant Summary collapse
- ScanError =
Raised if a Scanner fails while scanning
Class.new(Exception)
- DEFAULT_OPTIONS =
The default options for all scanner classes.
Define @default_options for subclasses.
{ :stream => false }
Class Method Summary collapse
- .file_extension(extension = nil) ⇒ Object
- .normify(code) ⇒ Object
-
.streamable? ⇒ Boolean
Returns if the Scanner can be used in streaming mode.
Instance Method Summary collapse
- #column(pos = self.pos) ⇒ Object
-
#each(&block) ⇒ Object
Traverses the tokens.
-
#initialize(code = '', options = {}, &block) ⇒ Scanner
constructor
Create a new Scanner.
-
#line ⇒ Object
The current line position of the scanner.
- #reset ⇒ Object
-
#streaming? ⇒ Boolean
Whether the scanner is in streaming mode.
- #string=(code) ⇒ Object (also: #code=)
-
#tokenize(new_string = nil, options = {}) ⇒ Object
Scans the code and returns all tokens in a Tokens object.
- #tokens ⇒ Object
Methods included from Plugin
helper, included, plugin_host, plugin_id, register_for, title
Constructor Details
#initialize(code = '', options = {}, &block) ⇒ Scanner
Create a new Scanner.
-
code
is the input String and is handled by the superclass StringScanner. -
options
is a Hash with Symbols as keys. It is merged with the default options of the class (you can overwrite default options here.) -
block
is the callback for streamed highlighting.
If you set :stream to true
in the options, the Scanner uses a TokenStream with the block
as callback to handle the tokens.
Else, a Tokens object is used.
113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 |
# File 'lib/coderay/scanner.rb', line 113 def initialize code='', = {}, &block @options = self.class::DEFAULT_OPTIONS.merge raise "I am only the basic Scanner class. I can't scan "\ "anything. :( Use my subclasses." if self.class == Scanner super Scanner.normify(code) @tokens = [:tokens] if @options[:stream] warn "warning in CodeRay::Scanner.new: :stream is set, "\ "but no block was given" unless block_given? raise NotStreamableError, self unless kind_of? Streamable @tokens ||= TokenStream.new(&block) else warn "warning in CodeRay::Scanner.new: Block given, "\ "but :stream is #{@options[:stream]}" if block_given? @tokens ||= Tokens.new end setup end |
Class Method Details
.file_extension(extension = nil) ⇒ Object
79 80 81 82 83 84 85 |
# File 'lib/coderay/scanner.rb', line 79 def file_extension extension = nil if extension @file_extension = extension.to_s else @file_extension ||= plugin_id.to_s end end |
.normify(code) ⇒ Object
66 67 68 69 70 71 72 73 74 75 76 77 |
# File 'lib/coderay/scanner.rb', line 66 def normify code code = code.to_s if code.respond_to? :force_encoding begin code.force_encoding 'utf-8' code[/\z/] # raises an ArgumentError when code contains a non-UTF-8 char rescue ArgumentError code.force_encoding 'binary' end end code.to_unix end |
.streamable? ⇒ Boolean
Returns if the Scanner can be used in streaming mode.
62 63 64 |
# File 'lib/coderay/scanner.rb', line 62 def streamable? is_a? Streamable end |
Instance Method Details
#column(pos = self.pos) ⇒ Object
189 190 191 192 193 194 195 196 197 |
# File 'lib/coderay/scanner.rb', line 189 def column pos = self.pos return 0 if pos <= 0 string = string() if string.respond_to?(:bytesize) && (defined?(@bin_string) || string.bytesize != string.size) @bin_string ||= string.dup.force_encoding(:binary) string = @bin_string end pos - (string.rindex(?\n, pos) || 0) end |
#each(&block) ⇒ Object
Traverses the tokens.
174 175 176 177 178 |
# File 'lib/coderay/scanner.rb', line 174 def each &block raise ArgumentError, 'Cannot traverse TokenStream.' if @options[:stream] tokens.each(&block) end |
#line ⇒ Object
The current line position of the scanner.
Beware, this is implemented inefficiently. It should be used for debugging only.
185 186 187 |
# File 'lib/coderay/scanner.rb', line 185 def line string[0..pos].count("\n") + 1 end |
#reset ⇒ Object
135 136 137 138 |
# File 'lib/coderay/scanner.rb', line 135 def reset super reset_instance end |
#streaming? ⇒ Boolean
Whether the scanner is in streaming mode.
169 170 171 |
# File 'lib/coderay/scanner.rb', line 169 def streaming? !!@options[:stream] end |
#string=(code) ⇒ Object Also known as: code=
140 141 142 143 144 |
# File 'lib/coderay/scanner.rb', line 140 def string= code code = Scanner.normify(code) super code reset_instance end |
#tokenize(new_string = nil, options = {}) ⇒ Object
Scans the code and returns all tokens in a Tokens object.
151 152 153 154 155 156 157 158 159 160 161 162 |
# File 'lib/coderay/scanner.rb', line 151 def tokenize new_string=nil, = {} = @options.merge() self.string = new_string if new_string @cached_tokens = if @options[:stream] # :stream must have been set already reset unless new_string scan_tokens @tokens, @tokens else scan_tokens @tokens, end end |
#tokens ⇒ Object
164 165 166 |
# File 'lib/coderay/scanner.rb', line 164 def tokens @cached_tokens ||= tokenize end |