Module: ActiveSupport::Multibyte
- Defined in:
- activesupport/lib/active_support/multibyte/utils.rb,
activesupport/lib/active_support/multibyte.rb,
activesupport/lib/active_support/multibyte/chars.rb,
activesupport/lib/active_support/multibyte/unicode.rb,
activesupport/lib/active_support/multibyte/exceptions.rb
Overview
:nodoc:
Defined Under Namespace
Modules: Unicode Classes: Chars, EncodingError
Constant Summary
- VALID_CHARACTER =
Regular expressions that describe valid byte sequences for a character
{ # Borrowed from the Kconv library by Shinji KONO - (also as seen on the W3C site) 'UTF-8' => /\A(?: [\x00-\x7f] | [\xc2-\xdf] [\x80-\xbf] | \xe0 [\xa0-\xbf] [\x80-\xbf] | [\xe1-\xef] [\x80-\xbf] [\x80-\xbf] | \xf0 [\x90-\xbf] [\x80-\xbf] [\x80-\xbf] | [\xf1-\xf3] [\x80-\xbf] [\x80-\xbf] [\x80-\xbf] | \xf4 [\x80-\x8f] [\x80-\xbf] [\x80-\xbf])\z /xn, # Quick check for valid Shift-JIS characters, disregards the odd-even pairing 'Shift_JIS' => /\A(?: [\x00-\x7e\xa1-\xdf] | [\x81-\x9f\xe0-\xef] [\x40-\x7e\x80-\x9e\x9f-\xfc])\z /xn }
Class Method Summary (collapse)
-
+ (Object) clean(string)
Removes all invalid characters from the string.
-
+ (Object) proxy_class
Returns the current proxy class.
-
+ (Object) proxy_class=(klass)
The proxy class returned when calling mb_chars.
-
+ (Object) valid_character
Returns a regular expression that matches valid characters in the current encoding.
-
+ (Object) verify(string)
Verifies the encoding of a string.
-
+ (Object) verify!(string)
Verifies the encoding of the string and raises an exception when it's not valid.
Class Method Details
+ (Object) clean(string)
Removes all invalid characters from the string.
Note: this method is a no-op in Ruby 1.9
46 47 48 |
# File 'activesupport/lib/active_support/multibyte/utils.rb', line 46 def self.clean(string) string end |
+ (Object) proxy_class
Returns the current proxy class
21 22 23 |
# File 'activesupport/lib/active_support/multibyte.rb', line 21 def self.proxy_class @proxy_class ||= ActiveSupport::Multibyte::Chars end |
+ (Object) proxy_class=(klass)
The proxy class returned when calling mb_chars. You can use this accessor to configure your own proxy class so you can support other encodings. See the ActiveSupport::Multibyte::Chars implementation for an example how to do this.
Example:
ActiveSupport::Multibyte.proxy_class = CharsForUTF32
16 17 18 |
# File 'activesupport/lib/active_support/multibyte.rb', line 16 def self.proxy_class=(klass) @proxy_class = klass end |
+ (Object) valid_character
Returns a regular expression that matches valid characters in the current encoding
7 8 9 |
# File 'activesupport/lib/active_support/multibyte/utils.rb', line 7 def self.valid_character VALID_CHARACTER[Encoding.default_external.to_s] end |
+ (Object) verify(string)
Verifies the encoding of a string
23 24 25 |
# File 'activesupport/lib/active_support/multibyte/utils.rb', line 23 def self.verify(string) string.valid_encoding? end |
+ (Object) verify!(string)
Verifies the encoding of the string and raises an exception when it's not valid
38 39 40 |
# File 'activesupport/lib/active_support/multibyte/utils.rb', line 38 def self.verify!(string) raise EncodingError.new("Found characters with invalid encoding") unless verify(string) end |