Module: CoreExtensions::String::Unicode
- Included in:
- String
- Defined in:
- lib/unicode.rb,
lib/unicode.rb
Overview
:nodoc:
Instance Method Summary collapse
-
#chars ⇒ Object
chars
is a Unicode safe proxy for string methods. -
#is_utf8? ⇒ Boolean
Returns true if the string has UTF-8 semantics (a String used for purely byte resources is unlikely to have them), returns false otherwise.
Instance Method Details
#chars ⇒ Object
chars
is a Unicode safe proxy for string methods. It creates and returns an instance of the Multibyte::Chars class which encapsulates the original string. A Unicode safe version of all the String methods are defined on this proxy class. Undefined methods are forwarded to String, so all of the string overrides can also be called through the chars
proxy.
name = 'Claus Müller'
name.reverse # => "rell??M sualC"
name.length # => 13
name.chars.reverse.to_s # => "rellüM sualC"
name.chars.length # => 12
All the methods on the chars proxy which normally return a string will return a Chars object. This allows method chaining on the result of any of these methods.
name.chars.reverse.length # => 12
The Char object tries to be as interchangeable with String objects as possible: sorting and comparing between String and Char work like expected. The bang! methods change the internal string representation in the Chars object. Interoperability problems can be resolved easily with a to_s
call.
For more information about the methods defined on the Chars proxy see Multibyte::Chars and Multibyte::Handlers::UTF8Handler.
37 38 39 |
# File 'lib/unicode.rb', line 37 def chars Multibyte::Chars.new(self) end |
#is_utf8? ⇒ Boolean
Returns true if the string has UTF-8 semantics (a String used for purely byte resources is unlikely to have them), returns false otherwise.
43 44 45 |
# File 'lib/unicode.rb', line 43 def is_utf8? Multibyte::Handlers::UTF8Handler.consumes?(self) end |