Class: String
- Inherits:
- Object
- Object < String
- Includes:
- ChineseDetector
- Defined in:
- lib/script_detector.rb
Instance Method Summary
- #chinese? ⇒ Boolean
  Returns true if the string contains Chinese characters and no Japanese or Korean characters.
- #identify_script ⇒ Object
  Tries to detect the script and returns one of “Japanese”, “Korean”, “Traditional Chinese”, “Simplified Chinese”, “Ambiguous Chinese” or “Unknown”.
- #japanese? ⇒ Boolean
  Returns true if the string contains specifically Japanese (hiragana or katakana) characters.
- #korean? ⇒ Boolean
  Returns true if the string contains specifically Korean (hangul) characters.
- #simplified_chinese? ⇒ Boolean
  Returns true if the string contains simplified Chinese characters (简体字).
- #traditional_chinese? ⇒ Boolean
  Returns true if the string contains traditional Chinese characters (繁體字).
Instance Method Details
#chinese? ⇒ Boolean
Returns true if the string contains Chinese characters and no Japanese or Korean characters.

    # File 'lib/script_detector.rb', line 9
    def chinese?
      look_for_chars_in(/\p{Han}/) and !self.japanese? and !self.korean?
    end
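The check above combines three Unicode script tests. The following is a minimal standalone sketch, assuming that `look_for_chars_in` (defined in `ChineseDetector`, which is not shown here) simply tests the string against the given regex. The names `contains?` and `chinese_only?` are hypothetical and exist only for this illustration.

```ruby
# Hypothetical stand-in for look_for_chars_in (assumption: it returns
# true when any character in the string matches the pattern).
def contains?(text, pattern)
  text.match?(pattern)
end

# Mirrors the logic of String#chinese? without reopening String.
def chinese_only?(text)
  contains?(text, /\p{Han}/) &&
    !contains?(text, /\p{Hiragana}|\p{Katakana}/) &&
    !contains?(text, /\p{Hangul}/)
end

puts chinese_only?("你好")        # Han only          → true
puts chinese_only?("こんにちは")   # hiragana, no Han  → false
puts chinese_only?("日本語です")   # Han plus hiragana → false
puts chinese_only?("한국어")       # hangul only       → false
```

Under this logic a kanji-only Japanese string such as "日本語" is also reported as Chinese, because it contains Han characters and no kana.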
#identify_script ⇒ Object
Tries to detect the script and returns one of “Japanese”, “Korean”, “Traditional Chinese”, “Simplified Chinese”, “Ambiguous Chinese” or “Unknown”.

    # File 'lib/script_detector.rb', line 34
    def identify_script
      return "Japanese" if self.japanese?
      return "Korean" if self.korean?
      return "Traditional Chinese" if self.traditional_chinese?
      return "Simplified Chinese" if self.simplified_chinese?
      return "Ambiguous Chinese" if self.chinese?
      "Unknown"
    end
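The order of the checks determines the result for mixed input: the first matching test wins. The sketch below reproduces that precedence in a standalone function. The real traditional and simplified regexes live in `ChineseDetector` and are not shown in this document, so `TRADITIONAL_SAMPLE` and `SIMPLIFIED_SAMPLE` are tiny hypothetical stand-ins, and `identify_script_sketch` is an illustrative name, not part of the library.

```ruby
# Hypothetical sample character classes (assumption): a real detector
# would need far larger tables than these five characters each.
TRADITIONAL_SAMPLE = /[體這們簡說]/
SIMPLIFIED_SAMPLE  = /[体这们简说]/

# Same precedence as String#identify_script: first match wins.
def identify_script_sketch(text)
  return "Japanese" if text.match?(/\p{Hiragana}|\p{Katakana}/)
  return "Korean" if text.match?(/\p{Hangul}/)
  return "Traditional Chinese" if text.match?(TRADITIONAL_SAMPLE)
  return "Simplified Chinese" if text.match?(SIMPLIFIED_SAMPLE)
  return "Ambiguous Chinese" if text.match?(/\p{Han}/)
  "Unknown"
end

puts identify_script_sketch("日本語を話す")  # → Japanese
puts identify_script_sketch("한국어")        # → Korean
puts identify_script_sketch("繁體字")        # → Traditional Chinese
puts identify_script_sketch("简体字")        # → Simplified Chinese
puts identify_script_sketch("中文")          # → Ambiguous Chinese
puts identify_script_sketch("hello")         # → Unknown
```

In this sketch a string that mixes both forms, such as "體体", is reported as "Traditional Chinese" because that test runs before the simplified one.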
#japanese? ⇒ Boolean
Returns true if the string contains specifically Japanese (hiragana or katakana) characters.

    # File 'lib/script_detector.rb', line 24
    def japanese?
      look_for_chars_in /(\p{Katakana}|\p{Hiragana})/
    end
#korean? ⇒ Boolean
Returns true if the string contains specifically Korean (hangul) characters.

    # File 'lib/script_detector.rb', line 29
    def korean?
      look_for_chars_in /\p{Hangul}/
    end
#simplified_chinese? ⇒ Boolean
Returns true if the string contains simplified Chinese characters (简体字).

    # File 'lib/script_detector.rb', line 19
    def simplified_chinese?
      look_for_chars_in simplified_chinese_regex
    end
#traditional_chinese? ⇒ Boolean
Returns true if the string contains traditional Chinese characters (繁體字).

    # File 'lib/script_detector.rb', line 14
    def traditional_chinese?
      look_for_chars_in traditional_chinese_regex
    end
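The definitions of `traditional_chinese_regex` and `simplified_chinese_regex` are not shown in this document. As a rough sketch of how such a regex could be built from a list of distinguishing characters, the example below uses Ruby's `Regexp.union`. The character lists and all names in it are hypothetical and far smaller than anything a real detector would need.

```ruby
# Hypothetical sample lists (assumption): each pair is the traditional
# and simplified form of the same character.
TRADITIONAL_ONLY = %w[體 這 們 簡 說].freeze
SIMPLIFIED_ONLY  = %w[体 这 们 简 说].freeze

# Regexp.union builds an alternation matching any one of the strings.
TRADITIONAL_RE = Regexp.union(TRADITIONAL_ONLY)
SIMPLIFIED_RE  = Regexp.union(SIMPLIFIED_ONLY)

def traditional_sketch?(text)
  text.match?(TRADITIONAL_RE)
end

def simplified_sketch?(text)
  text.match?(SIMPLIFIED_RE)
end

puts traditional_sketch?("繁體字")  # contains 體       → true
puts simplified_sketch?("简体字")   # contains 简 and 体 → true
puts traditional_sketch?("简体字")  # no listed character → false
```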