Module: Google::Cloud::Language::V1::EncodingType
- Defined in:
- proto_docs/google/cloud/language/v1/language_service.rb
Overview
Represents the text encoding that the caller uses to process the output.
Providing an EncodingType is recommended because the API provides the beginning offsets for various outputs, such as tokens and mentions, and languages that natively use different text encodings may access offsets differently.
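To illustrate why the encoding matters, here is a minimal pure-Ruby sketch that measures where a substring begins in UTF-8, UTF-16, and UTF-32 code units. It does not call the API; the helper `offset_in_code_units`, the `UNIT_SIZES` table, and the sample text are hypothetical names introduced only for this illustration.

```ruby
# Bytes per code unit for the Ruby encodings that correspond to
# UTF8, UTF16, and UTF32 (this table is an assumption for the sketch).
UNIT_SIZES = { "UTF-8" => 1, "UTF-16LE" => 2, "UTF-32LE" => 4 }.freeze

# Returns the offset of `needle` in `text`, counted in code units of
# `encoding`, or nil when `needle` does not occur in `text`.
def offset_in_code_units(text, needle, encoding)
  char_index = text.index(needle)   # index in characters (code points)
  return nil if char_index.nil?

  prefix = text[0, char_index]      # everything before the match
  prefix.encode(encoding).bytesize / UNIT_SIZES.fetch(encoding)
end

# "héllo 😀 world", written with escapes so the source stays ASCII.
sample = "h\u00e9llo \u{1F600} world"

offset_in_code_units(sample, "world", "UTF-8")    # => 12
offset_in_code_units(sample, "world", "UTF-16LE") # => 9
offset_in_code_units(sample, "world", "UTF-32LE") # => 8
```

The same word starts at three different offsets because "é" takes two UTF-8 bytes and the emoji takes four UTF-8 bytes or two UTF-16 code units, while each counts as a single UTF-32 code unit.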
Constant Summary
- NONE = 0
If EncodingType is not specified, encoding-dependent information (such as begin_offset) will be set to -1.
- UTF8 = 1
Encoding-dependent information (such as begin_offset) is calculated based on the UTF-8 encoding of the input. C++ and Go are examples of languages that use this encoding natively.
- UTF16 = 2
Encoding-dependent information (such as begin_offset) is calculated based on the UTF-16 encoding of the input. Java and JavaScript are examples of languages that use this encoding natively.
- UTF32 = 3
Encoding-dependent information (such as begin_offset) is calculated based on the UTF-32 encoding of the input. Python is an example of a language that uses this encoding natively.
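
When consuming offsets in Ruby, note that String#[] indexes by character while String#byteslice indexes by byte, so a begin_offset has to be interpreted according to the encoding type that was requested. The following is a sketch only: `span_at` and `CODE_UNITS` are hypothetical helpers, the symbol keys merely mirror the constant names above, and the offsets passed in are hand-computed sample values rather than real API output.

```ruby
# Ruby encoding name and bytes per code unit for each encoding type.
# NONE is deliberately absent: with NONE the offset is -1 and unusable.
CODE_UNITS = {
  UTF8:  ["UTF-8", 1],
  UTF16: ["UTF-16LE", 2],
  UTF32: ["UTF-32LE", 4]
}.freeze

# Returns the slice of `text` that starts at `begin_offset` (counted in
# code units of `encoding_type`) and spans as many code units as `content`.
# Raises KeyError for an encoding type that is not in CODE_UNITS.
def span_at(text, begin_offset, content, encoding_type)
  ruby_encoding, unit_size = CODE_UNITS.fetch(encoding_type)
  encoded = text.encode(ruby_encoding)
  byte_length = content.encode(ruby_encoding).bytesize
  encoded.byteslice(begin_offset * unit_size, byte_length).encode("UTF-8")
end

text = "h\u00e9llo \u{1F600} world"

span_at(text, 12, "world", :UTF8)  # => "world"
span_at(text, 9,  "world", :UTF16) # => "world"
span_at(text, 8,  "world", :UTF32) # => "world"
```

Using an offset with the wrong encoding type would land in the middle of a different character, which is why the request and the slicing logic need to agree on one encoding.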