Class: Aws::BedrockAgentRuntime::Types::TextInferenceConfig

Inherits: Struct

Object
Struct
Aws::BedrockAgentRuntime::Types::TextInferenceConfig

Includes: Structure

Defined in: lib/aws-sdk-bedrockagentruntime/types.rb

Overview

Configuration settings for text generation using a language model via the RetrieveAndGenerate operation. Includes parameters like temperature, top-p, maximum token count, and stop sequences.

Note: The valid range of maxTokens depends on the accepted values for your chosen model's inference parameters. To see the inference parameters for your model, see [Inference parameters for foundation models][1].

[1]: docs.aws.amazon.com/bedrock/latest/userguide/model-parameters.html
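As a minimal sketch of how these four settings fit together, the snippet below assembles them into a plain Ruby hash. The helper name `build_text_inference_config` is hypothetical and not part of the SDK, and the example values are arbitrary; where the hash nests inside a RetrieveAndGenerate request is described in that operation's own reference, not here.

```ruby
# Hypothetical helper (not part of the SDK): collects the four documented
# settings into a hash and drops any that were left unset (nil).
def build_text_inference_config(max_tokens: nil, stop_sequences: nil,
                                temperature: nil, top_p: nil)
  {
    max_tokens: max_tokens,
    stop_sequences: stop_sequences,
    temperature: temperature,
    top_p: top_p
  }.compact
end

# Example values are illustrative only; check your model's accepted ranges.
config = build_text_inference_config(max_tokens: 512, temperature: 0.2, top_p: 0.9)
p config
```

Omitting unset keys, rather than sending `nil`, is one way to leave the remaining settings at whatever defaults apply to the chosen model.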

Constant Summary

SENSITIVE = []

Instance Attribute Summary

#max_tokens ⇒ Integer

The maximum number of tokens to generate in the output text.

#stop_sequences ⇒ Array<String>

A list of sequences of characters that, if generated, will cause the model to stop generating further tokens.

#temperature ⇒ Float

Controls the randomness of text generated by the language model, influencing how much the model sticks to the most predictable next words versus exploring more surprising options.

#top_p ⇒ Float

A probability distribution threshold that controls which tokens the model considers as candidates for the next token.

Instance Attribute Details

#max_tokens ⇒ `Integer`

The maximum number of tokens to generate in the output text. Do not use the minimum of 0 or the maximum of 65536. The limit values described here are arbitrary; for actual values, consult the limits defined by your specific model.

Returns:

(Integer)

# File 'lib/aws-sdk-bedrockagentruntime/types.rb', line 5880

class TextInferenceConfig < Struct.new(
  :max_tokens,
  :stop_sequences,
  :temperature,
  :top_p)
  SENSITIVE = []
  include Aws::Structure
end
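Because the real ceiling for `max_tokens` comes from the model rather than from the range quoted above, a caller may want to clamp a requested value before building the config. The following is a minimal sketch under that assumption: `clamp_max_tokens` is a hypothetical helper, and the limit of 4096 is a made-up figure standing in for whatever your model documents.

```ruby
# Hypothetical helper (not part of the SDK): keeps a requested token budget
# between 1 and a caller-supplied, model-specific limit.
def clamp_max_tokens(requested, model_limit)
  unless model_limit.is_a?(Integer) && model_limit.positive?
    raise ArgumentError, "model_limit must be a positive Integer"
  end

  requested.clamp(1, model_limit)
end

# 4096 is an assumed limit for illustration; look up the real one for your model.
puts clamp_max_tokens(100_000, 4096)
puts clamp_max_tokens(512, 4096)
```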

#stop_sequences ⇒ `Array<String>`

A list of sequences of characters that, if generated, will cause the model to stop generating further tokens. Do not use a minimum length of 1 or a maximum length of 1000. The limit values described here are arbitrary; for actual values, consult the limits defined by your specific model.

Returns:

(Array<String>)
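The effect of stop sequences can be illustrated client-side with a small sketch. Stopping actually happens inside the model during generation; the hypothetical `truncate_at_stop` helper below only mimics the visible result on an already complete string, and it assumes the stop sequence itself is excluded from the output, which may differ between models.

```ruby
# Hypothetical illustration (not part of the SDK): cuts text at the earliest
# occurrence of any stop sequence, excluding the stop sequence itself.
def truncate_at_stop(text, stop_sequences)
  # Positions where each stop sequence first appears; absent ones are skipped.
  cut = stop_sequences.filter_map { |seq| text.index(seq) }.min
  cut ? text[0...cut] : text
end

raw = "Answer: 42\nObservation: done"
puts truncate_at_stop(raw, ["\nObservation:"])
```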

#temperature ⇒ `Float`

Controls the randomness of text generated by the language model, influencing how much the model sticks to the most predictable next words versus exploring more surprising options. A lower temperature value (e.g. 0.2 or 0.3) makes model outputs more deterministic or predictable, while a higher temperature (e.g. 0.8 or 0.9) makes the outputs more creative or unpredictable.

Returns:

(Float)
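To make the effect of temperature concrete, here is a minimal sketch of the commonly used temperature-scaled softmax. This documentation does not state how any particular model applies temperature, so treat the formula as an assumption; the function name and the example logits are hypothetical.

```ruby
# Illustrative only: converts raw scores (logits) into probabilities after
# dividing them by the temperature. Lower temperatures sharpen the distribution.
def softmax_with_temperature(logits, temperature)
  raise ArgumentError, "temperature must be positive" unless temperature.positive?

  scaled = logits.map { |logit| logit / temperature.to_f }
  max = scaled.max # subtracted before exponentiating for numerical stability
  exps = scaled.map { |value| Math.exp(value - max) }
  total = exps.sum
  exps.map { |value| value / total }
end

logits = [2.0, 1.0, 0.1] # made-up scores for three candidate tokens
cool = softmax_with_temperature(logits, 0.2)
warm = softmax_with_temperature(logits, 0.9)

puts "temperature 0.2: #{cool.map { |prob| prob.round(3) }}"
puts "temperature 0.9: #{warm.map { |prob| prob.round(3) }}"
```

At the lower temperature nearly all probability mass lands on the highest-scoring token, while the higher temperature leaves more mass on the alternatives.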

#top_p ⇒ `Float`

A probability distribution threshold that controls which tokens the model considers as candidates for the next token. The model only considers the top p% of the probability distribution when generating the next token.

Returns:

(Float)
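The thresholding described above is often implemented as nucleus (top-p) filtering. The sketch below shows that technique under two assumptions not stated in this documentation: that `top_p` is expressed as a fraction between 0 and 1, and that the surviving probabilities are renormalized. The function name and example distribution are hypothetical.

```ruby
# Illustrative only: keeps the most probable tokens until their cumulative
# probability reaches top_p, then renormalizes the survivors.
# Returns a hash of token index => renormalized probability.
def top_p_filter(probs, top_p)
  ranked = probs.each_with_index.sort_by { |prob, _index| -prob }

  kept = []
  cumulative = 0.0
  ranked.each do |prob, index|
    kept << [index, prob]
    cumulative += prob
    break if cumulative >= top_p
  end

  total = kept.sum { |_index, prob| prob }
  kept.to_h { |index, prob| [index, prob / total] }
end

probs = [0.5, 0.3, 0.15, 0.05] # made-up next-token distribution
nucleus = top_p_filter(probs, 0.75)
p nucleus.keys
```

With a threshold of 0.75, only the two most probable tokens survive, and the model would sample from those alone.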