Class: Boxcars::Anthropic

Inherits:
Engine
  • Object
Defined in:
lib/boxcars/engine/anthropic.rb

Overview

An engine that uses Anthropic’s API.

Constant Summary

DEFAULT_PARAMS =

The default parameters to use when asking the engine.

{
  model: "claude-3-5-sonnet-20240620",
  max_tokens: 4096,
  temperature: 0.1
}.freeze
DEFAULT_NAME =

The default name of the engine.

"Anthropic engine"
DEFAULT_DESCRIPTION =

The default description of the engine.

"useful for when you need to use Anthropic AI to answer questions. " \
"You should ask targeted questions"

Instance Attribute Summary

Instance Method Summary

Constructor Details

#initialize(name: DEFAULT_NAME, description: DEFAULT_DESCRIPTION, prompts: [], **kwargs) ⇒ Anthropic

An engine is the driver for a single tool to run.

Parameters:

  • name (String) (defaults to: DEFAULT_NAME)

    The name of the engine. Defaults to “Anthropic engine”.

  • description (String) (defaults to: DEFAULT_DESCRIPTION)

    A description of the engine. Defaults to: “useful for when you need to use Anthropic AI to answer questions. You should ask targeted questions”.

  • prompts (Array<String>) (defaults to: [])

    The prompts to use when asking the engine. Defaults to [].



# File 'lib/boxcars/engine/anthropic.rb', line 28

def initialize(name: DEFAULT_NAME, description: DEFAULT_DESCRIPTION, prompts: [], **kwargs)
  @llm_params = DEFAULT_PARAMS.merge(kwargs)
  @prompts = prompts
  @batch_size = 20
  super(description: description, name: name)
end
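The constructor merges any keyword arguments over `DEFAULT_PARAMS`, so a caller can override, say, `temperature` while keeping the default model. A minimal sketch of that merge, with `DEFAULT_PARAMS` reproduced from the constant above:

```ruby
# Sketch of how the constructor builds @llm_params: keyword arguments
# override the defaults; untouched defaults pass through unchanged.
DEFAULT_PARAMS = {
  model: "claude-3-5-sonnet-20240620",
  max_tokens: 4096,
  temperature: 0.1
}.freeze

llm_params = DEFAULT_PARAMS.merge(temperature: 0.7, max_tokens: 1024)
puts llm_params[:model]       # default model is preserved
puts llm_params[:temperature] # overridden by the caller
```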

Instance Attribute Details

#batch_size ⇒ Object (readonly)

Returns the value of attribute batch_size.



# File 'lib/boxcars/engine/anthropic.rb', line 8

def batch_size
  @batch_size
end

#llm_params ⇒ Object (readonly)

Returns the value of attribute llm_params.



# File 'lib/boxcars/engine/anthropic.rb', line 8

def llm_params
  @llm_params
end

#model_kwargs ⇒ Object (readonly)

Returns the value of attribute model_kwargs.



# File 'lib/boxcars/engine/anthropic.rb', line 8

def model_kwargs
  @model_kwargs
end

#prompts ⇒ Object (readonly)

Returns the value of attribute prompts.



# File 'lib/boxcars/engine/anthropic.rb', line 8

def prompts
  @prompts
end

Instance Method Details

#anthropic_client(anthropic_api_key: nil) ⇒ Object



# File 'lib/boxcars/engine/anthropic.rb', line 39

def anthropic_client(anthropic_api_key: nil)
  ::Anthropic::Client.new(access_token: anthropic_api_key)
end

#check_response(response, must_haves: %w[completion]) ⇒ Object

Make sure we got a valid response.

Parameters:

  • response (Hash)

    The response to check.

  • must_haves (Array<String>) (defaults to: %w[completion])

    The keys that must be present in the response. Defaults to %w[completion].

Raises:

  • (KeyError)

    if there is an issue with the access token.

  • (ValueError)

    if the response is not valid.



# File 'lib/boxcars/engine/anthropic.rb', line 112

def check_response(response, must_haves: %w[completion])
  if response['error']
    code = response.dig('error', 'code')
    msg = response.dig('error', 'message') || 'unknown error'
    raise KeyError, "ANTHROPIC_API_KEY not valid" if code == 'invalid_api_key'

    raise ValueError, "Anthropic error: #{msg}"
  end

  must_haves.each do |key|
    raise ValueError, "Expecting key #{key} in response" unless response.key?(key)
  end
end
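The validation logic can be exercised standalone; in this sketch `ValueError` is a stand-in for the Boxcars error class, and the response hashes are hand-built examples:

```ruby
# Stand-in for Boxcars::ValueError, to make the sketch self-contained.
class ValueError < StandardError; end

# Same checks as above: an error payload raises, and any missing
# must-have key raises; a well-formed response passes silently.
def check_response(response, must_haves: %w[completion])
  if response['error']
    code = response.dig('error', 'code')
    msg = response.dig('error', 'message') || 'unknown error'
    raise KeyError, "ANTHROPIC_API_KEY not valid" if code == 'invalid_api_key'

    raise ValueError, "Anthropic error: #{msg}"
  end

  must_haves.each do |key|
    raise ValueError, "Expecting key #{key} in response" unless response.key?(key)
  end
end

check_response({ 'completion' => 'hi' }) # valid: passes silently
begin
  check_response({ 'error' => { 'code' => 'invalid_api_key' } })
rescue KeyError => e
  puts e.message
end
```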

#client(prompt:, inputs: {}, **kwargs) ⇒ Object

Get an answer from the engine.

Parameters:

  • prompt (String)

    The prompt to use when asking the engine.

  • anthropic_api_key (String)

    Optional API key to use when asking the engine (passed via kwargs). Defaults to Boxcars.configuration.anthropic_api_key.

  • kwargs (Hash)

    Additional parameters to pass to the engine if wanted.



# File 'lib/boxcars/engine/anthropic.rb', line 48

def client(prompt:, inputs: {}, **kwargs)
  model_params = llm_params.merge(kwargs)
  api_key = Boxcars.configuration.anthropic_api_key(**kwargs)
  aclient = anthropic_client(anthropic_api_key: api_key)
  prompt = prompt.first if prompt.is_a?(Array)

  if conversation_model?(model_params[:model])
    params = convert_to_anthropic(prompt.as_messages(inputs).merge(model_params))
    if Boxcars.configuration.log_prompts
      Boxcars.debug(params[:messages].last(2).map { |p| ">>>>>> Role: #{p[:role]} <<<<<<\n#{p[:content]}" }.join("\n"), :cyan)
    end
    response = aclient.messages(parameters: params)
    response['completion'] = response.dig('content', 0, 'text')
    response.delete('content')
    response
  else
    params = prompt.as_prompt(inputs: inputs, prefixes: default_prefixes, show_roles: true).merge(model_params)
    params[:prompt] = "\n\n#{params[:prompt]}" unless params[:prompt].start_with?("\n\n")
    params[:stop_sequences] = params.delete(:stop) if params.key?(:stop)
    Boxcars.debug("Prompt after formatting:#{params[:prompt]}", :cyan) if Boxcars.configuration.log_prompts
    aclient.complete(parameters: params)
  end
end

#combine_assistant(params) ⇒ Object



# File 'lib/boxcars/engine/anthropic.rb', line 203

def combine_assistant(params)
  params[:messages] = combine_assistant_entries(params[:messages])
  params[:messages].last[:content].rstrip! if params[:messages].last[:role] == :assistant
  params
end

#combine_assistant_entries(hashes) ⇒ Object

If we have multiple assistant entries in a row, we need to combine them.



# File 'lib/boxcars/engine/anthropic.rb', line 210

def combine_assistant_entries(hashes)
  combined_hashes = []
  hashes.each do |hash|
    if combined_hashes.empty? || combined_hashes.last[:role] != :assistant || hash[:role] != :assistant
      combined_hashes << hash
    else
      combined_hashes.last[:content].concat("\n", hash[:content].rstrip)
    end
  end
  combined_hashes
end
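The merging behaviour is easiest to see with a small standalone copy of the method and a hand-built message list:

```ruby
# Standalone copy of combine_assistant_entries: consecutive :assistant
# messages are merged into one, joined by a newline, with trailing
# whitespace stripped from the appended content.
def combine_assistant_entries(hashes)
  combined = []
  hashes.each do |hash|
    if combined.empty? || combined.last[:role] != :assistant || hash[:role] != :assistant
      combined << hash
    else
      combined.last[:content].concat("\n", hash[:content].rstrip)
    end
  end
  combined
end

messages = [
  { role: :user, content: "Hi" },
  { role: :assistant, content: "Hello" },
  { role: :assistant, content: "there  " }
]
puts combine_assistant_entries(messages).length # => 2
```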

#conversation_model?(model) ⇒ Boolean

Returns:

  • (Boolean)


# File 'lib/boxcars/engine/anthropic.rb', line 35

def conversation_model?(model)
  @conversation_model ||= (extract_model_version(model) > 3.49)
end

#convert_to_anthropic(params) ⇒ Object

Convert generic parameters to Anthropic-specific ones.



# File 'lib/boxcars/engine/anthropic.rb', line 196

def convert_to_anthropic(params)
  params[:stop_sequences] = params.delete(:stop) if params.key?(:stop)
  params[:system] = params[:messages].shift[:content] if params.dig(:messages, 0, :role) == :system
  params[:messages].pop if params[:messages].last[:content].blank?
  combine_assistant(params)
end
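A self-contained sketch of the same transformation, using `empty?` in place of ActiveSupport’s `blank?` and omitting the `combine_assistant` step so it runs standalone:

```ruby
# Sketch of the parameter conversion: :stop becomes :stop_sequences, a
# leading :system message moves to the top-level :system key, and a
# trailing empty message is dropped.
def convert_to_anthropic(params)
  params[:stop_sequences] = params.delete(:stop) if params.key?(:stop)
  params[:system] = params[:messages].shift[:content] if params.dig(:messages, 0, :role) == :system
  params[:messages].pop if params[:messages].last[:content].to_s.empty?
  params
end

params = convert_to_anthropic(
  stop: ["\n\n"],
  messages: [
    { role: :system, content: "You are terse." },
    { role: :user, content: "Hi" },
    { role: :assistant, content: "" }
  ]
)
```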

#default_params ⇒ Object

Get the default parameters for the engine.



# File 'lib/boxcars/engine/anthropic.rb', line 88

def default_params
  llm_params
end

#default_prefixes ⇒ Object



# File 'lib/boxcars/engine/anthropic.rb', line 222

def default_prefixes
  { system: "Human: ", user: "Human: ", assistant: "Assistant: ", history: :history }
end

#engine_type ⇒ Object

The engine type.



# File 'lib/boxcars/engine/anthropic.rb', line 156

def engine_type
  "claude"
end

#extract_model_version(model_string) ⇒ Object

Raises:



# File 'lib/boxcars/engine/anthropic.rb', line 182

def extract_model_version(model_string)
  # Use a regular expression to find the version number
  match = model_string.match(/claude-(\d+)(?:-(\d+))?/)

  raise ArgumentError, "No version number found in model string: #{model_string}" unless match

  major = match[1].to_i
  minor = match[2].to_i

  # Combine major and minor versions
  major + (minor.to_f / 10)
end
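The regex-based version parsing (and the 3.49 cutoff used by `conversation_model?`) can be checked standalone:

```ruby
# Standalone copy of the version extraction: "claude-3-5-..." parses as
# major 3, minor 5, combined into 3.5. Models above 3.49 use the
# conversation (messages) API; a missing version raises ArgumentError.
def extract_model_version(model_string)
  match = model_string.match(/claude-(\d+)(?:-(\d+))?/)

  raise ArgumentError, "No version number found in model string: #{model_string}" unless match

  # nil.to_i is 0, so "claude-2" yields 2.0
  match[1].to_i + (match[2].to_i.to_f / 10)
end

puts extract_model_version("claude-3-5-sonnet-20240620") # => 3.5
puts extract_model_version("claude-2")                   # => 2.0
```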

#generate(prompts:, stop: nil) ⇒ EngineResult

Call out to Anthropic’s endpoint with k unique prompts.

Parameters:

  • prompts (Array<String>)

    The prompts to pass into the model.

  • inputs (Array<String>)

    The inputs to substitute into the prompt.

  • stop (Array<String>) (defaults to: nil)

    Optional list of stop words to use when generating.

Returns:



# File 'lib/boxcars/engine/anthropic.rb', line 131

def generate(prompts:, stop: nil)
  params = {}
  params[:stop] = stop if stop
  choices = []
  # Get the token usage from the response.
  # Includes prompt, completion, and total tokens used.
  prompts.each_slice(batch_size) do |sub_prompts|
    sub_prompts.each do |sprompts, inputs|
      response = client(prompt: sprompts, inputs: inputs, **params)
      check_response(response)
      choices << response
    end
  end

  n = params.fetch(:n, 1)
  generations = []
  prompts.each_with_index do |_prompt, i|
    sub_choices = choices[i * n, n]
    generations.push(generation_info(sub_choices))
  end
  EngineResult.new(generations: generations, engine_output: { token_usage: {} })
end

#generation_info(sub_choices) ⇒ Array<Generation>

Get generation information.

Parameters:

  • sub_choices (Array<Hash>)

    The choices to get generation info for.

Returns:

  • (Array<Generation>)

    The generation information.



# File 'lib/boxcars/engine/anthropic.rb', line 95

def generation_info(sub_choices)
  sub_choices.map do |choice|
    Generation.new(
      text: choice["completion"],
      generation_info: {
        finish_reason: choice.fetch("stop_reason", nil),
        logprobs: choice.fetch("logprobs", nil)
      }
    )
  end
end

#get_num_tokens(text:) ⇒ Object

Calculate the number of tokens used.



# File 'lib/boxcars/engine/anthropic.rb', line 161

def get_num_tokens(text:)
  text.split.length # TODO: hook up to token counting gem
end

#max_tokens_for_prompt(prompt_text) ⇒ Integer

Calculate the maximum number of tokens possible to generate for a prompt.

Parameters:

  • prompt_text (String)

    The prompt text to use.

Returns:

  • (Integer)

    the number of tokens possible to generate.



# File 'lib/boxcars/engine/anthropic.rb', line 174

def max_tokens_for_prompt(prompt_text)
  num_tokens = get_num_tokens(text: prompt_text)

  # get max context size for model by name
  max_size = modelname_to_contextsize(model_name)
  max_size - num_tokens
end
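With the whitespace-split approximation above and the fixed 100,000-token context size from `modelname_to_contextsize`, the remaining budget is simple arithmetic. A self-contained sketch (the `context_size:` keyword here replaces the model-name lookup):

```ruby
# Sketch of the token budget: a whitespace-split word count stands in
# for a real tokenizer, and the context size is a fixed 100,000.
def get_num_tokens(text:)
  text.split.length
end

def max_tokens_for_prompt(prompt_text, context_size: 100_000)
  context_size - get_num_tokens(text: prompt_text)
end

# 6 "tokens" leaves 99,994 of the 100,000 budget
puts max_tokens_for_prompt("What is the capital of France?") # => 99994
```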

#modelname_to_contextsize(_modelname) ⇒ Object

Look up the context size for a model by name.

Parameters:

  • modelname (String)

    The name of the model to lookup.



# File 'lib/boxcars/engine/anthropic.rb', line 167

def modelname_to_contextsize(_modelname)
  100000
end

#run(question, **kwargs) ⇒ Object

Get an answer from the engine for a question.

Parameters:

  • question (String)

    The question to ask the engine.

  • kwargs (Hash)

    Additional parameters to pass to the engine if wanted.

Raises:



# File 'lib/boxcars/engine/anthropic.rb', line 75

def run(question, **kwargs)
  prompt = Prompt.new(template: question)
  response = client(prompt: prompt, **kwargs)

  raise Error, "Anthropic: No response from API" unless response
  raise Error, "Anthropic: #{response['error']}" if response['error']

  answer = response['completion']
  Boxcars.debug(response, :yellow)
  answer
end