Class: DogTrainer::API

Inherits:

Object

Object
DogTrainer::API

show all

Includes:: Logging

Defined in:: lib/dogtrainer/api.rb

Overview

Helper methods to upsert/ensure existence and configuration of DataDog Monitors, TimeBoards and ScreenBoards.

Instance Method Summary collapse

#check_dog_result(r, accepted_codes = ['200']) ⇒ Object

Check the result of a Dogapi::Client call.
#create_monitor(_mon_name, mon_params) ⇒ Object

Create a monitor that doesn’t already exist; return its id.
#generate_messages(metric_desc, comparison, mon_type) ⇒ Object

Given the name of a metric we’re monitoring and the comparison method, generate alert messages for the monitor.
#get_existing_monitor_by_name(mon_name) ⇒ Object

Get all monitors from DataDog; return the one named “mon_name“ or nil.
#get_existing_screenboard_by_name(dash_name) ⇒ Object

get all screenboards from DataDog; return the one named “dash_name“ or nil returns the screenboard definition hash from the DataDog API.
#get_existing_timeboard_by_name(dash_name) ⇒ Object

get all timeboards from DataDog; return the one named “dash_name“ or nil returns the timeboard definition hash from the DataDog API.
#get_git_url_for_directory(dir_path) ⇒ Object

Given the path to a directory on disk that may be a git repository, return the URL to its first remote, or nil otherwise.
#get_monitors ⇒ Object

Get all monitors from DataDog, caching them in an instance variable.
#get_repo_path ⇒ Object

Return a human-usable string identifying where to make changes to the resources created by this class.
#graphdef(title, queries, markers = {}) ⇒ Object

Create a graph definition (graphdef) to use with Boards APIs.
#initialize(api_key, app_key, notify_to, repo_path = nil) ⇒ API constructor

Initialize class; set instance configuration.
#mute_monitor_by_id(mon_id, options = { end_timestamp: nil }) ⇒ Object

Mute the monitor identified by the specified unique ID, with an optional duration.
#mute_monitor_by_name(mon_name, options = { end_timestamp: nil }) ⇒ Object

Mute the monitor identified by the specified name, with an optional duration.
#mute_monitors_by_regex(mon_name_regex, options = { end_timestamp: nil }) ⇒ Object

Mute all monitors with names matching the specified regex, with an optional duration.
#params_for_monitor(name, message, query, threshold, options = { escalation_message: nil, alert_no_data: true, mon_type: 'metric alert', renotify_interval: 60, no_data_timeframe: 20, evaluation_delay: nil }) ⇒ Object

Return a hash of parameters for a monitor with the specified configuration.
#unmute_monitor_by_id(mon_id) ⇒ Object

Unute the monitor identified by the specified unique ID.
#unmute_monitor_by_name(mon_name) ⇒ Object

Unmute the monitor identified by the specified name.
#unmute_monitors_by_regex(mon_name_regex) ⇒ Object

Unmute all monitors with names matching the specified regex.
#upsert_monitor(mon_name, query, threshold, comparator, options = { alert_no_data: true, mon_type: 'metric alert', renotify_interval: 60, no_data_timeframe: 20, evaluation_delay: nil, message: nil }) ⇒ Object

Create or update a monitor in DataDog with the given name and data/params.
#upsert_screenboard(dash_name, widgets) ⇒ Object

Create or update a screenboard in DataDog with the given name and data/params.
#upsert_timeboard(dash_name, graphs) ⇒ Object

Create or update a timeboard in DataDog with the given name and data/params.

Methods included from Logging

debug_formatter, default_formatter, default_outputter, #logger, #logger_name

Constructor Details

#initialize(api_key, app_key, notify_to, repo_path = nil) ⇒ `API`

Initialize class; set instance configuration.

Parameters:

api_key (String) —

DataDog API Key
app_key (String) —

DataDog Application Key
notify_to (String) —

DataDog notification recpipent string for monitors. This is generally one or more @-prefixed DataDog users or notification recipients. It can be set to nil if you are only managing screenboards and timeboards. For further information, see: docs.datadoghq.com/monitoring/#notifications
repo_path (String) (defaults to: nil) —

Git or HTTP URL to the repository containing code that calls this class. Will be added to notification messages so that humans know where to make changes to monitors. If nil, the return value of #get_repo_path

# File 'lib/dogtrainer/api.rb', line 25

def initialize(api_key, app_key, notify_to, repo_path = nil)
  logger.debug 'initializing DataDog API client'
  @dog = Dogapi::Client.new(api_key, app_key)
  @monitors = nil
  @timeboards = nil
  @screenboards = nil
  @notify_to = notify_to
  if repo_path.nil?
    @repo_path = get_repo_path
    logger.debug "using repo_path: #{@repo_path}"
  else
    @repo_path = repo_path
  end
end

Instance Method Details

#check_dog_result(r, accepted_codes = ['200']) ⇒ `Object`

Check the result of a Dogapi::Client call.

Dogapi::Client returns responses as arrays, with the first element being the HTTP response code and the second element being the actual response.

Check the specified

Parameters:

r (Array) —

the Dogapi result/response
accepted_codes (Array) (defaults to: ['200']) —

Array of acceptable (success) HTTP codes

Raises:

(DogApiException) —

if the response code indicates an error



50
51
52

# File 'lib/dogtrainer/api.rb', line 50

def check_dog_result(r, accepted_codes = ['200'])
  raise DogApiException, r unless accepted_codes.include?(r[0])
end

#create_monitor(_mon_name, mon_params) ⇒ `Object`

Create a monitor that doesn’t already exist; return its id

Parameters:

_mon_name (String) —

mane of the monitor to create
mon_params (Hash) —

params to pass to the DataDog API call. Must include “type” and “query” keys.

# File 'lib/dogtrainer/api.rb', line 361

def create_monitor(_mon_name, mon_params)
  res = @dog.monitor(mon_params['type'], mon_params['query'], mon_params)
  if res[0] == '200'
    logger.info "\tMonitor #{res[1]['id']} created successfully"
    return res[1]['id']
  else
    logger.error "\tError creating monitor: #{res}"
  end
end

#generate_messages(metric_desc, comparison, mon_type) ⇒ `Object`

Given the name of a metric we’re monitoring and the comparison method, generate alert messages for the monitor.

This method is intended for internal use by the class, but can be overridden if the implementation is not desired.

Parameters:

metric_desc (String) —

description/name of the metric being monitored.
comparison (String) —

comparison operator or description for metric vs threshold; i.e. “>=”, “<=”, “=”, “<”, etc.
mon_type (Hash) —

a customizable set of options

Options Hash (mon_type):

type (String) —

of monitor as defined in DataDog API docs.

# File 'lib/dogtrainer/api.rb', line 117

def generate_messages(metric_desc, comparison, mon_type)
  if mon_type == 'service check'
    message = [
      "{{#is_alert}}'#{metric_desc}' is FAILING: {{check_message}}",
      "{{/is_alert}}\n",
      "{{#is_warning}}'#{metric_desc}' is WARNING: {{check_message}}",
      "{{/is_warning}}\n",
      "{{#is_recovery}}'#{metric_desc}' recovered: {{check_message}}",
      "{{/is_recovery}}\n",
      "{{#is_no_data}}'#{metric_desc}' is not reporting data",
      "{{/is_no_data}}\n",
      # repo path and notify to
      '(monitor and threshold configuration for this alert is managed by ',
      "#{@repo_path}) #{@notify_to}"
    ].join('')
    escalation = "'#{metric_desc}' is still in error state: " \
      '{{check_message}}'
    return [message, escalation]
  end
  message = [
    "{{#is_alert}}'#{metric_desc}' should be #{comparison} {{threshold}}, ",
    "but is {{value}}.{{/is_alert}}\n",
    "{{#is_recovery}}'#{metric_desc}' recovered  (current value {{value}} ",
    "is #{comparison} threshold of {{threshold}}).{{/is_recovery}}\n",
    '(monitor and threshold configuration for this alert is managed by ',
    "#{@repo_path}) #{@notify_to}"
  ].join('')
  escalation = "'#{metric_desc}' is still in error state (current value " \
    "{{value}} is #{comparison} threshold of {{threshold}})"
  [message, escalation]
end

#get_existing_monitor_by_name(mon_name) ⇒ `Object`

Get all monitors from DataDog; return the one named “mon_name“ or nil

This caches all monitors from DataDog in an instance variable.

Parameters:

mon_name (String) —

name of the monitor to return

# File 'lib/dogtrainer/api.rb', line 376

def get_existing_monitor_by_name(mon_name)
  get_monitors.each do |mon|
    return mon if mon['name'] == mon_name
  end
  nil
end

#get_existing_screenboard_by_name(dash_name) ⇒ `Object`

get all screenboards from DataDog; return the one named “dash_name“ or nil returns the screenboard definition hash from the DataDog API

# File 'lib/dogtrainer/api.rb', line 688

def get_existing_screenboard_by_name(dash_name)
  if @screenboards.nil?
    @screenboards = @dog.get_all_screenboards
    puts "Found #{@screenboards[1]['screenboards'].length} existing " \
      'screenboards in DataDog'
    if @screenboards[1]['screenboards'].empty?
      puts 'ERROR: Docker API call returned no existing screenboards. ' \
        'Something is wrong.'
      exit 1
    end
  end
  @screenboards[1]['screenboards'].each do |dash|
    return @dog.get_screenboard(dash['id'])[1] if dash['title'] == dash_name
  end
  nil
end

#get_existing_timeboard_by_name(dash_name) ⇒ `Object`

get all timeboards from DataDog; return the one named “dash_name“ or nil returns the timeboard definition hash from the DataDog API

# File 'lib/dogtrainer/api.rb', line 669

def get_existing_timeboard_by_name(dash_name)
  if @timeboards.nil?
    @timeboards = @dog.get_dashboards
    puts "Found #{@timeboards[1]['dashes'].length} existing timeboards " \
      'in DataDog'
    if @timeboards[1]['dashes'].empty?
      puts 'ERROR: Docker API call returned no existing timeboards. ' \
        'Something is wrong.'
      exit 1
    end
  end
  @timeboards[1]['dashes'].each do |dash|
    return @dog.get_dashboard(dash['id'])[1] if dash['title'] == dash_name
  end
  nil
end

#get_git_url_for_directory(dir_path) ⇒ `Object`

Given the path to a directory on disk that may be a git repository, return the URL to its first remote, or nil otherwise.

Parameters:

dir_path (String) —

Path to possible git repository

# File 'lib/dogtrainer/api.rb', line 84

def get_git_url_for_directory(dir_path)
  logger.debug "trying to find git remote for: #{dir_path}"
  conf = nil
  Dir.chdir(dir_path) do
    begin
      conf = `git config --local -l`
    rescue
      conf = nil
    end
  end
  return nil if conf.nil?
  conf.split("\n").each do |line|
    return Regexp.last_match(1) if line =~ /^remote\.[^\.]+\.url=(.+)/
  end
  nil
end

#get_monitors ⇒ `Object`

Get all monitors from DataDog, caching them in an instance variable.

# File 'lib/dogtrainer/api.rb', line 384

def get_monitors
  if @monitors.nil?
    @monitors = @dog.get_all_monitors(group_states: 'all')
    logger.info "Found #{@monitors[1].length} existing monitors in DataDog"
    if @monitors[1].empty?
      raise 'ERROR: DataDog API call returned no existing monitors. ' \
        'Something is wrong.'
    end
  end
  @monitors[1]
end

#get_repo_path ⇒ `Object`

Return a human-usable string identifying where to make changes to the resources created by this class. Returns the first of:

“GIT_URL“ environment variable, if set and not empty
“CIRCLE_REPOSITORY_URL“ environment variable, if set and not empty
If the code calling this class is part of a git repository on disk and “git“ is present on the system and in PATH, the URL of the first remote for the repository.

If none of these are found, an error will be raised.

# File 'lib/dogtrainer/api.rb', line 64

def get_repo_path
  %w[GIT_URL CIRCLE_REPOSITORY_URL].each do |vname|
    return ENV[vname] if ENV.has_key?(vname) && !ENV[vname].empty?
  end
  # try to find git repository
  # get the path to the calling code;
  #   caller[0] is #initialize, caller[1] is what instantiated the class
  path, = caller[1].partition(':')
  repo_path = get_git_url_for_directory(File.dirname(path))
  if repo_path.nil?
    raise 'Unable to determine source code path; please ' \
    'specify repo_path option to DogTrainer::API'
  end
  repo_path
end

#graphdef(title, queries, markers = {}) ⇒ `Object`

Create a graph definition (graphdef) to use with Boards APIs. For further information, see: docs.datadoghq.com/graphingjson/

Parameters:

title (String) —

title of the graph
queries (Array or String) —

a single string graph query, or an Array of graph query strings.
markers (Hash) (defaults to: {}) —

a hash of markers to set on the graph, in name => value format.

# File 'lib/dogtrainer/api.rb', line 544

def graphdef(title, queries, markers = {})
  queries = [queries] unless queries.is_a?(Array)
  d = {
    'definition' => {
      'viz' => 'timeseries',
      'requests' => []
    },
    'title' => title
  }
  queries.each do |q|
    d['definition']['requests'] << {
      'q' => q,
      'conditional_formats' => [],
      'type' => 'line'
    }
  end
  unless markers.empty?
    d['definition']['markers'] = []
    markers.each do |name, val|
      d['definition']['markers'] << {
        'type' => 'error dashed',
        'val' => val.to_s,
        'value' => "y = #{val}",
        'label' => "#{name}==#{val}"
      }
    end
  end
  d
end

#mute_monitor_by_id(mon_id, options = { end_timestamp: nil }) ⇒ `Object`

Mute the monitor identified by the specified unique ID, with an optional duration.

Examples:

mute monitor 12345 indefinitely

dog = DogTrainer::API.new(api_key, app_key, notify_to)
dog.mute_monitor_by_id(12345)

mute monitor 12345 until 2016-09-17 01:39:52-00:00

dog = DogTrainer::API.new(api_key, app_key, notify_to)
dog.mute_monitor_by_id(12345, end_timestamp: 1474076393)

Parameters:

mon_id (Integer) —

ID of the monitor to mute
options (Hash) (defaults to: { end_timestamp: nil })

Options Hash (options):

:end_timestamp (Integer) —

optional timestamp for when the mute should end; Integer POSIX timestamp.

Raises:

(DogApiException) —

if the Datadog API returns an error

# File 'lib/dogtrainer/api.rb', line 412

def mute_monitor_by_id(mon_id, options = { end_timestamp: nil })
  if options.fetch(:end_timestamp, nil).nil?
    logger.info "Muting monitor by ID #{mon_id}"
    check_dog_result(@dog.mute_monitor(mon_id))
  else
    end_ts = options[:end_timestamp]
    logger.info "Muting monitor by ID #{mon_id} until #{end_ts}"
    check_dog_result(@dog.mute_monitor(mon_id, end: end_ts))
  end
end

#mute_monitor_by_name(mon_name, options = { end_timestamp: nil }) ⇒ `Object`

Mute the monitor identified by the specified name, with an optional duration.

Examples:

mute monitor named ‘My Monitor’ indefinitely

dog = DogTrainer::API.new(api_key, app_key, notify_to)
dog.mute_monitor_by_name('My Monitor')

mute monitor named ‘My Monitor’ until 2016-09-17 01:39:52-00:00

dog = DogTrainer::API.new(api_key, app_key, notify_to)
dog.mute_monitor_by_name('My Monitor', end_timestamp: 1474076393)

Parameters:

mon_name (String) —

name of the monitor to mute
options (Hash) (defaults to: { end_timestamp: nil })

Options Hash (options):

:end_timestamp (Integer) —

optional timestamp for when the mute should end; Integer POSIX timestamp.

Raises:

(RuntimeError) —

raised if the specified monitor name can’t be found
(DogApiException) —

if the Datadog API returns an error

# File 'lib/dogtrainer/api.rb', line 440

def mute_monitor_by_name(mon_name, options = { end_timestamp: nil })
  mon = get_existing_monitor_by_name(mon_name)
  raise "ERROR: Could not find monitor with name #{mon_name}" if mon.nil?
  if options.fetch(:end_timestamp, nil).nil?
    logger.info "Muting monitor by name #{mon_name} (#{mon['id']})"
    check_dog_result(@dog.mute_monitor(mon['id']))
  else
    end_ts = options[:end_timestamp]
    logger.info "Muting monitor by name #{mon_name} (#{mon['id']}) " \
      "until #{end_ts}"
    check_dog_result(@dog.mute_monitor(mon['id'], end: end_ts))
  end
end

#mute_monitors_by_regex(mon_name_regex, options = { end_timestamp: nil }) ⇒ `Object`

Mute all monitors with names matching the specified regex, with an optional duration.

Examples:

mute monitors with names matching /myapp/ indefinitely

dog = DogTrainer::API.new(api_key, app_key, notify_to)
dog.mute_monitor_by_regex(/myapp/)

mute monitors with names containing ‘foo’ indefinitely

dog = DogTrainer::API.new(api_key, app_key, notify_to)
dog.mute_monitor_by_regex('foo')

mute monitors with names matching /myapp/ until 2016-09-17

01:39:52-00:00
 dog = DogTrainer::API.new(api_key, app_key, notify_to)
 dog.mute_monitor_by_regex(/myapp/, end_timestamp: 1474076393)

Parameters:

mon_name_regex (String) —

or [Regexp] regex to match monitor names against
options (Hash) (defaults to: { end_timestamp: nil })

Options Hash (options):

:end_timestamp (Integer) —

optional timestamp for when the mute should end; Integer POSIX timestamp.

# File 'lib/dogtrainer/api.rb', line 475

def mute_monitors_by_regex(mon_name_regex, options = { end_timestamp: nil })
  if mon_name_regex.class != Regexp
    mon_name_regex = Regexp.new(mon_name_regex)
  end
  if options.fetch(:end_timestamp, nil).nil?
    logger.info "Muting monitors by regex #{mon_name_regex.source}"
    end_ts = nil
  else
    logger.info "Muting monitors by regex #{mon_name_regex.source} " \
      "until #{end_ts}"
    end_ts = options[:end_timestamp]
  end
  logger.debug "Searching for monitors matching: #{mon_name_regex.source}"
  get_monitors.each do |mon|
    if mon['name'] =~ mon_name_regex
      logger.info "Muting monitor '#{mon['name']}' (#{mon['id']})"
      mute_monitor_by_id(mon['id'], end_timestamp: end_ts)
    end
  end
end

#params_for_monitor(name, message, query, threshold, options = { escalation_message: nil, alert_no_data: true, mon_type: 'metric alert', renotify_interval: 60, no_data_timeframe: 20, evaluation_delay: nil }) ⇒ `Object`

Return a hash of parameters for a monitor with the specified configuration. For further information, see: docs.datadoghq.com/api/#monitors

Parameters:

name (String) —

name for the monitor; must be unique per DataDog account
message (String) —

alert/notification message for the monitor
query (String) —

query for the monitor to evaluate
threshold (Float or Hash) —

evaluation threshold for the monitor; if a Float is passed, it will be provided as the “critical“ threshold; otherise, a Hash in the form taken by the DataDog API should be provided (“critical“, “warning“ and/or “ok“ keys, Float values)
options (Hash) (defaults to: { escalation_message: nil, alert_no_data: true, mon_type: 'metric alert', renotify_interval: 60, no_data_timeframe: 20, evaluation_delay: nil })

Options Hash (options):

:escalation_message (String) —

optional escalation message for escalation notifications. Defaults to nil.
:alert_no_data (Boolean) —

whether or not to alert on lack of data. Defaults to true.
:mon_type (String) —

type of monitor as defined in DataDog API docs. Defaults to ‘metric alert’.
:renotify_interval (Integer) —

the number of minutes after the last notification before a monitor will re-notify on the current status. It will re-notify only if not resolved. Default: 60. Set to nil to disable re-notification.
:no_data_timeframe (Integer) —

the number of minutes before a monitor will notify when data stops reporting. Must be at least 2x the monitor timeframe for metric alerts or 2 minutes for service checks. Defaults to 20 minutes; API default is 2x the monitor timeframe.
:evaluation_delay (Integer) — default: metric monitors only —

Time (in seconds) to delay evaluation, as a non-negative integer. For example, if the value is set to 300 (5min), the timeframe is set to last_5m and the time is 7:00, the monitor will evaluate data from 6:50 to 6:55. This is useful for AWS CloudWatch and other backfilled metrics to ensure the monitor will always have data during evaluation.

# File 'lib/dogtrainer/api.rb', line 182

def params_for_monitor(
  name,
  message,
  query,
  threshold,
  options = {
    escalation_message: nil,
    alert_no_data: true,
    mon_type: 'metric alert',
    renotify_interval: 60,
    no_data_timeframe: 20,
    evaluation_delay: nil
  }
)
  options[:alert_no_data] = true unless options.key?(:alert_no_data)
  options[:mon_type] = 'metric alert' unless options.key?(:mon_type)
  options[:renotify_interval] = 60 unless options.key?(:renotify_interval)
  options[:no_data_timeframe] = 20 unless options.key?(:no_data_timeframe)
  options[:evaluation_delay] = nil unless options.key?(:evaluation_delay)

  # handle threshold hash
  thresh = if threshold.is_a?(Hash)
             threshold
           else
             { 'critical' => threshold }
           end

  monitor_data = {
    'name' => name,
    'type' => options[:mon_type],
    'query' => query,
    'message' => message,
    'tags' => [],
    'options' => {
      'notify_audit' => false,
      'locked' => false,
      'timeout_h' => 0,
      'silenced' => {},
      'thresholds' => thresh,
      'require_full_window' => false,
      'notify_no_data' => options[:alert_no_data],
      'renotify_interval' => options[:renotify_interval],
      'no_data_timeframe' => options[:no_data_timeframe]
    }
  }
  unless options[:escalation_message].nil?
    monitor_data['options']['escalation_message'] = \
      options[:escalation_message]
  end
  unless options[:evaluation_delay].nil?
    monitor_data['options']['evaluation_delay'] = options[:evaluation_delay]
  end
  monitor_data
end

#unmute_monitor_by_id(mon_id) ⇒ `Object`

Unute the monitor identified by the specified unique ID.

Parameters:

mon_id (Integer) —

ID of the monitor to mute

Raises:

(DogApiException) —

if the Datadog API returns an error

# File 'lib/dogtrainer/api.rb', line 500

def unmute_monitor_by_id(mon_id)
  logger.info "Unmuting monitor by ID #{mon_id}"
  check_dog_result(@dog.unmute_monitor(mon_id, all_scopes: true))
end

#unmute_monitor_by_name(mon_name) ⇒ `Object`

Unmute the monitor identified by the specified name.

Parameters:

mon_name (String) —

name of the monitor to mute

Raises:

(RuntimeError) —

raised if the specified monitor name can’t be found

# File 'lib/dogtrainer/api.rb', line 509

def unmute_monitor_by_name(mon_name)
  mon = get_existing_monitor_by_name(mon_name)
  logger.info "Unmuting monitor by name #{mon_name}"
  raise "ERROR: Could not find monitor with name #{mon_name}" if mon.nil?
  unmute_monitor_by_id(mon['id'])
end

#unmute_monitors_by_regex(mon_name_regex) ⇒ `Object`

Unmute all monitors with names matching the specified regex.

Parameters:

mon_name_regex (String) —

regex to match monitor names against

# File 'lib/dogtrainer/api.rb', line 519

def unmute_monitors_by_regex(mon_name_regex)
  if mon_name_regex.class != Regexp
    mon_name_regex = Regexp.new(mon_name_regex)
  end
  logger.info "Unmuting monitors by regex #{mon_name_regex.source}"
  get_monitors.each do |mon|
    if mon['name'] =~ mon_name_regex
      logger.info "Unmuting monitor '#{mon['name']}' (#{mon['id']})"
      unmute_monitor_by_id(mon['id'])
    end
  end
end

#upsert_monitor(mon_name, query, threshold, comparator, options = { alert_no_data: true, mon_type: 'metric alert', renotify_interval: 60, no_data_timeframe: 20, evaluation_delay: nil, message: nil }) ⇒ `Object`

Create or update a monitor in DataDog with the given name and data/params. This method handles either creating the monitor if one with the same name doesn’t already exist in the specified DataDog account, or else updating an existing monitor with the same name if one exists but the parameters differ.

For further information on parameters and options, see: docs.datadoghq.com/api/#monitors

This method calls #generate_messages to build the notification messages and #params_for_monitor to generate the parameters.

Parameters:

mon_name (String) —

name for the monitor; must be unique per DataDog account
query (String) —

query for the monitor to evaluate
threshold (Float or Hash) —

evaluation threshold for the monitor; if a Float is passed, it will be provided as the “critical“ threshold; otherise, a Hash in the form taken by the DataDog API should be provided (“critical“, “warning“ and/or “ok“ keys, Float values)
comparator (String) —

comparison operator for metric vs threshold, describing the inverse of the query. I.e. if the query is checking for “< 100”, then the comparator would be “>=”.
options (Hash) (defaults to: { alert_no_data: true, mon_type: 'metric alert', renotify_interval: 60, no_data_timeframe: 20, evaluation_delay: nil, message: nil })

Options Hash (options):

:alert_no_data (Boolean) —

whether or not to alert on lack of data. Defaults to true.
:mon_type (String) —

type of monitor as defined in DataDog API docs. Defaults to ‘metric alert’.
:renotify_interval (Integer) —

the number of minutes after the last notification before a monitor will re-notify on the current status. It will re-notify only if not resolved. Default: 60. Set to nil to disable re-notification.
:no_data_timeframe (Integer) —

the number of minutes before a monitor will notify when data stops reporting. Must be at least 2x the monitor timeframe for metric alerts or 2 minutes for service checks. Defaults to 20 minutes; API default is 2x the monitor timeframe.
:evaluation_delay (Integer) — default: metric monitors only —

Time (in seconds) to delay evaluation, as a non-negative integer. For example, if the value is set to 300 (5min), the timeframe is set to last_5m and the time is 7:00, the monitor will evaluate data from 6:50 to 6:55. This is useful for AWS CloudWatch and other backfilled metrics to ensure the monitor will always have data during evaluation.
:message (String) —

alert/notification message for the monitor; if omitted, will be generated by #generate_messages
:escalation_message (String) —

optional escalation message for escalation notifications. If omitted, will be generated by #generate_messages; explicitly set to nil to not add an escalation message to the monitor.

# File 'lib/dogtrainer/api.rb', line 284

def upsert_monitor(
  mon_name,
  query,
  threshold,
  comparator,
  options = {
    alert_no_data: true,
    mon_type: 'metric alert',
    renotify_interval: 60,
    no_data_timeframe: 20,
    evaluation_delay: nil,
    message: nil
  }
)
  options[:alert_no_data] = true unless options.key?(:alert_no_data)
  options[:mon_type] = 'metric alert' unless options.key?(:mon_type)
  options[:renotify_interval] = 60 unless options.key?(:renotify_interval)
  options[:no_data_timeframe] = 20 unless options.key?(:no_data_timeframe)
  options[:evaluation_delay] = nil unless options.key?(:evaluation_delay)

  msg, esc = generate_messages(mon_name, comparator, options[:mon_type])
  message = if options[:message].nil?
              msg
            else
              options[:message]
            end
  escalation = if options.key?(:escalation_message)
                 options[:escalation_message]
               else
                 esc
               end

  rno = options[:renotify_interval]
  mon_params = params_for_monitor(
    mon_name, message, query, threshold,
    escalation_message: escalation,
    alert_no_data: options[:alert_no_data],
    mon_type: options[:mon_type],
    renotify_interval: rno,
    no_data_timeframe: options[:no_data_timeframe],
    evaluation_delay: options[:evaluation_delay]
  )
  logger.info "Upserting monitor: #{mon_name}"
  monitor = get_existing_monitor_by_name(mon_name)
  return create_monitor(mon_name, mon_params) if monitor.nil?
  logger.debug "\tfound existing monitor id=#{monitor['id']}"
  do_update = false
  mon_params.each do |k, _v|
    unless monitor.include?(k)
      logger.debug "\tneeds update based on missing key: #{k}"
      do_update = true
      break
    end
    next unless monitor[k] != mon_params[k]
    logger.debug "\tneeds update based on difference in key #{k}; " \
      "current='#{monitor[k]}' desired='#{mon_params[k]}'"
    do_update = true
    break
  end
  unless do_update
    logger.debug "\tmonitor is correct in DataDog."
    return monitor['id']
  end
  res = @dog.update_monitor(monitor['id'], mon_params['query'], mon_params)
  if res[0] == '200'
    logger.info "\tMonitor #{monitor['id']} updated successfully"
    return monitor['id']
  else
    logger.error "\tError updating monitor #{monitor['id']}: #{res}"
  end
end

#upsert_screenboard(dash_name, widgets) ⇒ `Object`

Create or update a screenboard in DataDog with the given name and data/params. For further information, see: docs.datadoghq.com/api/screenboards/ and docs.datadoghq.com/api/?lang=ruby#screenboards

Parameters:

dash_name (String) —

Account-unique dashboard name
widgets (Array) —

Array of Hash widget definitions to pass to the DataDog API. For further information, see: docs.datadoghq.com/api/screenboards/

Raises:

(DogApiException) —

if the Datadog API returns an error

# File 'lib/dogtrainer/api.rb', line 628

def upsert_screenboard(dash_name, widgets)
  logger.info "Upserting screenboard: #{dash_name}"
  desc = "created by DogTrainer RubyGem via #{@repo_path}"
  dash = get_existing_screenboard_by_name(dash_name)
  if dash.nil?
    d = @dog.create_screenboard(board_title: dash_name,
                                description: desc,
                                widgets: widgets)
    check_dog_result(d)
    logger.info "Created screenboard #{d[1]['id']}"
    return
  end
  logger.debug "\tfound existing screenboard id=#{dash['id']}"
  needs_update = false
  if dash['description'] != desc
    logger.debug "\tneeds update of description"
    needs_update = true
  end
  if dash['board_title'] != dash_name
    logger.debug "\tneeds update of title"
    needs_update = true
  end
  if dash['widgets'] != widgets
    logger.debug "\tneeds update of widgets"
    needs_update = true
  end

  if needs_update
    logger.info "\tUpdating screenboard #{dash['id']}"
    d = @dog.update_screenboard(dash['id'], board_title: dash_name,
                                            description: desc,
                                            widgets: widgets)
    check_dog_result(d)
    logger.info "\tScreenboard updated."
  else
    logger.info "\tScreenboard is up-to-date"
  end
end

#upsert_timeboard(dash_name, graphs) ⇒ `Object`

Create or update a timeboard in DataDog with the given name and data/params. For further information, see: docs.datadoghq.com/api/#timeboards

Parameters:

dash_name (String) —

Account-unique dashboard name
graphs (Array) —

Array of graphdefs to add to dashboard

Raises:

(DogApiException) —

if the Datadog API returns an error

# File 'lib/dogtrainer/api.rb', line 581

def upsert_timeboard(dash_name, graphs)
  logger.info "Upserting timeboard: #{dash_name}"
  desc = "created by DogTrainer RubyGem via #{@repo_path}"
  dash = get_existing_timeboard_by_name(dash_name)
  if dash.nil?
    d = @dog.create_dashboard(dash_name, desc, graphs)
    check_dog_result(d)
    logger.info "Created timeboard #{d[1]['dash']['id']}"
    return
  end
  logger.debug "\tfound existing timeboard id=#{dash['dash']['id']}"
  needs_update = false
  if dash['dash']['description'] != desc
    logger.debug "\tneeds update of description"
    needs_update = true
  end
  if dash['dash']['title'] != dash_name
    logger.debug "\tneeds update of title"
    needs_update = true
  end
  if dash['dash']['graphs'] != graphs
    logger.debug "\tneeds update of graphs"
    needs_update = true
  end

  if needs_update
    logger.info "\tUpdating timeboard #{dash['dash']['id']}"
    d = @dog.update_dashboard(
      dash['dash']['id'], dash_name, desc, graphs
    )
    check_dog_result(d)
    logger.info "\tTimeboard updated."
  else
    logger.info "\tTimeboard is up-to-date"
  end
end

Class: DogTrainer::API

Overview

Instance Method Summary collapse

Methods included from Logging

Constructor Details

#initialize(api_key, app_key, notify_to, repo_path = nil) ⇒ API

Instance Method Details

#check_dog_result(r, accepted_codes = ['200']) ⇒ Object

#create_monitor(_mon_name, mon_params) ⇒ Object

#generate_messages(metric_desc, comparison, mon_type) ⇒ Object

#get_existing_monitor_by_name(mon_name) ⇒ Object

#get_existing_screenboard_by_name(dash_name) ⇒ Object

#get_existing_timeboard_by_name(dash_name) ⇒ Object

#get_git_url_for_directory(dir_path) ⇒ Object

#get_monitors ⇒ Object

#get_repo_path ⇒ Object

#graphdef(title, queries, markers = {}) ⇒ Object

#mute_monitor_by_id(mon_id, options = { end_timestamp: nil }) ⇒ Object

#mute_monitor_by_name(mon_name, options = { end_timestamp: nil }) ⇒ Object

#mute_monitors_by_regex(mon_name_regex, options = { end_timestamp: nil }) ⇒ Object

#params_for_monitor(name, message, query, threshold, options = { escalation_message: nil, alert_no_data: true, mon_type: 'metric alert', renotify_interval: 60, no_data_timeframe: 20, evaluation_delay: nil }) ⇒ Object

#unmute_monitor_by_id(mon_id) ⇒ Object

#unmute_monitor_by_name(mon_name) ⇒ Object

#unmute_monitors_by_regex(mon_name_regex) ⇒ Object

#upsert_monitor(mon_name, query, threshold, comparator, options = { alert_no_data: true, mon_type: 'metric alert', renotify_interval: 60, no_data_timeframe: 20, evaluation_delay: nil, message: nil }) ⇒ Object

#upsert_screenboard(dash_name, widgets) ⇒ Object

#upsert_timeboard(dash_name, graphs) ⇒ Object

#initialize(api_key, app_key, notify_to, repo_path = nil) ⇒ `API`

#check_dog_result(r, accepted_codes = ['200']) ⇒ `Object`

#create_monitor(_mon_name, mon_params) ⇒ `Object`

#generate_messages(metric_desc, comparison, mon_type) ⇒ `Object`

#get_existing_monitor_by_name(mon_name) ⇒ `Object`

#get_existing_screenboard_by_name(dash_name) ⇒ `Object`

#get_existing_timeboard_by_name(dash_name) ⇒ `Object`

#get_git_url_for_directory(dir_path) ⇒ `Object`

#get_monitors ⇒ `Object`

#get_repo_path ⇒ `Object`

#graphdef(title, queries, markers = {}) ⇒ `Object`

#mute_monitor_by_id(mon_id, options = { end_timestamp: nil }) ⇒ `Object`

#mute_monitor_by_name(mon_name, options = { end_timestamp: nil }) ⇒ `Object`

#mute_monitors_by_regex(mon_name_regex, options = { end_timestamp: nil }) ⇒ `Object`

#params_for_monitor(name, message, query, threshold, options = { escalation_message: nil, alert_no_data: true, mon_type: 'metric alert', renotify_interval: 60, no_data_timeframe: 20, evaluation_delay: nil }) ⇒ `Object`

#unmute_monitor_by_id(mon_id) ⇒ `Object`

#unmute_monitor_by_name(mon_name) ⇒ `Object`

#unmute_monitors_by_regex(mon_name_regex) ⇒ `Object`

#upsert_monitor(mon_name, query, threshold, comparator, options = { alert_no_data: true, mon_type: 'metric alert', renotify_interval: 60, no_data_timeframe: 20, evaluation_delay: nil, message: nil }) ⇒ `Object`

#upsert_screenboard(dash_name, widgets) ⇒ `Object`

#upsert_timeboard(dash_name, graphs) ⇒ `Object`