Class: WWW::Mechanize::Page::Meta

Inherits:
Link
  • Object
show all
Defined in:
lib/www/mechanize/page/meta.rb

Overview

This class encapsulates a Meta tag. Mechanize treats meta tags just like ‘a’ tags. Meta objects will contain links, but most likely will have no text.

Constant Summary collapse

CONTENT_REGEXP =

Matches the content attribute of a meta tag. After the match:

$1:: delay
$3:: url
/^\s*(\d+\.?\d*)(;|;\s*url=\s*['"]?(\S*?)['"]?)?\s*$/i

Instance Attribute Summary

Attributes inherited from Link

#attributes, #href, #node, #page, #text

Class Method Summary collapse

Methods inherited from Link

#click, #initialize, #pretty_print, #uri

Constructor Details

This class inherits a constructor from WWW::Mechanize::Page::Link

Class Method Details

.parse(content, uri) ⇒ Object

Parses the delay and url from the content attribute of a meta tag. Parse requires the uri of the current page to infer a url when no url is specified. If a block is given, the parsed delay and url will be passed to it for further processing.

Returns nil if the delay and url cannot be parsed.

# <meta http-equiv="refresh" content="5;url=http://example.com/" />
uri = URI.parse('http://current.com/')

Meta.parse("5;url=http://example.com/", uri)  # => ['5', 'http://example.com/']
Meta.parse("5;url=", uri)                     # => ['5', 'http://current.com/']
Meta.parse("5", uri)                          # => ['5', 'http://current.com/']
Meta.parse("invalid content", uri)            # => nil


32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
# File 'lib/www/mechanize/page/meta.rb', line 32

def parse(content, uri)
  if content && content =~ CONTENT_REGEXP
    delay, url = $1, $3

    url = case url
    when nil, "" then uri.to_s
    when /^http/i then url
    else "http://#{uri.host}#{url}"
    end

    block_given? ? yield(delay, url) : [delay, url]
  else
    nil
  end
end