Method: Wgit::Crawler.supported_file_extensions

Defined in:
lib/wgit/crawler.rb

.supported_file_extensionsObject (readonly)

The URL file extensions (from <a> hrefs) which will be crawled by #crawl_site. The idea is to omit anything that isn't HTML and therefore doesn't keep the crawl of the site going. All URL's without a file extension will be crawled, because they're assumed to be HTML. The #crawl method will crawl anything since it's given the URL(s). You can add your own site's URL file extension e.g. Wgit::Crawler.supported_file_extensions << 'html5' etc.



32
33
34
# File 'lib/wgit/crawler.rb', line 32

def supported_file_extensions
  @supported_file_extensions
end