Method: Wgit::Crawler.supported_file_extensions
- Defined in:
- lib/wgit/crawler.rb
.supported_file_extensions ⇒ Object (readonly)
The URL file extensions (from <a> hrefs) which will be crawled by
#crawl_site. The idea is to omit anything that isn't HTML and therefore
doesn't keep the crawl of the site going. All URL's without a file
extension will be crawled, because they're assumed to be HTML.
The #crawl method will crawl anything since it's given the URL(s).
You can add your own site's URL file extension e.g.
Wgit::Crawler.supported_file_extensions << 'html5' etc.
32 33 34 |
# File 'lib/wgit/crawler.rb', line 32 def supported_file_extensions @supported_file_extensions end |