Module: NewsScraper::Trainer
- Extended by:
- Trainer
- Included in:
- Trainer
- Defined in:
- lib/news_scraper/trainer.rb,
lib/news_scraper/trainer/url_trainer.rb,
lib/news_scraper/trainer/preset_selector.rb
Defined Under Namespace
Classes: PresetSelector, UrlTrainer
Instance Method Summary collapse
-
#train(query: '') ⇒ Object
Fetches articles from Extraction sources and trains on the results.
Instance Method Details
#train(query: '') ⇒ Object
Fetches articles from Extraction sources and trains on the results
Training is a process where we take an untrained url (root domain is not in article_scrape_patterns.yml
) and determine patterns and methods to match the data_types listed in article_scrape_patterns.yml
, then record them to the article_scrape_patterns.yml
file
Params
-
query
: a keyword arugment specifying the query to train on
18 19 20 21 22 23 |
# File 'lib/news_scraper/trainer.rb', line 18 def train(query: '') article_urls = Extractors::GoogleNewsRss.new(query: query).extract article_urls.each do |url| Trainer::UrlTrainer.new(url).train end end |