Module: ThinkingSphinx::SearchMethods::ClassMethods
- Included in:
- ThinkingSphinx
- Defined in:
- lib/thinking_sphinx/search_methods.rb
Instance Method Summary collapse
- #count(*args) ⇒ Object
-
#facets(*args) ⇒ Object
Model.facets *args ThinkingSphinx.facets *args ThinkingSphinx.facets *args, :all_facets => true ThinkingSphinx.facets *args, :class_facet => false.
-
#search(*args) ⇒ Object
Searches through the Sphinx indexes for relevant matches.
- #search_context ⇒ Object
- #search_count(*args) ⇒ Object
-
#search_for_id(id, index, options = {}) ⇒ Object
Checks if a document with the given id exists within a specific index.
-
#search_for_ids(*args) ⇒ Object
Searches for results that match the parameters provided.
Instance Method Details
#count(*args) ⇒ Object
404 405 406 |
# File 'lib/thinking_sphinx/search_methods.rb', line 404 def count(*args) search_context ? super : search_count(*args) end |
#facets(*args) ⇒ Object
Model.facets *args ThinkingSphinx.facets *args ThinkingSphinx.facets *args, :all_facets => true ThinkingSphinx.facets *args, :class_facet => false
421 422 423 |
# File 'lib/thinking_sphinx/search_methods.rb', line 421 def facets(*args) ThinkingSphinx::FacetSearch.new *(args) end |
#search(*args) ⇒ Object
Searches through the Sphinx indexes for relevant matches. There’s various ways to search, sort, group and filter - which are covered below.
Also, if you have WillPaginate installed, the search method can be used just like paginate. The same parameters - :page and :per_page - work as expected, and the returned result set can be used by the will_paginate helper.
Basic Searching
The simplest way of searching is straight text.
ThinkingSphinx.search "pat"
ThinkingSphinx.search "google"
User.search "pat", :page => (params[:page] || 1)
Article.search "relevant news issue of the day"
If you specify :include, like in an #find call, this will be respected when loading the relevant models from the search results.
User.search "pat", :include => :posts
Match Modes
Sphinx supports 5 different matching modes. By default Thinking Sphinx uses :all, which unsurprisingly requires all the supplied search terms to match a result.
Alternative modes include:
User.search "pat allan", :match_mode => :any
User.search "pat allan", :match_mode => :phrase
User.search "pat | allan", :match_mode => :boolean
User.search "@name pat | @username pat", :match_mode => :extended
Any will find results with any of the search terms. Phrase treats the search terms a single phrase instead of individual words. Boolean and extended allow for more complex query syntax, refer to the sphinx documentation for further details.
Weighting
Sphinx has support for weighting, where matches in one field can be considered more important than in another. Weights are integers, with 1 as the default. They can be set per-search like this:
User.search "pat allan", :field_weights => { :alias => 4, :aka => 2 }
If you’re searching multiple models, you can set per-index weights:
ThinkingSphinx.search "pat", :index_weights => { User => 10 }
See sphinxsearch.com/doc.html#weighting for further details.
Searching by Fields
If you want to step it up a level, you can limit your search terms to specific fields:
User.search :conditions => {:name => "pat"}
This uses Sphinx’s extended match mode, unless you specify a different match mode explicitly (but then this way of searching won’t work). Also note that you don’t need to put in a search string.
Searching by Attributes
Also known as filters, you can limit your searches to documents that have specific values for their attributes. There are three ways to do this. The first two techniques work in all scenarios - using the :with or :with_all options.
ThinkingSphinx.search :with => {:tag_ids => 10}
ThinkingSphinx.search :with => {:tag_ids => [10,12]}
ThinkingSphinx.search :with_all => {:tag_ids => [10,12]}
The first :with search will match records with a tag_id attribute of 10. The second :with will match records with a tag_id attribute of 10 OR 12. If you need to find records that are tagged with ids 10 AND 12, you will need to use the :with_all search parameter. This is particuarly useful in conjunction with Multi Value Attributes (MVAs).
The third filtering technique is only viable if you’re searching with a specific model (not multi-model searching). With a single model, Thinking Sphinx can figure out what attributes and fields are available, so you can put it all in the :conditions hash, and it will sort it out.
Node.search :conditions => {:parent_id => 10}
Filters can be single values, arrays of values, or ranges.
Article.search "East Timor", :conditions => {:rating => 3..5}
Excluding by Attributes
Sphinx also supports negative filtering - where the filters are of attribute values to exclude. This is done with the :without option:
User.search :without => {:role_id => 1}
Excluding by Primary Key
There is a shortcut to exclude records by their ActiveRecord primary key:
User.search :without_ids => 1
Pass an array or a single value.
The primary key must be an integer as a negative filter is used. Note that for multi-model search, an id may occur in more than one model.
Infix (Star) Searching
Enable infix searching by something like this in config/sphinx.yml:
development:
enable_star: 1
min_infix_len: 2
Note that this will make indexing take longer.
With those settings (and after reindexing), wildcard asterisks can be used in queries:
Location.search "*elbourn*"
To automatically add asterisks around every token (but not operators), pass the :star option:
Location.search "elbourn -ustrali", :star => true,
:match_mode => :boolean
This would become “elbourn -ustrali”. The :star option only adds the asterisks. You need to make the config/sphinx.yml changes yourself.
By default, the tokens are assumed to match the regular expression /w+/u+. If you’ve modified the charset_table, pass another regular expression, e.g.
User.search("[email protected]", :star => /[\w@.]+/u)
to search for “*[email protected]*” and not “oo@bar.c”.
Sorting
Sphinx can only sort by attributes, so generally you will need to avoid using field names in your :order option. However, if you’re searching on a single model, and have specified some fields as sortable, you can use those field names and Thinking Sphinx will interpret accordingly. Remember: this will only happen for single-model searches, and only through the :order option.
Location.search "Melbourne", :order => :state
User.search :conditions => {:role_id => 2}, :order => "name ASC"
Keep in mind that if you use a string, you must specify the direction (ASC or DESC) else Sphinx won’t return any results. If you use a symbol then Thinking Sphinx assumes ASC, but if you wish to state otherwise, use the :sort_mode option:
Location.search "Melbourne", :order => :state, :sort_mode => :desc
Of course, there are other sort modes - check out the Sphinx documentation for that level of detail though.
If desired, you can sort by a column in your model instead of a sphinx field or attribute. This sort only applies to the current page, so is most useful when performing a search with a single page of results.
User.search("pat", :sql_order => "name")
Grouping
For this you can use the group_by, group_clause and group_function options - which are all directly linked to Sphinx’s expectations. No magic from Thinking Sphinx. It can get a little tricky, so make sure you read all the relevant documentation first.
Grouping is done via three parameters within the options hash
-
:group_function
determines the way grouping is done -
:group_by
determines the field which is used for grouping -
:group_clause
determines the sorting order
As a convenience, you can also use
-
:group
which sets :group_by and defaults to :group_function of :attr
group_function
Valid values for :group_function are
-
:day
,:week
,:month
,:year
- Grouping is done by the respective timeframes. -
:attr
,:attrpair
- Grouping is done by the specified attributes(s)
group_by
This parameter denotes the field by which grouping is done. Note that the specified field must be a sphinx attribute or index.
group_clause
This determines the sorting order of the groups. In a grouping search, the matches within a group will sorted by the :sort_mode
and :order
parameters. The group matches themselves however, will be sorted by :group_clause
.
The syntax for this is the same as an order parameter in extended sort mode. Namely, you can specify an SQL-like sort expression with up to 5 attributes (including internal attributes), eg: “@relevance DESC, price ASC, @id DESC”
Grouping by timestamp
Timestamp grouping groups off items by the day, week, month or year of the attribute given. In order to do this you need to define a timestamp attribute, which pretty much looks like the standard defintion for any attribute.
define_index do
#
# All your other stuff
#
has :created_at
end
When you need to fire off your search, it’ll go something to the tune of
Fruit.search "apricot", :group_function => :day,
:group_by => 'created_at'
The @groupby
special attribute will contain the date for that group. Depending on the :group_function
parameter, the date format will be:
-
:day
- YYYYMMDD -
:week
- YYYYNNN (NNN is the first day of the week in question, counting from the start of the year ) -
:month
- YYYYMM -
:year
- YYYY
Grouping by attribute
The syntax is the same as grouping by timestamp, except for the fact that the :group_function
parameter is changed.
Fruit.search "apricot", :group_function => :attr, :group_by => 'size'
Geo/Location Searching
Sphinx - and therefore Thinking Sphinx - has the facility to search around a geographical point, using a given latitude and longitude. To take advantage of this, you will need to have both of those values in attributes. To search with that point, you can then use one of the following syntax examples:
Address.search "Melbourne", :geo => [1.4, -2.217],
:order => "@geodist asc"
Address.search "Australia", :geo => [-0.55, 3.108],
:order => "@geodist asc" :latitude_attr => "latit",
:longitude_attr => "longit"
The first example applies when your latitude and longitude attributes are named any of lat, latitude, lon, long or longitude. If that’s not the case, you will need to explicitly state them in your search, or you can do so in your model:
define_index do
has :latit # Float column, stored in radians
has :longit # Float column, stored in radians
set_property :latitude_attr => "latit"
set_property :longitude_attr => "longit"
end
Now, geo-location searching really only has an affect if you have a filter, sort or grouping clause related to it - otherwise it’s just a normal search, and _will not_ return a distance value otherwise. To make use of the positioning difference, use the special attribute “@geodist” in any of your filters or sorting or grouping clauses.
And don’t forget - both the latitude and longitude you use in your search, and the values in your indexes, need to be stored as a float in radians, not degrees. Keep in mind that if you do this conversion in SQL you will need to explicitly declare a column type of :float.
define_index do
has 'RADIANS(lat)', :as => :lat, :type => :float
# ...
end
Once you’ve got your results set, you can access the distances as follows:
@results.each_with_geodist do |result, distance|
# ...
end
The distance value is returned as a float, representing the distance in metres.
Filtering by custom attributes
Do note that this applies only to sphinx 0.9.9
Should you find yourself in desperate need of a filter that involves selecting either one of multiple conditions, one solution could be provided by the :sphinx_select option within the search. This handles which fields are selected by sphinx from its store.
The default value is “*”, and you can add custom fields using syntax similar to sql:
Flower.search "foo",
:sphinx_select => "*, petals < 1 or color = 2 as grass"
This will add the ‘grass’ attribute, which will now be usable in your filters.
Handling a Stale Index
Especially if you don’t use delta indexing, you risk having records in the Sphinx index that are no longer in the database. By default, those will simply come back as nils:
>> pat_user.delete
>> User.search("pat")
Sphinx Result: [1,2]
=> [nil, <#User id: 2>]
(If you search across multiple models, you’ll get ActiveRecord::RecordNotFound.)
You can simply Array#compact these results or handle the nils in some other way, but Sphinx will still report two results, and the missing records may upset your layout.
If you pass :retry_stale => true to a single-model search, missing records will cause Thinking Sphinx to retry the query but excluding those records. Since search is paginated, the new search could potentially include missing records as well, so by default Thinking Sphinx will retry three times. Pass :retry_stale => 5 to retry five times, and so on. If there are still missing ids on the last retry, they are shown as nils.
367 368 369 |
# File 'lib/thinking_sphinx/search_methods.rb', line 367 def search(*args) ThinkingSphinx::Search.new *(args) end |
#search_context ⇒ Object
10 11 12 13 14 15 16 17 18 |
# File 'lib/thinking_sphinx/search_methods.rb', line 10 def search_context # Comparing to name string to avoid class inheritance complications case self.class.name when 'Class' self else nil end end |
#search_count(*args) ⇒ Object
408 409 410 411 412 413 414 |
# File 'lib/thinking_sphinx/search_methods.rb', line 408 def search_count(*args) search = ThinkingSphinx::Search.new( *(args, :ids_only => true) ) search.first # forces the query search.total_entries end |
#search_for_id(id, index, options = {}) ⇒ Object
Checks if a document with the given id exists within a specific index. Expected parameters:
-
ID of the document
-
Index to check within
-
Options hash (defaults to {})
Example:
ThinkingSphinx.search_for_id(10, "user_core", :class => User)
394 395 396 397 398 399 400 401 402 |
# File 'lib/thinking_sphinx/search_methods.rb', line 394 def search_for_id(id, index, = {}) ThinkingSphinx::Search.new( *([], :ids_only => true, :index => index, :id_range => id..id ) ).any? end |
#search_for_ids(*args) ⇒ Object
Searches for results that match the parameters provided. Will only return the ids for the matching objects. See #search for syntax examples.
Note that this only searches the Sphinx index, with no ActiveRecord queries. Thus, if your index is not in sync with the database, this method may return ids that no longer exist there.
379 380 381 |
# File 'lib/thinking_sphinx/search_methods.rb', line 379 def search_for_ids(*args) ThinkingSphinx::Search.new *(args, :ids_only => true) end |