You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Fields hl_title and hl_subtitle are defined as stringTokens which is a defined type based on solr.TextField.
The definition contains a number of query analyzers:
The content_citation field is defined as a text type, and is defined similarly to stringTokens but has additional index and query analyzers.
The query ones are:
most notably the <filter class="solr.StopFilterFactory" words="stopwords.txt" ignoreCase="true"/> filter seems to cause some performance issues, doing the same query against the two fields has on the order of 5 times performance difference: /select?q=(content_citation:(Technology development for identification of citrus Citrus spp rootstocks based on Sequence Tagged Microsatellite marker)) results in :
whereas the hl_title query: /select?q=(hl_title:(Technology development for identification of citrus Citrus spp rootstocks based on Sequence Tagged Microsatellite marker))
removing the stopwords manually: select?q=(hl_title:(Technology development identification citrus Citrus spp rootstocks based Sequence Tagged Microsatellite marker))
results in :
Adding the stopwords query filter to the stringTokens definition might be worthwhile. note
Adding the filter does not remove the stopwords from the field or from quoted queries.
Modifying the field definition will affect other fields of that type, may need to assess impact.
Changing the field definition to type text would likely require a reindex.
The text was updated successfully, but these errors were encountered:
Fields hl_title and hl_subtitle are defined as
stringTokens
which is a defined type based onsolr.TextField
.The definition contains a number of query analyzers:
The content_citation field is defined as a
text
type, and is defined similarly tostringTokens
but has additional index and query analyzers.The query ones are:
most notably the
<filter class="solr.StopFilterFactory" words="stopwords.txt" ignoreCase="true"/>
filter seems to cause some performance issues, doing the same query against the two fields has on the order of 5 times performance difference:/select?q=(content_citation:(Technology development for identification of citrus Citrus spp rootstocks based on Sequence Tagged Microsatellite marker))
results in :whereas the
hl_title
query:/select?q=(hl_title:(Technology development for identification of citrus Citrus spp rootstocks based on Sequence Tagged Microsatellite marker))
removing the stopwords manually:
select?q=(hl_title:(Technology development identification citrus Citrus spp rootstocks based Sequence Tagged Microsatellite marker))
results in :
Adding the stopwords query filter to the
stringTokens
definition might be worthwhile.note
Adding the filter does not remove the stopwords from the field or from quoted queries.
Modifying the field definition will affect other fields of that type, may need to assess impact.
Changing the field definition to type
text
would likely require a reindex.The text was updated successfully, but these errors were encountered: