Skip to content

Commit

Permalink
Add nightly task for disjunctions of many terms. (#305)
Browse files Browse the repository at this point in the history
These queries are interesting because they are less favorable to dynamic
pruning, and things like evaluating the maximum score per block or
re-partitioning can be expensive.
  • Loading branch information
jpountz authored Oct 15, 2024
1 parent cafa33d commit 94a2200
Show file tree
Hide file tree
Showing 2 changed files with 8 additions and 0 deletions.
1 change: 1 addition & 0 deletions src/python/nightlyBench.py
Original file line number Diff line number Diff line change
Expand Up @@ -1408,6 +1408,7 @@ def writeIndexHTML(searchChartData, days):
writeOneLine(w, done, 'Or3Terms', 'Disjunction of 3 terms')
writeOneLine(w, done, 'And3Terms', 'Conjunction of 3 terms')
writeOneLine(w, done, 'OrHighRare', 'Disjunction of a very frequent term and a very rare term')
writeOneLine(w, done, 'OrMany', 'Disjunction of many terms')

w('<br><br><b>CombinedFieldsQuery:</b>')
writeOneLine(w, done, 'CombinedTerm', 'Combined high-freq')
Expand Down
7 changes: 7 additions & 0 deletions tasks/wikinightly.tasks
Original file line number Diff line number Diff line change
Expand Up @@ -306,3 +306,10 @@ And3Terms: +law +school +rankings
OrHighRare: the incredibles
OrHighRare: a horsefly
OrHighRare: some groundnuts

# Disjunctions across many terms
OrMany: apache lucene is a free and open-source search engine software library originally written in java by doug cutting it is supported by the apache software foundation and is released under the apache software license lucene is widely used as a standard foundation for production search applications
OrMany: solr pronounced solar is an open-source enterprise-search platform written in java its major features include full-text search hit highlighting faceted search real-time indexing dynamic clustering database integration nosql features and rich document word pdf handling
OrMany: a database index is a data structure that improves the speed of data retrieval operations on a database table at the cost of additional writes and storage space to maintain the index data structure indexes are used to quickly locate data without having to search every row in a database table every time said table is accessed
OrMany: in computing a search engine is an information retrieval software system designed to help find information stored on one or more computer systems search engines discover crawl transform and store information for retrieval and presentation in response to user queries
OrMany: a java virtual machine jvm is a virtual machine that enables a computer to run java programs as well as programs written in other languages that are also compiled to java bytecode the jvm is detailed by a specification that formally describes what is required in a jvm implementation

0 comments on commit 94a2200

Please sign in to comment.