File:From Zero to Hero - Anticipating Zero Results From Query Features, Ignoring Content.pdf
English: The Discovery Department uses the zero results rate -- the proportion of searches that yield zero results -- to measure the performance of our search system. However, little is known about possible patterns that affect the quantity (and quality) of results our users see. In this report, we use random forest and logistic regression models to shed light on the types of queries that tend to yield zero results.
Namely, we found that whether the query has an even number of double quotes is one of the most important indicators of whether it will yield zero results. Other notable features that impact the quantity of results include: whether the query is only punctuation and spaces, whether it ends with ?, and whether it has logical operators. For a full list of features and their importance and impact, please see Figures 4 and 5.Going forward, we may want to rewrite queries to not have quotes when the original query (with double quotation marks) yielded 0 results. We may also want to address question queries (that end with ?).
||Copyright in this work is either owned/co-owned by the Wikimedia Foundation or the content has been licensed to the Wikimedia Foundation. The uploader asserts that he or she is acting as an agent for the Wikimedia Foundation in uploading this content. In reusing this media under the specified license, please attribute the creator.
|This file is licensed under the Creative Commons Attribution-Share Alike 4.0 International license.|
|™ Wikimedia Foundation, Inc.
This file is (or includes) one of the official logos or designs used by the Wikimedia Foundation or by one of its projects. Use of the Wikimedia logos and trademarks is subject to the Wikimedia trademark policy and visual identity guidelines, and may require permission.
Click on a date/time to view the file as it appeared at that time.
|current||23:17, 24 May 2016||1,275 × 1,650, 8 pages (1.81 MB)||MPopov (WMF)||User created page with UploadWizard|
- You cannot overwrite this file.