Based on the finalized design from T146663 and T139310, this A/B/C test will have two test groups that will be shown a new right sidebar that will contain relevant search results that were found from cross-wiki searching. A control group will see the currently existing search results page. The two test groups will be shown results using two different randomizations.
This test is expected to last at least a week and will be run on Persian, Italian, Catalan and Polish Wikipedias (their selection was based on community input).
Test group users will see:
- additional search results from sister wikis in a right sidebar
- each result for the sister wiki(s) will display:
- the top ranked result from any wiki that contain relative search results
- an icon that denotes which wiki the result is from
- article name of the search result
- description of the search result
- typical bolding of the search result term(s)
- link below the search result that is labeled 'more results'
- this link will open a new browser tab and display a search results page for the original search term on that sister wiki
- separate section for multimedia results above the other sister wiki results
- up to 3 images will be displayed that are relevant to the original search term
- display a link that will open a new browser tab and display a search results page of multimedia for the original search term from the native wikipedia that the user is on
- for example, if a user searched for 'gutenberg' on English Wikipedia, and clicked on the more multimedia link, the user will be displayed search results for multimedia for 'gutenberg' on English Wikipedia in a new tab.
Order of projects will be randomized:
- one group of users will see results based on recall - most to least number of articles returned from each project
- one group of users will see results based on a random order
- results from Commons will always be displayed first
- Wikispecies will most likely not be included in this test cycle
Bucket testing logic generally is as follows:
- 1 in 200 users are included in EventLogging
- Of those 1 in 200 users, 1 in 10 are included in the test
- Of those 1 in 10 users
- 1/3 will go in a test group, labeled "recall_sidebar_results"
- 1/3 will go in a test group, labeled "random_sidebar_results"
- the remaining 1/3 of users will go in a control group, labeled "no_sidebar"
- The remaining chunk of the original bucketed 200 users will get a NULL (the string null, or the MySQL null, we can detect either).
Eventlogging needs to capture:
- if the user clicked on an individual result and what wiki project that result came from
- what position in the list was the selected result
- if the user clicked on the 'more from' on any wiki project result that was displayed
- important to compare control group that has sister wiki results vs test group that also has sister wiki results
Eventlogging data will be joined against CirrusSearchRequestSet logging to capture:
- if results were shown and from which wiki projects
Notes to take into account:
- for Italian wiki - note that for those users that aren't selected in the bucketing, we'll need to show the existing sister wiki search results as currently existing.
- a few days after the test starts, we'll need to take a look at the initial results:
- do we need to increase the sampling rate?
- do we need to increase the amount of wikipedias the test is being run on?
- list of languages that have all or nearly all projects that we might want to also test on (in no particular order):
- Arabic
- Czech
- English
- Finnish
- French
- German
- Greek
- Hebrew
- Russian
- Portuguese
- Swedish
- Chinese
- Ukrainian
- list of languages that have all or nearly all projects that we might want to also test on (in no particular order):
Draft sample image of what this test could look like: