Abstract
Due to the unconstrained nature of language, search engines (such as the Google search engine) are developed and compared by obtaining a document set, a sample set of queries and the associated relevance judgments for the queries on the document set. The de facto standard function used to measure the accuracy of each search engine on the test data is called mean Average Precision (AP). It is common practice to report mean AP scores and the results of paired significance tests against baseline search engines, but the confidence in the mean AP score is never reported. In this article, we investigate the utility of bootstrap confidence intervals for mean AP. We find that our Standardised logit bootstrap confidence intervals are very accurate for all levels of confidence examined and sample sizes.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 4th Applied Statistics Education and Research Collaboration (ASEARC) Conference, Paramatta, February 17-18, 2011 |
| Publisher | University of Wollongong |
| Pages | 51-54 |
| Number of pages | 4 |
| ISBN (Print) | 9781741281958 |
| Publication status | Published - 2011 |
| Event | ASEARC Research Conference - Duration: 17 Feb 2011 → … |
Conference
| Conference | ASEARC Research Conference |
|---|---|
| Period | 17/02/11 → … |
Fingerprint
Dive into the research topics of 'Bootstrap confidence intervals for mean average precision'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver