Bootstrap confidence intervals for mean average precision

Research output: Chapter in Book / Conference PaperConference Paperpeer-review

Abstract

Due to the unconstrained nature of language, search engines (such as the Google search engine) are developed and compared by obtaining a document set, a sample set of queries and the associated relevance judgments for the queries on the document set. The de facto standard function used to measure the accuracy of each search engine on the test data is called mean Average Precision (AP). It is common practice to report mean AP scores and the results of paired significance tests against baseline search engines, but the confidence in the mean AP score is never reported. In this article, we investigate the utility of bootstrap confidence intervals for mean AP. We find that our Standardised logit bootstrap confidence intervals are very accurate for all levels of confidence examined and sample sizes.
Original languageEnglish
Title of host publicationProceedings of the 4th Applied Statistics Education and Research Collaboration (ASEARC) Conference, Paramatta, February 17-18, 2011
PublisherUniversity of Wollongong
Pages51-54
Number of pages4
ISBN (Print)9781741281958
Publication statusPublished - 2011
EventASEARC Research Conference -
Duration: 17 Feb 2011 → …

Conference

ConferenceASEARC Research Conference
Period17/02/11 → …

Fingerprint

Dive into the research topics of 'Bootstrap confidence intervals for mean average precision'. Together they form a unique fingerprint.

Cite this