poster

Evaluating non-deterministic retrieval systems

Authors:
Gaya K. Jayasinghe

RMIT University, Melbourne, Australia

RMIT University, Melbourne, Australia
View Profile

,
William Webber

William Webber Consulting, Melbourne, Australia

William Webber Consulting, Melbourne, Australia
View Profile

,
Mark Sanderson

RMIT University, Melbourne, Australia

RMIT University, Melbourne, Australia
View Profile

,
Lasitha S. Dharmasena

Deakin University, Burwood, Australia

Deakin University, Burwood, Australia
View Profile

,
J. Shane Culpepper

RMIT University, Melbourne, Australia

RMIT University, Melbourne, Australia
View Profile

SIGIR '14: Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrievalJuly 2014Pages 911–914https://doi.org/10.1145/2600428.2609472

Published:03 July 2014Publication History

SIGIR '14: Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval

Pages 911–914

ABSTRACT

The use of sampling, randomized algorithms, or training based on the unpredictable inputs of users in Information Retrieval often leads to non-deterministic outputs. Evaluating the effectiveness of systems incorporating these methods can be challenging since each run may produce different effectiveness scores. Current IR evaluation techniques do not address this problem. Using the context of distributed information retrieval as a case study for our investigation, we propose a solution based on multivariate linear modeling. We show that the approach provides a consistent and reliable method to compare the effectiveness of non-deterministic IR algorithms, and explain how statistics can safely be used to show that two IR algorithms have equivalent effectiveness.

References

R. H. Baayen, D. J. Davidson, and D. M. Bates. Mixed-effects modeling with crossed random effects for subjects and items. Journal of memory and language, 59(4):390--412, 2008.Google ScholarCross Ref
B. Carterette, E. Kanoulas, and E. Yilmaz. Simulating simple user for system effectiveness evaluation. In CIKM, pages 611--620, 2011. Google ScholarDigital Library
A. Kulkarni and J. Callan. Document allocation policies for selective searching of distributed indexes. In CIKM, pages 449--458, 2010. Google ScholarDigital Library
D. Metzler and W. B. Croft. A markov random field model for term dependencies. In SIGIR, pages 472--479, 2005. Google ScholarDigital Library
S. E. Robertson and E. Kanoulas. On per-topic variance in IR evaluation. In SIGIR, pages 891--900, 2012. Google ScholarDigital Library
L. Si and J. Callan. Relevant document distribution estimation method for resource selection. In SIGIR, pages 298--305, 2003. Google ScholarDigital Library

Index Terms

Evaluating non-deterministic retrieval systems
1. Information systems
  1. Information retrieval
    1. Evaluation of retrieval results

Recommendations

The effectiveness of query-specific hierarchic clustering in information retrieval

Hierarchic document clustering has been widely applied to information retrieval (IR) on the grounds of its potential improved effectiveness over inverted file search (IFS). However, previous research has been inconclusive as to whether clustering does ...
Read More
Multiple testing in statistical analysis of systems-based information retrieval experiments

High-quality reusable test collections and formal statistical hypothesis testing together support a rigorous experimental environment for information retrieval research. But as Armstrong et al. [2009b] recently argued, global analysis of experiments ...
Read More
University of Alicante at WiQA 2006
Evaluation of Multilingual and Multi-modal Information Retrieval

This paper presents the participation of University of Alicante at the WiQA pilot task organized as part of the CLEF 2006 campaign. For a given set of topics, this task presupposes the discovery of important novel information distributed across ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
SIGIR '14: Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval
July 2014
1330 pages
ISBN:9781450322577
DOI:10.1145/2600428
General Chairs:
Shlomo Geva
Queensland University of Technology
,
Andrew Trotman
University of Dunedin
,
Program Chairs:
Peter Bruza
Queensland University of Technology
,
Charles L.A. Clarke
University of Waterloo
,
Kal Järvelin
University of Tampere
Copyright © 2014 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 3 July 2014
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
effectiveness evaluation
experimental design
experimentation
information retrieval
measurement
statistical analysis
Qualifiers
- poster
Conference

Acceptance Rates
SIGIR '14 Paper Acceptance Rate82of387submissions,21%Overall Acceptance Rate792of3,983submissions,20%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 3
  Total Citations
  View Citations
- 201
  Total Downloads
- Downloads (Last 12 months)3
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Evaluating non-deterministic retrieval systems

SIGIR '14: Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval

ABSTRACT

References

Cited By

Index Terms

Recommendations

The effectiveness of query-specific hierarchic clustering in information retrieval

Multiple testing in statistical analysis of systems-based information retrieval experiments

University of Alicante at WiQA 2006