skip to main content
10.1145/2600428.2609472acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
poster

Evaluating non-deterministic retrieval systems

Published:03 July 2014Publication History

ABSTRACT

The use of sampling, randomized algorithms, or training based on the unpredictable inputs of users in Information Retrieval often leads to non-deterministic outputs. Evaluating the effectiveness of systems incorporating these methods can be challenging since each run may produce different effectiveness scores. Current IR evaluation techniques do not address this problem. Using the context of distributed information retrieval as a case study for our investigation, we propose a solution based on multivariate linear modeling. We show that the approach provides a consistent and reliable method to compare the effectiveness of non-deterministic IR algorithms, and explain how statistics can safely be used to show that two IR algorithms have equivalent effectiveness.

References

  1. R. H. Baayen, D. J. Davidson, and D. M. Bates. Mixed-effects modeling with crossed random effects for subjects and items. Journal of memory and language, 59(4):390--412, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  2. B. Carterette, E. Kanoulas, and E. Yilmaz. Simulating simple user for system effectiveness evaluation. In CIKM, pages 611--620, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. A. Kulkarni and J. Callan. Document allocation policies for selective searching of distributed indexes. In CIKM, pages 449--458, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. D. Metzler and W. B. Croft. A markov random field model for term dependencies. In SIGIR, pages 472--479, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. E. Robertson and E. Kanoulas. On per-topic variance in IR evaluation. In SIGIR, pages 891--900, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. L. Si and J. Callan. Relevant document distribution estimation method for resource selection. In SIGIR, pages 298--305, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Evaluating non-deterministic retrieval systems

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGIR '14: Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval
      July 2014
      1330 pages
      ISBN:9781450322577
      DOI:10.1145/2600428

      Copyright © 2014 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 3 July 2014

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • poster

      Acceptance Rates

      SIGIR '14 Paper Acceptance Rate82of387submissions,21%Overall Acceptance Rate792of3,983submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader