Conference paper
Machine translation and monolingual information retrieval
Martin Franz, J. Scott McCarley
SIGIR 1999
Search algorithms in most current text retrieval systems use index data structures extracted from the original text documents. In this paper we focus on reducing the size of these indices by reducing the space used to store term frequencies. In experiments using TREC Ad Hoc [2, 3] corpora and query sets, we show that the term frequency can be stored in only two bits without decreasing retrieval performance.
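As a rough illustration of what storing a term frequency in two bits could look like, the following minimal sketch maps each raw count to one of four codes and packs four codes per byte. The abstract does not specify the mapping used in the paper, so the bucket boundaries in `quantize_tf` and the helper names `pack_codes` and `unpack_codes` are assumptions made for this example only.

```python
from typing import List


def quantize_tf(tf: int) -> int:
    """Map a raw term frequency (>= 1) to a 2-bit code in 0..3.

    The bucket boundaries below are hypothetical, not taken from the paper.
    """
    if tf <= 1:
        return 0
    if tf == 2:
        return 1
    if tf <= 4:
        return 2
    return 3


def pack_codes(codes: List[int]) -> bytes:
    """Pack 2-bit codes four per byte, lowest-order bits first."""
    out = bytearray((len(codes) + 3) // 4)
    for i, code in enumerate(codes):
        out[i // 4] |= (code & 0b11) << (2 * (i % 4))
    return bytes(out)


def unpack_codes(data: bytes, count: int) -> List[int]:
    """Recover the first `count` 2-bit codes from packed bytes."""
    return [(data[i // 4] >> (2 * (i % 4))) & 0b11 for i in range(count)]


# Term frequencies for one hypothetical posting list.
raw_tfs = [1, 2, 3, 7, 1]
codes = [quantize_tf(tf) for tf in raw_tfs]
packed = pack_codes(codes)
```

Here five term frequencies occupy two bytes. Under this scheme a ranking function would see only the bucket code, not the exact count, so whether retrieval quality holds up depends on how the buckets are chosen.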