A study of the effect of term proximity on query expansion

Department of Management Sciences, University of Waterloo, Waterloo, Canada, ovechtom{at}uwaterloo.ca
Toshiba of Canada Limited, Markham, Canada

Query expansion terms are often used to enhance original query formulations in document retrieval. Such terms are usually selected from entire documents or from windows or passages surrounding query term occurrences. Arguably, the semantic relatedness between terms weakens as the distance separating them increases. In this paper we report a study that was conducted to systematically evaluate different distance functions for selecting query expansion terms. We propose a distance factor that can be effectively combined with the statistical term association measure of mutual information for selecting query expansion terms. Evaluation on the TREC collection shows that distance-weighted mutual information is more effective than mutual information alone in selecting terms for query expansion.
Key Words: information retrieval; query expansion; term proximity; word collocation; mutual information
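
The idea of combining a distance factor with mutual information can be illustrated with a minimal sketch. The abstract does not state the distance function or the exact mutual information formula used in the paper, so everything below is an assumption made for illustration: the function name `distance_weighted_mi`, the window size, the inverse-distance weight `1/d`, and the pointwise form of mutual information are all hypothetical choices, not the authors' method.

```python
import math
from collections import Counter, defaultdict


def distance_weighted_mi(docs, query_term, window=5, weight=lambda d: 1.0 / d):
    """Score candidate expansion terms for a single query term.

    docs   -- list of documents, each a list of tokens
    window -- number of positions inspected on each side of a query term
    weight -- hypothetical distance factor; 1/d is an assumption, not
              the function evaluated in the paper
    """
    term_freq = Counter()
    weighted_cooc = defaultdict(float)
    total_tokens = 0

    for tokens in docs:
        term_freq.update(tokens)
        total_tokens += len(tokens)
        for i, tok in enumerate(tokens):
            if tok != query_term:
                continue
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j == i or tokens[j] == query_term:
                    continue
                # Each co-occurrence contributes its distance weight
                # instead of a flat count of 1.
                weighted_cooc[tokens[j]] += weight(abs(i - j))

    query_freq = term_freq[query_term]
    scores = {}
    for term, cooc in weighted_cooc.items():
        # Pointwise mutual information with the weighted co-occurrence
        # mass substituted for the raw co-occurrence count.
        scores[term] = math.log2(cooc * total_tokens / (query_freq * term_freq[term]))
    return scores


docs = [
    ["query", "expansion", "improves", "retrieval"],
    ["retrieval", "of", "documents", "uses", "query", "expansion"],
]
weighted = distance_weighted_mi(docs, "query")
unweighted = distance_weighted_mi(docs, "query", weight=lambda d: 1.0)
```

In this toy corpus, "expansion" and "retrieval" each co-occur with "query" twice within the window, so the unweighted variant scores them identically. With the inverse-distance weight, "expansion" (always adjacent to "query") scores higher than "retrieval" (three and four positions away), which is the kind of separation a distance factor is meant to provide.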
This version was published on August 1, 2006.
Journal of Information Science, Vol. 32, No. 4, 324-333 (2006)