Data fusion in information retrieval has been investigated by many researchers and quite a few data fusion methods have been proposed. However, their impact on effectiveness has not been well understood. In this paper, we apply statistical principles to data fusion and present a statistical data fusion model, which specifies the algorithm for fusion and conditions to be satisfied. The statistical model can be used as a guideline for data fusion methods. Based on this analysis, we compare CombSum and CombMNZ, which are the two best-known data fusion methods. We explain why sometimes CombMNZ does outperform Comb- Sum and what can be done to make CombSum more effective. Experimental results with TREC data are reported to support the conclusion that our enhancements to the algorithm improve effectiveness.
|Title of host publication||Unknown Host Publication|
|Number of pages||7|
|Publication status||Published - 2007|
|Event||IEEE International Conference on Systems, Man and Cybernetics, 2007 - Montreal, Canada|
Duration: 1 Jan 2007 → …
|Conference||IEEE International Conference on Systems, Man and Cybernetics, 2007|
|Period||1/01/07 → …|