David Lillis: Probabilistic Data Fusion on a Large Document Collection

Probabilistic Data Fusion on a Large Document Collection

David Lillis, Fergus Toolan, Rem Collier and John Dunnion

In Proceedings of the 17th Irish Conference on Artificial Intelligence and Cognitive Science (AICS 2006), Belfast, Northern Ireland, 2006.

Abstract

Data Fusion is the process of combining the output of a number of Information Retrieval algorithms into a single result set, to achieve greater retrieval performance. ProbFuse is a probabilistic data fusion algorithm that has been shown to outperform the CombMNZ algorithm in a number of previous experiments. This paper builds upon this previous work and applies probFuse to the much larger Web Track document collection from the 2004 Text REtreival Conference.