CWI and University of Twente used PF/Tijah, a flexible XML retrieval system, to evaluate structured document retrieval, multi-media retrieval, and entity ranking tasks in the context of INEX 2007. For the retrieval of textual and multimedia elements in the Wikipedia data, we investigated various length priors and found that biasing towards longer elements than the ones retrieved by our language modelling approach can be useful. For retrieving images in isolation, we found that their associated text is a very good source of evidence in the Wikipedia collection. For the entity ranking task, we used random walks to model multi-step relevance propagation from the articles describing entities to all related entities and further, and obtained promising results.

,
Springer
N. Fuhr (Norbert) , M. Lalmas , A. Trotman , J. Kamps
Lecture Notes in Computer Science
Image Indexing and reTrievAL in the Large Scale
Initiative for the Evaluation of XML Retrieval
Human-Centered Data Analytics

Tsikrika, T., Serdyukov, P., Rode, H., Westerveld, T., Aly, R., Hiemstra, D., & de Vries, A. (2008). Structured Document Retrieval, Multimedia Retrieval, and Entity Ranking Using PF/Tijah. In N. Fuhr, M. Lalmas, A. Trotman, & J. Kamps (Eds.), Focused Access to XML Documents (pp. 306–320). Springer.