CWI and the University of Twente used PF/Tijah, a flexible XML retrieval system, to evaluate structured document retrieval, multimedia retrieval, and entity ranking tasks in the context of INEX 2007. For the retrieval of textual and multimedia elements in the Wikipedia data, we investigated various length priors and found that biasing towards elements longer than those retrieved by our language modelling approach can be useful. For retrieving images in isolation, we found that their associated text is a very good source of evidence in the Wikipedia collection. For the entity ranking task, we used random walks to model multi-step relevance propagation from the articles describing entities to all related entities and beyond, and obtained promising results.

Additional Metadata
THEME Information (theme 2)
Publisher Springer
Editor N. Fuhr, M. Lalmas, A. Trotman, J. Kamps
Series Lecture Notes in Computer Science
Project Image Indexing and reTrievAL in the Large Scale
Conference Initiative for the Evaluation of XML Retrieval
Citation
Tsikrika, T., Serdyukov, P., Rode, H., Westerveld, T.H.W., Aly, R., Hiemstra, D., & de Vries, A.P. (2008). Structured Document Retrieval, Multimedia Retrieval, and Entity Ranking Using PF/Tijah. In N. Fuhr, M. Lalmas, A. Trotman, & J. Kamps (Eds.), Focused Access to XML Documents (pp. 306–320). Springer.