Experiences with IR Top N optimization in a main memory DBMS: applying `the database approach' in new domains

Blok, Henk Ernst; de Vries, Arjen; Blanken, H.M.; Apers, Peter

H.E. Blok (Henk Ernst), A.P. de Vries (Arjen), H.M. Blanken and P.M.G. Apers (Peter)

2001

Experiences with IR Top N optimization in a main memory DBMS: applying `the database approach' in new domains

Presented at the British National Conference on Databases

Data abstraction and query processing techniques are usually studied in the domain of administrative applications. We present a case-study in the non-standard domain of (multimedia) information retrieval, mainly intended as a feasibility study in favor of the `database approach' to data management. Top-N queries form a natural query class when dealing with content retrieval. In the IR field, a lot of research has been done on processing top-N queries efficiently. Unfortunately, these results cannot directly be ported to the database environment, because their tuple-oriented nature would seriously limit the freedom of the query optimizer to select appropriate query plans. By horizontally fragmenting our database containing document statistics, we are able to combine some of the best of the IR and database optimization principles, providing good retrieval quality as well as database `goodies' like flexibility, scalability, efficiency, and generality. Key issues we address in this paper concern the effects of our fragmentation approach on speed and quality of the answers, opportunities for scalability, supported by experimental results.

Additional Metadata
Keywords	Top-N, Indexing, Query optimization, Content based retrieval, Multimedia, Databases
THEME	Information (theme 2)
Publisher	Springer
Series	Lecture Notes in Computer Science
Conference	British National Conference on Databases
Organisation	Database Architectures
Citation APA APA Style APA-ALL Style AAA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Blok, H. E., de Vries, A., Blanken, H. M.& Apers, P. (2001). Experiences with IR Top N optimization in a main memory DBMS: applying `the database approach' in new domains. Proceedings of British National Conference on Databases 2001 (18), 126–151.

Experiences with IR Top N optimization in a main memory DBMS: applying `the database approach' in new domains

Publication

Publication

Address

CWI researchers

Questions or comments?

Experiences with IR Top N optimization in a main memory DBMS: applying `the database approach' in new domains

Publication

Publication

Workflow

Workflow

Add Content