Traditional optimizers fail to pick good execution plans when faced with increasingly complex queries and large data sets. This failure is even more acute in the context of XQuery, due to the structured nature of the XML language. To overcome the vulnerabilities of traditional optimizers, we have previously proposed ROX, a Run-time Optimizer for XQueries, which interleaves optimization and execution of full tables. ROX has proved to be robust, even in the presence of strong correlations, but it has one limitation: it fully materializes intermediate results, which makes it unsuitable for pipelined systems. Therefore, this paper proposes ROX-sampled, a variant of ROX that executes over small data samples and thus generates smaller intermediates. We conduct extensive experiments, which show that ROX-sampled is comparable to ROX in performance and that it remains robust against correlations. The main benefit of ROX-sampled is that it allows the large number of pipelined databases to import the ROX idea into their optimization paradigm.

M. Jeusfeld c/o Redaktion Sun SITE, Informatik V, RWTH Aachen
Alberto Mendelzon International Workshop on Foundations of Data Management
Database Architectures

Abdel Kader, R., van Keulen, M., Boncz, P., & Manegold, S. (2010). Run-time Optimization for Pipelined Systems. In Proceedings of Alberto Mendelzon International Workshop on Foundations of Data Management 2010 (4). M. Jeusfeld c/o Redaktion Sun SITE, Informatik V, RWTH Aachen.