In this paper we introduce LDBC Graphalytics, a new industrial-grade benchmark for graph analysis platforms. It consists of six deterministic algorithms, standard datasets, synthetic dataset generators, and reference output, which together enable the objective comparison of graph analysis platforms. Its test harness produces deep metrics that quantify multiple kinds of system scalability, such as horizontal/vertical and weak/strong, and of robustness, such as failures and performance variability. The benchmark comes with open-source software for generating data and monitoring performance. We describe and analyze six implementations of the benchmark (three from the community, three from industry), providing insights into the strengths and weaknesses of the platforms. Key to our contribution, vendors perform the tuning and benchmarking of their own platforms.
Additional Metadata
Theme Information (theme 2), Energy (theme 4)
Stakeholder Oracle Labs, Intel Labs, IBM Research, Huawei Research America
Persistent URL dx.doi.org/10.14778/3007263.3007270
Project Graphalyzing4Security
Conference International Conference on Very Large Data Bases
Citation
Iosup, A, Hegeman, T, Ngai, W.L, Heldens, S, Prat, A, Manhardt, T, … Boncz, P.A. (2016). LDBC Graphalytics: A benchmark for large-scale graph analysis on parallel and distributed platforms. In Proceedings of the VLDB Endowment (pp. 1317–1328). doi:10.14778/3007263.3007270