LDBC Graphalytics: A benchmark for large-scale graph analysis on parallel and distributed platforms
Presented at the International Conference on Very Large Data Bases (September 2016), New Delhi
In this paper we introduce LDBC Graphalytics, a new industrial-grade benchmark for graph analysis platforms. It consists of six deterministic algorithms, standard datasets, synthetic dataset generators, and reference output, that enable the objective comparison of graph analysis platforms. Its test harness produces deep metrics that quantify multiple kinds of system scalability, such as horizontal/vertical and weak/strong, and of robustness, such as failures and performance variability. The benchmark comes with open-source software for generating data and monitoring performance. We describe and analyze six implementations of the benchmark (three from the community, three from the industry), providing insights into the strengths and weaknesses of the platforms. Key to our contribution, vendors perform the tuning and benchmarking of their platforms.
|Information (theme 2), Energy (theme 4)|
|Oracle Labs, Intel labs, IBM Research, Huawei Research America|
|International Conference on Very Large Data Bases|
Iosup, A, Hegeman, T, Ngai, W.L, Heldens, S, Prat, A, Manhardt, T, … Boncz, P.A. (2016). LDBC Graphalytics: A benchmark for large-scale graph analysis on parallel and distributed platforms. In Proceedings of the VLDB Endowment (pp. 1317–1328). doi:10.14778/3007263.3007270