LDBC Graphalytics graphs

Data sets for the LDBC Graphalytics benchmark. Stored in zstd-compressed CSV and Matrix Market files.

  • graphalytics-graph-data-sets: The Graphalytics graphs represented as vertex/edge files and the expected results of the benchmark's algorithms (BFS, CDLP, LCC, PR, SSSP, WCC).
  • graphalytics-sparse-matrices-matrix-market: The Graphalytics graphs represented as sparse matrices, encoded in the Matrix Market format. For weighted graphs, two variants are available: the original weighted graph (fp64) and an unweighted variant (bool).

Website: https://ldbcouncil.org/benchmarks/graphalytics/

Related publication: Alexandru Iosup, Ahmed Musaafir, Alexandru Uta, Arnau Prat-Pérez, Gábor Szárnyas, Hassan Chafi, Ilie Gabriel Tanase, Lifeng Nai, Michael J. Anderson, Mihai Capota, Narayanan Sundaram, Peter A. Boncz, Siegfried Depner, Stijn Heldens, Thomas Manhardt, Tim Hegeman, Wing Lung Ngai, Yinglong Xia: The LDBC Graphalytics Benchmark. CoRR abs/2011.15028 (2020). https://arxiv.org/abs/2011.15028

Statistics:

  • cit-Patents, vertices: 3774768, edges: 16518947
  • com-friendster, vertices: 65608366, edges: 1806067135
  • datagen-7_5-fb, vertices: 633432, edges: 34185747
  • datagen-7_6-fb, vertices: 754147, edges: 42162988
  • datagen-7_7-zf, vertices: 13180508, edges: 32791267
  • datagen-7_8-zf, vertices: 16521886, edges: 41025255
  • datagen-7_9-fb, vertices: 1387587, edges: 85670523
  • datagen-8_0-fb, vertices: 1706561, edges: 107507376
  • datagen-8_1-fb, vertices: 2072117, edges: 134267822
  • datagen-8_2-zf, vertices: 43734497, edges: 106440188
  • datagen-8_3-zf, vertices: 53525014, edges: 130579909
  • datagen-8_4-fb, vertices: 3809084, edges: 269479177
  • datagen-8_5-fb, vertices: 4599739, edges: 332026902
  • datagen-8_6-fb, vertices: 5667674, edges: 421988619
  • datagen-8_7-zf, vertices: 145050709, edges: 340157363
  • datagen-8_8-zf, vertices: 168308893, edges: 413354288
  • datagen-8_9-fb, vertices: 10572901, edges: 848681908
  • datagen-9_0-fb, vertices: 12857671, edges: 1049527225
  • datagen-9_1-fb, vertices: 16087483, edges: 1342158397
  • datagen-9_2-zf, vertices: 434943376, edges: 1042340732
  • datagen-9_3-zf, vertices: 555270053, edges: 1309998551
  • datagen-9_4-fb, vertices: 29310565, edges: 2588948669
  • datagen-sf3k-fb, vertices: 33484375, edges: 2912009743
  • datagen-sf10k-fb, vertices: 100218750, edges: 9404822538
  • dota-league, vertices: 61170, edges: 50870313
  • example-directed, vertices: 10, edges: 17
  • example-undirected, vertices: 9, edges: 12
  • graph500-22, vertices: 2396657, edges: 64155735
  • graph500-23, vertices: 4610222, edges: 129333677
  • graph500-24, vertices: 8870942, edges: 260379520
  • graph500-25, vertices: 17062472, edges: 523602831
  • graph500-26, vertices: 32804978, edges: 1051922853
  • graph500-27, vertices: 63081040, edges: 2111642032
  • graph500-28, vertices: 121242388, edges: 4236163958
  • graph500-29, vertices: 232999630, edges: 8493569115
  • graph500-30, vertices: 447797986, edges: 17022117362
  • kgs, vertices: 832247, edges: 17891698
  • twitter_mpi, vertices: 52579678, edges: 1963263508
  • wiki-Talk, vertices: 2394385, edges: 5021410
Identifier
DOI https://doi.org/10.25606/SURF.e8e60a7e282917f5
PID https://hdl.handle.net/11112/7ec6a51e-6fdb-bf8d-4507-456ccadc9291
Related Identifier https://arxiv.org/abs/2011.15028
Metadata Access https://repository.surfsara.nl/api/oai2?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:repository.surfsara.nl:cwi_graphalytics
Provenance
Creator Alexandru Iosup; Ahmed Musaafir; Alexandru Uta; Arnau Prat-Pérez; Gábor Szárnyas; Hassan Chafi; Ilie Gabriel Tanase; Lifeng Nai; Michael J. Anderson; Mihai Capota; Narayanan Sundaram; Peter A. Boncz; Siegfried Depner; Stijn Heldens; Thomas Manhardt; Tim Hegeman; Wing Lung Ngai; Yinglong Xia
Publisher SURF
Publication Year 2022
Rights LDBC Data set license; info:eu-repo/semantics/openAccess
OpenAccess true
Representation
Language No linguistic content; Not applicable
Resource Type Dataset
Format application/zstd
Discipline Other