LDBC Social Network Benchmark graphs

The graphs were generated by the LDBC Social Network Benchmark's Datagen (Hadoop variant, https://github.com/ldbc/ldbc_snb_datagen_hadoop/releases/tag/v0.3.5). The available scale factors are: SF1, SF3, ..., SF1000. The data sets are stored in zstd-compressed CSV files.

  • social_network-csv_basic: data sets produced using the CsvBasic serializer and the "yyyy-MM-dd'T'HH:mm:ss.SSSZ" datetime formatter (StringDateFormatter).
  • social_network-csv_basic-longdateformatter: data sets produced using the CsvBasic serializer and the Unix epoch millisecond datetime formatter (LongDateFormatter).
  • social_network-csv_composite: data sets produced using the CsvComposite serializer and the "yyyy-MM-dd'T'HH:mm:ss.SSSZ" datetime formatter (StringDateFormatter).
  • social_network-csv_composite-longdateformatter: data sets produced using the CsvComposite serializer and the Unix epoch millisecond datetime formatter (LongDateFormatter).
  • social_network-csv_composite_merge_foreign: data sets produced using the CsvCompositeMergeForeign serializer and the "yyyy-MM-dd'T'HH:mm:ss.SSSZ" datetime formatter (StringDateFormatter).
  • social_network-csv_composite_merge_foreign-longdateformatter: data sets produced using the CsvCompositeMergeForeign serializer and the Unix epoch millisecond datetime formatter (LongDateFormatter).
  • social_network-csv_merge_foreign: data sets produced using the CsvMergeForeign serializer and the "yyyy-MM-dd'T'HH:mm:ss.SSSZ" datetime formatter (StringDateFormatter).
  • social_network-csv_merge_foreign-longdateformatter: data sets produced using the CsvMergeForeign serializer and the Unix epoch millisecond datetime formatter (LongDateFormatter).
  • social_network-ttl: data sets produced using the Turtle serializer and the "yyyy-MM-dd'T'HH:mm:ss.SSSXXX" datetime formatter (StringDateFormatter).
  • substitution_parameters: query substitution parameters produced by the parameter generator (paramgen) component.
  • updatestreams: update streams defining insert operations. They are available in variants with the following numbers of partitions: 2^k (1, 2, 4, 8, 16, 32, 64, 128, 256, 512, 1024) and 6×2^k (24, 48, 96, 192, 384, 768).
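
The two datetime encodings listed above can be read into the same representation. The following is a minimal Python sketch, not part of the data set tooling: the function names and the sample values are hypothetical, and it assumes that the string variants follow the Java patterns quoted above (millisecond precision, with a "+0000"-style offset for the CSV variants and a "+00:00"-style offset for the Turtle variant).

```python
from datetime import datetime, timedelta, timezone

# Reference point for the LongDateFormatter variants (Unix epoch, UTC).
EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def parse_string_datetime(value: str) -> datetime:
    """Parse a StringDateFormatter-style value (hypothetical helper).

    Python's %z accepts both "+0000" and "+00:00" offsets, so the same
    format string covers the SSSZ and SSSXXX patterns.
    """
    return datetime.strptime(value, "%Y-%m-%dT%H:%M:%S.%f%z")


def parse_long_datetime(value: str) -> datetime:
    """Parse a LongDateFormatter-style value: milliseconds since the epoch."""
    # timedelta arithmetic avoids floating-point rounding of the milliseconds.
    return EPOCH + timedelta(milliseconds=int(value))


if __name__ == "__main__":
    # Hypothetical sample values, chosen to denote the same instant.
    as_string = parse_string_datetime("2010-03-13T02:10:23.099+0000")
    as_long = parse_long_datetime("1268446223099")
    print(as_string.isoformat(), as_long.isoformat(), as_string == as_long)
```

Both helpers return timezone-aware values, so a timestamp from a StringDateFormatter variant can be compared directly with one from a LongDateFormatter variant.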

Disclaimer. These data sets (including the social network graphs, query substitution parameters, and update streams) are part of the LDBC Social Network Benchmark's Interactive workload, which is protected by the LDBC trademark.

When using these data sets in publications, you must indicate which tier the benchmarked implementation belongs to, based on the tiers described in the LDBC fair use policy (https://ldbcouncil.org/benchmarks/snb/).

If you have any questions, please reach out to info@ldbcouncil.org.

For direct links and an automated download script, visit: https://github.com/ldbc/data-sets-surf-repository

Website: https://ldbcouncil.org/benchmarks/snb/

Related publication: Renzo Angles, János Benjamin Antal, Alex Averbuch, Peter A. Boncz, Orri Erling, Andrey Gubichev, Vlad Haprian, Moritz Kaufmann, Josep Lluís Larriba-Pey, Norbert Martínez-Bazan, József Marton, Marcus Paradies, Minh-Duc Pham, Arnau Prat-Pérez, Mirko Spasic, Benjamin A. Steer, Gábor Szárnyas, Jack Waudby: The LDBC Social Network Benchmark. CoRR abs/2001.02299 (2020). https://arxiv.org/abs/2001.02299

Identifier
DOI: https://doi.org/10.25606/SURF.8f3ac424d6694282
PID: https://hdl.handle.net/11112/e6e00558-a2c3-9214-473e-04a16de09bf8
Related Identifier: https://arxiv.org/abs/2001.02299
Related Identifier: https://github.com/ldbc/ldbc_snb_datagen_hadoop
Metadata Access: https://repository.surfsara.nl/api/oai2?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:repository.surfsara.nl:cwi_snb
Provenance
Creator: Gábor Szárnyas
Publisher: SURF
Publication Year: 2022
Rights: LDBC Data set license; info:eu-repo/semantics/openAccess
OpenAccess: true
Representation
Language: No linguistic content; Not applicable
Resource Type: Dataset
Format: application/zstd
Discipline: Other