The graphs were generated by the LDBC Social Network Benchmark's Datagen (Hadoop variant, https://github.com/ldbc/ldbc_snb_datagen_hadoop/releases/tag/v0.3.5). The available scale factors are: SF1, SF3, ..., SF1000. Stored in zstd-compressed CSV files.
- social_network-csv_basic: data sets produced using the CsvBasic serializer and the "yyyy-MM-dd'T'HH:mm:ss.SSSZ" datetime formatter (StringDateFormatter).
- social_network-csv_basic-longdateformatter: data sets produced using the CsvBasic serializer and the unix epoch milli datetime formatter (LongDateFormatter).
- social_network-csv_composite: data sets produced using the CsvComposite serializer and the "yyyy-MM-dd'T'HH:mm:ss.SSSZ" datetime formatter (StringDateFormatter).
- social_network-csv_composite-longdateformatter: data sets produced using the CsvComposite serializer and the unix epoch milli datetime formatter (LongDateFormatter).
- social_network-csv_composite_merge_foreign: data sets produced using the CsvCompositeMergeForeign serializer and the "yyyy-MM-dd'T'HH:mm:ss.SSSZ" datetime formatter (StringDateFormatter).
- social_network-csv_composite_merge_foreign-longdateformatter: data sets produced using the CsvCompositeMergeForeign serializer and the unix epoch milli datetime formatter (LongDateFormatter).
- social_network-csv_merge_foreign: data sets produced using the CsvMergeForeign serializer and the "yyyy-MM-dd'T'HH:mm:ss.SSSZ" datetime formatter (StringDateFormatter).
- social_network-csv_merge_foreign-longdateformatter: data sets produced using the CsvMergeForeign serializer and the unix epoch milli datetime formatter (LongDateFormatter).
- social_network-ttl: data sets produced using the Turtle serializer and the "yyyy-MM-dd'T'HH:mm:ss.SSSXXX" datetime formatter (StringDateFormatter).
- substitution_parameters: query substitution parameters produced by the parameter generator (paramgen) component.
- updatestreams: update streams defining insert operations. They are available in variants with the following partition numbers: 2^k (1, 2, 4, 8, 16, 32, 64, 128, 256, 512, 1024) and 6×2^k (24, 48, 96, 192, 384, 768).
Disclaimer. These data sets (including the social network graphs, query substitution parameters, and update streams) are part of the LDBC Social Network Benchmark's Interactive workload which is protected by the LDBC trademark.
When using these data sets in publications, it is required to indicate which tier the benchmarked implementation belongs to based on the tiers described in the LDBC fair use policy (https://ldbcouncil.org/benchmarks/snb/).
If you have any questions, please reach out to info@ldbcouncil.org.
For direct links and an automated download script, visit: https://github.com/ldbc/data-sets-surf-repository
Website: https://ldbcouncil.org/benchmarks/snb/
Related publication: Renzo Angles, János Benjamin Antal, Alex Averbuch, Peter A. Boncz, Orri Erling, Andrey Gubichev, Vlad Haprian, Moritz Kaufmann, Josep Lluís Larriba-Pey, Norbert Martínez-Bazan, József Marton, Marcus Paradies, Minh-Duc Pham, Arnau Prat-Pérez, Mirko Spasic, Benjamin A. Steer, Gábor Szárnyas, Jack Waudby: The LDBC Social Network Benchmark. CoRR abs/2001.02299 (2020). https://arxiv.org/abs/2001.02299