SLTrans

The dataset consists of pairs of source code and LLVM IR generated from accepted, deduplicated programming contest solutions. It is divided into language configs and mode splits. The language config names the language of the source files and is one of C, C++, D, Fortran, Go, Haskell, Nim, Objective-C, Python, Rust and Swift. The mode split indicates the compilation mode, which is either Size_Optimized or Perf_Optimized.
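
The config and split layout described above can be sketched as a small lookup. This is a minimal illustration only: the helper name `validate_selection` is hypothetical, and it assumes that the config and split identifiers are spelled exactly as in the description and that every language is available in both modes, neither of which this record confirms.

```python
from itertools import product
from typing import List, Tuple

# Language configs as listed in the description (exact identifier spelling is an assumption).
LANGUAGES: List[str] = [
    "C", "C++", "D", "Fortran", "Go", "Haskell",
    "Nim", "Objective-C", "Python", "Rust", "Swift",
]

# Mode splits as listed in the description.
MODES: List[str] = ["Size_Optimized", "Perf_Optimized"]


def validate_selection(language: str, mode: str) -> Tuple[str, str]:
    """Return (language, mode) if both name a known config and split, else raise ValueError."""
    if language not in LANGUAGES:
        raise ValueError(f"unknown language config: {language!r}")
    if mode not in MODES:
        raise ValueError(f"unknown mode split: {mode!r}")
    return (language, mode)


# Every (config, split) combination, assuming each language ships both modes.
ALL_SELECTIONS: List[Tuple[str, str]] = list(product(LANGUAGES, MODES))
```

A call such as `validate_selection("Rust", "Size_Optimized")` returns the pair unchanged, while a misspelled mode raises `ValueError` before any download or parsing is attempted.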

Identifier
Source https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/4246
Related Identifier IsCitedBy https://arxiv.org/abs/2403.03894
Metadata Access https://tudatalib.ulb.tu-darmstadt.de/oai/openairedata?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:tudatalib.ulb.tu-darmstadt.de:tudatalib/4246
Provenance
Creator Paul, Indraneil; Glavas, Goran; Gurevych, Iryna
Publisher TU Darmstadt
Contributor TU Darmstadt
Publication Year 2024
Rights Creative Commons Attribution 4.0; info:eu-repo/semantics/openAccess
OpenAccess true
Contact https://tudatalib.ulb.tu-darmstadt.de/page/contact
Representation
Language English
Resource Type Dataset
Format application/pdf
Discipline Other