Code and Data for: Better by default: Strong pre-tuned MLPs and boosted trees on tabular data


This dataset contains the code and data for our paper "Better by default: Strong pre-tuned MLPs and boosted trees on tabular data". The main code is provided in pytabkit_code.zip, with further documentation in its README.md and docs folder; it is also available on GitHub. In addition, we provide the data generated by the code, as well as the plots. For instructions on which data to download and when, see docs/source/bench/download_results.md in the main code. The code for the Grinsztajn et al. (2022) benchmark is provided in grinsztajn_benchmarking_code.zip and on GitHub.

Identifier
DOI https://doi.org/10.18419/darus-4255
Metadata Access https://darus.uni-stuttgart.de/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.18419/darus-4255
Provenance
Creator Holzmüller, David; Grinsztajn, Léo; Steinwart, Ingo
Publisher DaRUS
Contributor David Holzmüller; Léo Grinsztajn; Ingo Steinwart; Katharina Strecker; Jérôme Dockès
Publication Year 2024
Funding Reference DFG EXC 2075 - 390740016; GENCI-IDRIS 2023-AD011012804R1; GENCI-IDRIS 2024-AD011012804R2
Rights info:eu-repo/semantics/restrictedAccess
OpenAccess false
Contact David Holzmüller (INRIA - Institut National de Recherche en Informatique et Automatique); Léo Grinsztajn (INRIA - Institut National de Recherche en Informatique et Automatique); Ingo Steinwart (Universität Stuttgart)
Representation
Resource Type Dataset
Format text/plain; application/zip; application/x-gzip
Size 138951917618; 1176073; 91446250; 115859308340; 582041600; 279676568318; 135429315397; 7226554; 138082636521; 4530749440; 3346135040; 118528529176
Version 1.0
Discipline Other
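
Because several of the files listed under Size are very large, it can help to estimate the total download volume before fetching anything. The following is a minimal Python sketch that sums the values from the Size field and prints them in human-readable form. It assumes the values are byte counts (the record does not state the unit), and the names SIZES_BYTES, format_size, and total_bytes are illustrative, not part of the published code.

```python
# Values copied from the "Size" field of this record.
# Assumption: each value is a file size in bytes (the record does not state the unit).
SIZES_BYTES = [
    138951917618, 1176073, 91446250, 115859308340, 582041600, 279676568318,
    135429315397, 7226554, 138082636521, 4530749440, 3346135040, 118528529176,
]


def format_size(num_bytes):
    """Return a human-readable string using binary units (1 KiB = 1024 B)."""
    size = float(num_bytes)
    for unit in ("B", "KiB", "MiB", "GiB"):
        if size < 1024:
            return f"{size:.1f} {unit}"
        size /= 1024
    return f"{size:.1f} TiB"


def total_bytes(sizes):
    """Return the combined size of all listed files."""
    return sum(sizes)


if __name__ == "__main__":
    # Print each file size, largest first, followed by the total.
    for size in sorted(SIZES_BYTES, reverse=True):
        print(format_size(size))
    print("Total:", format_size(total_bytes(SIZES_BYTES)))
```

The record does not map sizes to file names, so the sketch only reports sizes; docs/source/bench/download_results.md in the main code remains the reference for which files are needed for which purpose.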