Code and Data for: Better by default: Strong pre-tuned MLPs and boosted trees on tabular data

Dataset

DOI

This dataset contains code and data for our paper "Better by default: Strong pre-tuned MLPs and boosted trees on tabular data". The main code is provided in pytabkit_code.zip and contains further documentation in README.md and the docs folder. The main code is also provided on GitHub. Here, we additionally provide the data that is generated by the code as well as the plots. See the documentation in docs/source/bench/download_results.md in the main code for instructions on how/when to download which data. The code for the Grinsztajn et al. (2022) benchmark is provided in grinsztajn_benchmarking_code.zip and on GitHub.

Identifier
DOI	https://doi.org/10.18419/darus-4255
Metadata Access	https://darus.uni-stuttgart.de/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.18419/darus-4255

Provenance
Creator	Holzmüller, David ; Grinsztajn, Léo ; Steinwart, Ingo
Publisher	DaRUS
Contributor	David Holzmüller; Léo Grinsztajn; Ingo Steinwart; Katharina Strecker; Jérôme Dockès
Publication Year	2024
Funding Reference	DFG EXC 2075 - 390740016 ; GENCI-IDRIS 2023-AD011012804R1 ; GENCI-IDRIS 2024-AD011012804R2
Rights	info:eu-repo/semantics/restrictedAccess
OpenAccess	false
Contact	David Holzmüller (INRIA - Institut National de Recherche en Informatique et Automatique); Léo Grinsztajn (INRIA - Institut National de Recherche en Informatique et Automatique); Ingo Steinwart (Universität Stuttgart)

Representation
Resource Type	Dataset
Format	text/plain; application/zip; application/x-gzip
Size	138951917618; 1176073; 91446250; 115859308340; 582041600; 279676568318; 135429315397; 7226554; 138082636521; 4530749440; 3346135040; 118528529176
Version	1.0
Discipline	Other