MASSIVE: Model Assessment and Stochastic Search for Instrumental Variable Estimation

DOI

The recent availability of huge, many-dimensional data sets, like those arising from genome-wide association studies (GWAS), provides many opportunities for strengthening causal inference. One popular approach is to utilize these many-dimensional measurements as instrumental variables (instruments) for improving the causal effect estimate between other pairs of variables. Unfortunately, searching for proper instruments in a many-dimensional set of candidates is a daunting task due to the intractable model space and the fact that we cannot directly test which of these candidates are valid. We propose a general and efficient causal inference algorithm (MASSIVE) consisting of Model Assessment and Stochastic Search for Instrumental Variable Estimation. The MASSIVE algorithm accounts for model uncertainty by performing Bayesian model averaging over the most promising many dimensional instrumental variable models, while at the same time employing weaker assumptions regarding the data generating process compared to similar methods.The data set contains source code implementing the MASSIVE algorithm, which is described in the article titled "MASSIVE: Tractable and Robust Bayesian Learning of Many-Dimensional Instrumental Variable Models"(http://proceedings.mlr.press/v124/gabriel-bucur20a.html) by Ioan Gabriel Bucur, Tom Claassen and Tom Heskes. The data set also contains simulated data necessary for reproducing the figures in the article as well as routines necessary for recreating it. This research is presented in Chapter 5 of the PhD thesis titled "Being Bayesian about Causal Inference" byIoan Gabriel Bucur. The code is written in the R and C++ programming languages.

Identifier
DOI https://doi.org/10.17026/dans-xuj-vggs
Metadata Access https://phys-techsciences.datastations.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.17026/dans-xuj-vggs
Provenance
Creator I.G. Bucur; T. Claassen; T.M. Heskes
Publisher DANS Data Station Phys-Tech Sciences
Contributor RU Radboud University
Publication Year 2020
Rights CC BY 4.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/licenses/by/4.0
OpenAccess true
Contact RU Radboud University
Representation
Resource Type Dataset
Format text/xml; text/plain; application/zip; text/html
Size 5092; 626; 17778; 9656; 136533023
Version 1.0
Discipline Life Sciences; Medicine