The solar wind, a stream of charged particles originating from the Sun and transcending interplanetary space, poses risks to technology and astronauts. In particular geomagnetic storm, can lead to severe damage. One major cause of geomagnetic storms are high-speed solar wind streams (HSSs), emitted by coronal holes on the solar surface.
This data set provides (1) coronal hole segmentation maps, extracted from solar EUV images, (2) the heliospheric latitude of Earth, needed to compute coronal hole area features, and (3) a machine learning data set, consisting of input features related to coronal holes and the solar wind speed at Earth as the target output, used to predict the solar wind speed with a lead time of four days. Additionally, we provide (4) a list of high-speed solar wind streams (HSSs) and coronal mass ejections (CMEs), which can be used for investigating the effectiveness of a prediction model with regards to HSSs and CMEs. All data is provided as used in the publication Collin et al. (2024) TBD. For all details on the data preparation and usage, we refer to the original paper.