Global-scale mining polygons (Version 1)

DOI

This data set provides spatially explicit estimates of the area directly used for surface mining on a global scale. It contains more than 21,000 polygons of activities related to mining, mainly of coal and metal ores. Several data sources were compiled to identify the approximate location of mines active at any time between the years 2000 to 2017. This data set does not cover all existing mining locations across the globe. The polygons were delineated by experts using Sentinel-2 cloudless (https://s2maps.eu by EOX IT Services GmbH (contains modified Copernicus Sentinel data 2017 & 2018)) and very high-resolution satellite images available from Google Satellite and Bing Imagery. The derived polygons cover the direct land used by mining activities, including open cuts, tailing dams, waste rock dumps, water ponds, and processing infrastructure. The main data set consists of a GeoPackage (GPKG) file, including the following variables: ISO3_CODE, COUNTRY_NAME, AREA in squared kilometres, FID with the feature ID, and geom in geographical coordinates WGS84. The summary of the mining area per country is available in comma-separated values (CSV) file, including the following variables: ISO3_CODE, COUNTRY_NAME, AREA in squared kilometers, and N_FEATURES number of mapped features. Grid data sets with the mining area per cell were derived from the polygons. The grid data is available at 30 arc-second resolution (approximately 1x1 km at the equator), 5 arc-minute (approximately 10x10 km at the equator), and 30 arc-minute resolution (approximately 55x55 km at the equator). We performed an independent validation of the mining data set using control points. For that, we draw a 1,000 random samples stratified between two classes: mine and no-mine. The control points are also available as a GPKG file, including the variables: MAPPED, REFERENCE, FID with the feature ID, and geom in geographical coordinates WGS84. The overall accuracy calculated from the control points was 88.4%, other accuracy metrics are shown below. Confusion Matrix and Statistics ReferencePrediction Mine No-mine Mine 394 106 No-mine 10 490 Accuracy : 0.88495% CI : (0.8625, 0.9032)No Information Rate : 0.596P-Value [Acc > NIR] : < 2.2e-16 Kappa : 0.768 Mcnemar's Test P-Value : < 2.2e-16 Sensitivity : 0.9752Specificity : 0.8221Pos Pred Value : 0.7880Neg Pred Value : 0.9800Precision : 0.7880Recall : 0.9752F1 : 0.8717Prevalence : 0.4040Detection Rate : 0.3940Detection Prevalence : 0.5000Balanced Accuracy : 0.8987This work was supported by the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme grant number 725525 FINEPRINT project (https://www.fineprint.global/).

Identifier
DOI https://doi.org/10.1594/PANGAEA.910894
Related Identifier https://doi.org/10.1594/PANGAEA.942325
Related Identifier https://doi.org/10.1038/s41597-020-00624-w
Metadata Access https://ws.pangaea.de/oai/provider?verb=GetRecord&metadataPrefix=datacite4&identifier=oai:pangaea.de:doi:10.1594/PANGAEA.910894
Provenance
Creator Maus, Victor ORCID logo; Giljum, Stefan ORCID logo; Gutschlhofer, Jakob ORCID logo; da Silva, Dieison M ORCID logo; Probst, Michael ORCID logo; Gass, Sidnei L B ORCID logo; Luckeneder, Sebastian (ORCID: 0000-0001-6735-301X); Lieber, Mirko (ORCID: 0000-0002-7152-486X); McCallum, Ian
Publisher PANGAEA
Publication Year 2020
Funding Reference Horizon 2020 https://doi.org/10.13039/501100007601 Crossref Funder ID 725525 https://cordis.europa.eu/project/id/725525 Spatially explicit material footprints: fine-scale assessment of Europe's global environmental and social impacts
Rights Creative Commons Attribution-ShareAlike 4.0 International; https://creativecommons.org/licenses/by-sa/4.0/
OpenAccess true
Representation
Resource Type Dataset
Format application/zip
Size 18.4 MBytes
Discipline Environmental Research; Geosciences; Land Use; Natural Sciences