The manifest and store data of 870,515 Android mobile applications

We built a crawler to collect data from the Google Play store, including each application's metadata and APK file. The manifest files were extracted from the APK files and then processed to extract features. The dataset comprises 870,515 records (apps), with 48 features per app. It was used to build and test two bootstrap-aggregating (bagging) ensembles of XGBoost machine learning classifiers. The data were collected between April 2017 and November 2018. We then checked the status of these applications on three occasions: December 2018, February 2019, and May-June 2019.
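
To illustrate the manifest-processing step described above, here is a minimal sketch of turning one manifest into a feature record. It is not the authors' pipeline. It assumes the manifest has already been decoded from the APK's binary XML into plain text by an external tool, and the function name `manifest_features` and the handful of counts it returns are hypothetical stand-ins, since the record does not list the 48 features actually used.

```python
import xml.etree.ElementTree as ET

# Namespace used for android:* attributes in a decoded AndroidManifest.xml.
ANDROID_NS = "http://schemas.android.com/apk/res/android"


def manifest_features(manifest_xml: str) -> dict:
    """Derive a few illustrative features from a decoded manifest.

    Hypothetical helper: the feature names here are assumptions,
    not the 48 features shipped in the dataset.
    """
    root = ET.fromstring(manifest_xml)
    name_attr = f"{{{ANDROID_NS}}}name"
    # Collect the names of all requested permissions.
    permissions = sorted(
        el.get(name_attr)
        for el in root.iter("uses-permission")
        if el.get(name_attr)
    )
    return {
        "package": root.get("package"),
        "n_permissions": len(permissions),
        "n_activities": sum(1 for _ in root.iter("activity")),
        "n_services": sum(1 for _ in root.iter("service")),
        "n_receivers": sum(1 for _ in root.iter("receiver")),
        "permissions": permissions,
    }


# Made-up sample manifest, used only to exercise the function.
SAMPLE = """<manifest xmlns:android="http://schemas.android.com/apk/res/android"
    package="com.example.demo">
  <uses-permission android:name="android.permission.INTERNET"/>
  <uses-permission android:name="android.permission.CAMERA"/>
  <application>
    <activity android:name=".MainActivity"/>
    <service android:name=".SyncService"/>
  </application>
</manifest>"""

features = manifest_features(SAMPLE)
```

For the sample manifest, `features` holds the package name, two permissions, one activity, one service, and no receivers. A real pipeline would apply such a function to every decoded manifest and join the result with the store metadata for the same app.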

Identifier
DOI https://doi.org/10.34894/H0YJFT
Metadata Access https://dataverse.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.34894/H0YJFT
Provenance
Creator Mohsen, Fadi; Karastoyanova, Dimka; Azzopardi, George
Publisher DataverseNL
Contributor Digital Competence Centre; Mohsen, Fadi
Publication Year 2022
Rights CC0 Waiver; info:eu-repo/semantics/openAccess; https://creativecommons.org/publicdomain/zero/1.0/
OpenAccess true
Contact Digital Competence Centre (University of Groningen)
Representation
Resource Type textual data; Dataset
Format application/zip
Size 202636617
Version 1.0
Discipline Other