GobhiSet: Dataset of raw, manually and automatically annotated RGB images across phenology of Brassica oleracea var. Botrytis

This dataset encompasses a compilation of unprocessed aerial RGB images and orthomosaics. These images, captured via a DJI Phantom 4, span several dates and depict Brassica oleracea crops. The images are uniformly distributed across crop spaces and have undergone both manual and automatic annotation. This data pool is engineered to facilitate the detection, segmentation, and growth modelling of crops, utilizing pixel information annotated both manually and automatically. The publicly accessible repository houses 244 raw RGB images, acquired over six distinct dates in October and November of 2020. The experimental farm is located in Portici, Italy. Each raw image bears a dimension of 5472×3648 pixels. The initial three sets of images, captured on October 8, 2020, October 21, 2020, and October 29, 2020, were manually annotated using bounding boxes via the Visual Geometry Group Image Annotator (VIA). These annotations were exported in the Common Objects in Context (COCO) segmentation format. The manual labelling data of the imagery dated October 8, October 21, and October 29, including region and shape attributes, is detailed in JavaScript Object Notation (JSON). These three dates served as training data for the annotator to improve the automated labelling across all dates: 8 October, 21 October, 29 October, 11 November, 18 November, and 25 November. The benchmark annotation was noted to be of 21 October, 2020, in terms of quantitative assessment criteria. Seven classes, designated as Row 1 through Row 7, have been identified for crop labelling within them. Additional attributes such as individual crop ID and the repetitiveness of individual crop specimens are delineated in the Comma Separated Values (CSV) version of the manual annotation. For the generation of automated annotations, the manual annotations were trained over a framework of Grounding DINO + Segment Anything Model (SAM), and the labels were archived in Pascal Visual Object Classes (PASCAL VOC) format. The segmentation masks, derived from automated annotations, are furnished in the form of Portable Network Graphics (PNG) images, catering to three distinct scenarios: aerial images, individual crop rows, and orthomosaics. These automated annotations facilitate the monitoring of growth across the crop phenology, employing evaluation based on binary masks of individually identified crop rows, captured across various dates. The codes utilized for these processes are accessible to ensure transparency and reproducibility. The dataset not only furnishes annotation information but can also assist in the refinement of various machine learning models.

THIS DATASET IS ARCHIVED AT DANS/EASY, BUT NOT ACCESSIBLE HERE. TO VIEW A LIST OF FILES AND ACCESS THE FILES IN THIS DATASET CLICK ON THE DOI-LINK ABOVE

Identifier
DOI https://doi.org/10.17632/dcjjcwc5dh.2
PID https://nbn-resolving.org/urn:nbn:nl:ui:13-cx-mssw
Metadata Access https://easy.dans.knaw.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:easy.dans.knaw.nl:easy-dataset:339616
Provenance
Creator Rana, S
Publisher Data Archiving and Networked Services (DANS)
Contributor Shubham Rana
Publication Year 2024
Rights info:eu-repo/semantics/openAccess; License: http://creativecommons.org/licenses/by/4.0; http://creativecommons.org/licenses/by/4.0
OpenAccess true
Representation
Resource Type Dataset
Discipline Other