Using machine learning to generate an open-access cropland map from satellite images time series in the Indian Himalayan region
Abstract
Crop maps are crucial for agricultural monitoring and food management and can additionally support domain-specific applications, such as setting cold supply chain infrastructure in developing countries. Machine learning (ML) models, combined with freely-available satellite imagery, can be used to produce cost-effective and high spatial-resolution crop maps. However, accessing ground truth data for supervised learning is especially challenging in developing countries due to factors such as smallholding and fragmented geography, which often results in a lack of crop type maps or even reliable cropland maps. Our area of interest for this study lies in Himachal Pradesh, India, where we aim at producing an open-access binary cropland map at 10-m resolution for the Kullu, Shimla, and Mandi districts. To this end, we developed an ML pipeline that relies on Sentinel-2 satellite images time series. We investigated two pixel-based supervised classifiers, support vector machines (SVM) and random forest (RF), which are used to classify per-pixel time series for binary cropland mapping. The ground truth data used for training, validation and testing was manually annotated from a combination of field survey reference points and visual interpretation of very high resolution (VHR) imagery. We trained and validated the models via spatial cross-validation to account for local spatial autocorrelation and improve the generalization capability of the model. We tested the model on hold out test sets of each district, achieving an average accuracy for the RF (our best model) of 87%. We noticed NIR band at the early and late stage of the apple harvest season (main crop in the region) to be of critical importance for the model. Finally, we used this model to generate a cropland map for three districts of Himachal Pradesh, spanning 14,600 km2, which improves the resolution and quality of existing public maps, and made the code open-source.
- Publication:
-
Remote Sensing Applications: Society and Environment
- Pub Date:
- November 2023
- DOI:
- arXiv:
- arXiv:2203.14673
- Bibcode:
- 2023RSASE..3201057L
- Keywords:
-
- Cropland mapping;
- Smallholders;
- Remote sensing;
- High-altitude region;
- Random forest;
- Feature engineering;
- Google earth engine;
- Sentinel-2;
- Computer Science - Computer Vision and Pattern Recognition;
- Computer Science - Machine Learning;
- I.4.6;
- I.5.2