A review on deep learning in UAV remote sensing
Abstract
Deep Neural Networks (DNNs) learn representation from data with an impressive capability, and brought important breakthroughs for processing images, time-series, natural language, audio, video, and many others. In the remote sensing field, surveys and literature revisions specifically involving DNNs algorithms' applications have been conducted in an attempt to summarize the amount of information produced in its subfields. Recently, Unmanned Aerial Vehicle (UAV)-based applications have dominated aerial sensing research. However, a literature revision that combines both "deep learning" and "UAV remote sensing" thematics has not yet been conducted. The motivation for our work was to present a comprehensive review of the fundamentals of Deep Learning (DL) applied in UAV-based imagery. We focused mainly on describing the classification and regression techniques used in recent applications with UAV-acquired data. For that, a total of 232 papers published in international scientific journal databases was examined. We gathered the published materials and evaluated their characteristics regarding the application, sensor, and technique used. We discuss how DL presents promising results and has the potential for processing tasks associated with UAV-based image data. Lastly, we project future perspectives, commentating on prominent DL paths to be explored in the UAV remote sensing field. This revision consisting of an approach to introduce, commentate, and summarize the state-of-the-art in UAV-based image applications with DNNs algorithms in diverse subfields of remote sensing, grouping it in the environmental, urban, and agricultural contexts.
- Publication:
-
International Journal of Applied Earth Observation and Geoinformation
- Pub Date:
- October 2021
- DOI:
- 10.1016/j.jag.2021.102456
- arXiv:
- arXiv:2101.10861
- Bibcode:
- 2021IJAEO.10202456O
- Keywords:
-
- AdaGrad;
- Adaptive Gradient Algorithm;
- AI;
- Artificial Intelligence;
- ANN;
- Artificial Neural Network;
- CEM;
- Context Enhanced Module;
- CNN;
- Convolutional Neural Network;
- DCGAN;
- Deep Convolutional Generative Adversarial network;
- DDCN;
- Deep Dual-domain Convolutional neural Network;
- DL;
- Deep Learning;
- DNN;
- Deep Neural Network;
- DEM;
- Digital Elevation Model;
- DSM;
- Digital Surface Model;
- FPS;
- Frames per Second;
- GAN;
- Generative Adversarial Network;
- GPU;
- Graphics Processing Unit;
- KL;
- Kullback-Leibler;
- LSTM;
- Long Short-Term Memory;
- IoU;
- Intersection over Union;
- ML;
- Machine Learning;
- MAE;
- Mean Absolute Error;
- MAPE;
- Mean Absolute Percentage Error;
- MRE;
- Mean Relative Error;
- MSE;
- Mean Squared Error;
- MSLE;
- Mean Squared Logarithmic Error;
- MSM;
- Multi-Stage Module;
- MVS;
- Multiview Stereo;
- NAS;
- Network Architecture Search;
- PCA;
- Principal Component Analysis;
- PPM;
- Pyramid Pooling Module;
- r;
- Correlation Coefficient;
- RMSE;
- Root Mean Squared Error;
- RNN;
- Recurrent Neural Network;
- ROC;
- Receiver Operating Characteristics;
- RPA;
- Remotely Piloted Aircraft;
- SAM;
- Spatial Attention Module;
- SGD;
- Stochastic Gradient Descent;
- SfM;
- Structure from Motion;
- UAV;
- Unmanned Aerial Vehicle;
- WOS;
- Web of Science;
- Computer Science - Computer Vision and Pattern Recognition;
- Computer Science - Artificial Intelligence
- E-Print:
- 27 pages, 10 figures