A Cloud Computing Service for Remote Sensing Data Reprojection and Science Reduction
Abstract
The combination of internet availability of remote sensing data and emerging cloud computing technologies are enabling new environmental science research. Data products from satellites such as MODIS or GRACE augment ground based measurements from networks such as Fluxnet or WATERS. Cloud computing technologies such as Microsoft’s Azure or Amazon’s EC2 offer the necessary computational, storage, and networking resources to process these large data. However, a significant data entry barrier remains due to the difficulty in gathering and subsequently harmonizing imagery data from heterogeneous sources. To lower this barrier, we have prototyped a general image processing pipeline in the cloud using Azure. Our pipeline automates data upload from known repositories such as the LPDAAC, image reprojection to common space and time representation, hosting an optional reduction algorithm provided by the scientist, and final result delivery to the scientist desktop. We have used our Azure pipeline to compute a single science variable dealing with evapotranspiration from 9 different MODIS products across the US over 10 years. This computation required uploading and reprojecting approximately 4 TB of imagery consuming 70K cpu hours to deliver approximately 50GB resulting images to the scientist desktop. We believe our pipeline can be generalized to various remote sensing data products analyses other than the MODIS.
- Publication:
-
AGU Fall Meeting Abstracts
- Pub Date:
- December 2009
- Bibcode:
- 2009AGUFM.H51H0857L
- Keywords:
-
- 1818 HYDROLOGY / Evapotranspiration;
- 1906 INFORMATICS / Computational models;
- algorithms;
- 1932 INFORMATICS / High-performance computing