Shifting from Stewardship to Analytics of Massive Science Data
Abstract
Currently, the analysis of large data collections is executed through traditional computational and data analysis approaches, which require users to bring data to their desktops and perform local data analysis. Data collection, archiving and analysis from future remote sensing missions, be it from earth science satellites, planetary robotic missions, or massive radio observatories may not scale as more capable instruments stress existing architectural approaches and systems due to more continuous data streams, data from multiple observational platforms, and measurements and models from different agencies. A new paradigm is needed in order to increase the productivity and effectiveness of scientific data analysis. This paradigm must recognize that architectural choices, data processing, management, analysis, etc are interrelated, and must be carefully coordinated in any system that aims to allow efficient, interactive scientific exploration and discovery to exploit massive data collections. Future observational systems, including satellite and airborne experiments, and research in climate modeling will significantly increase the size of the data requiring new methodological approaches towards data analytics where users can more effectively interact with the data and apply automated mechanisms for data reduction, reduction and fusion across these massive data repositories. This presentation will discuss architecture, use cases, and approaches for developing a big data analytics strategy across multiple science disciplines.
- Publication:
-
AGU Fall Meeting Abstracts
- Pub Date:
- December 2015
- Bibcode:
- 2015AGUFMIN23C1738C
- Keywords:
-
- 1914 Data mining;
- INFORMATICS;
- 1918 Decision analysis;
- INFORMATICS;
- 1942 Machine learning;
- INFORMATICS;
- 1968 Scientific reasoning/inference;
- INFORMATICS