Improving ecological niche models by data mining large environmental datasets for surrogate models
Abstract
WhyWhere is a new ecological niche modeling (ENM) algorithm for mapping and explaining the distribution of species. The algorithm uses image-processing methods to efficiently sift through large amounts of data to find the few variables that best predict species occurrence. The purpose of this paper is to describe and justify the main parameterizations and to show preliminary success at rapidly providing accurate, scalable, and simple ENMs. Preliminary results for six species of plants and animals in different regions indicate a significant (p < 0.01) 14% increase in accuracy over the GARP algorithm using models with few, typically two, variables. The increase is attributed to access to additional data, particularly remotely sensed monthly versus annual climate averages. WhyWhere is also six times faster than GARP on large datasets. A data-mining-based approach with transparent access to remote data archives is a new paradigm for ENM, particularly suited to finding correlates in large databases of fine-resolution surfaces. Software for WhyWhere is freely available, both as a service and in a downloadable desktop form, from the web site http://biodi.sdsc.edu/ww_home.html.
- Publication:
- Ecological Modelling
- Pub Date:
- January 2006
- DOI:
- 10.1016/j.ecolmodel.2005.05.029
- arXiv:
- arXiv:q-bio/0511046
- Bibcode:
- 2006EcMod.192..188S
- Keywords:
- WhyWhere;
- Ecological niche modeling;
- Surrogate models;
- Data mining;
- Remote sensing;
- Quantitative Biology - Quantitative Methods;
- Computer Science - Artificial Intelligence
- E-Print:
- 16 pages, 4 figures, to appear in Ecological Modelling