Data mining tools to explore exponentially growing archives of environmental sciences.
The increasing data volumes from today’s collection systems and the need of the scientific community to include an integrated and authoritative representation of the natural environment in analysis requires a new approach to data mining, management and access. The natural environment includes elements from multiple domains such as space, terrestrial weather, oceans and terrain.
The aim of the project is to develop tools to do fuzzy queries on terabyte datasets. The ESSE architecture will rely heavily on the fuzzy logic data mining web-services to perform searching and statistical analysis of the distribution of the identified events for the user. It will allow parallel mining over the web services of distributed data sources, possibly from different subject areas of the Earth sciences, but sharing the same metadata scheme and data exchange formats.
- The ESSE toolkit sources and sample database are published at http://sourceforge.net/projects/esse.
- Mikhail Zhizhin, Alexey Poyda, Dmitry Mishin, Dmitry Medvedev, Eric Kihn, and Vassily Lyutsarev, Grid-based data mining with Environmental Scenario Search Engine, in Data mining techniques in grid computing environments, Wiley, 2009
- Mikhail Zhizhin, Eric Kihn, Robin Redmon, Alexey Poyda, Dmitry Mishin, Dmitry Medvedev, and Vassily Lyutsarev, Integrating and mining distributed environmental archives on Grids, in Concurrency and Computation: Practice and Experience, vol. 19, no. 16, pp. 2157 - 2170, November 2007
- Mikhail Zhizhin, Eric Kihn, Vassily Lyutsarev, Sergei Berezin, Alexey Poyda, Dmitry Mishin, Dmitry Medvedev, and Dmitry Voitsekhovsky, Environmental scenario search and visualization, in GIS '07: Proceedings of the 15th annual ACM international symposium on Advances in geographic information systems, ACM, November 2007
- Mikhail Zhizhin, Alexey Poyda, Dmitry Mishin, Dmitry Medvedev, Eric Kihn, and Vassily Lyutsarev, Scenario Search on the Grid of Environmental Data Sources, no. MSR-TR-2006-72, July 2006