Abstract
As more and more data can be generated at a fasterthan- ever rate nowadays, it becomes a challenge to processing large volumes of data for complex data analysis. In order to address performance and cost issues of big data processing on clouds, we present a novel design of adaptive workflow management system which includes a data mining based prediction model, workflow scheduler, and iteration controls to optimize the data processing via iterative workflow tasks. We proposed a new heuristic algorithm, called Upgrade Fit, which dynamically and continuously reallocates multiple types of cloud resources to fulfill the performance and cost requirements. The iterative workflow tasks can be bursty bags of tasks to be executed repetitively for data processing. A real application of weather forecast workflow has been used to evaluate the capability of our system for large volume image data processing. Experimental system has been set up and the results indicate that the system can effectively handle multiple types of cloud resources and optimize the performance iteratively.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 11th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA-13), 16-18 July 2013, Melbourne, Victoria, Australia |
| Publisher | IEEE |
| Pages | 1049-1056 |
| Number of pages | 8 |
| ISBN (Print) | 9780769550220 |
| DOIs | |
| Publication status | Published - 2013 |
| Event | ISPA (Conference) - Duration: 23 Aug 2016 → … |
Conference
| Conference | ISPA (Conference) |
|---|---|
| Period | 23/08/16 → … |
Keywords
- cloud computing
- data processing
- performance
- weather forecasting
- workflow