BigDataSDNSim : a simulator for analyzing big data applications in software-defined cloud data centers

Khaled Alwasel, Rodrigo N. Calheiros, Saurabh Garg, Rajkumar Buyya, Mukaddim Pathan, Dimitrios Georgakopoulos, Rajiv Ranjan

Research output: Contribution to journalArticlepeer-review

14 Citations (Scopus)

Abstract

The integration and crosscoordination of big data processing and software-defined networking (SDN) are vital for improving the performance of big data applications. Various approaches for combining big data and SDN have been investigated by both industry and academia. However, empirical evaluations of solutions that combine big data processing and SDN are extremely costly and complicated. To address the problem of effective evaluation of solutions that combine big data processing with SDN, we present a new, self-contained simulation tool named BigDataSDNSim that enables the modeling and simulation of the big data management system YARN, its related programming models MapReduce, and SDN-enabled networks in a cloud computing environment. BigDataSDNSim supports cost-effective and easy to conduct experimentation in a controllable, repeatable, and configurable manner. The article illustrates the simulation accuracy and correctness of BigDataSDNSim by comparing the behavior and results of a real environment that combines big data processing and SDN with an equivalent simulated environment. Finally, the article presents two uses cases of BigDataSDNSim, which exhibit its practicality and features, illustrate the impact of data replication mechanisms of MapReduce in Hadoop YARN, and show the superiority of SDN over traditional networks to improve the performance of MapReduce applications.
Original languageEnglish
Pages (from-to)893-920
Number of pages28
JournalSoftware: Practice and Experience
Volume51
Issue number5
DOIs
Publication statusPublished - May 2021

Bibliographical note

Publisher Copyright:
© 2020 John Wiley & Sons Ltd

Fingerprint

Dive into the research topics of 'BigDataSDNSim : a simulator for analyzing big data applications in software-defined cloud data centers'. Together they form a unique fingerprint.

Cite this