TY - JOUR
T1 - Cloud storage reliability for Big Data applications : a state of the art survey
AU - Nachiappan, Rekha
AU - Javadi, Bahman
AU - Calheiros, Rodrigo N.
AU - Matawie, Kenan M.
PY - 2017
Y1 - 2017
N2 - Cloud storage systems are now mature enough to handle a massive volume of heterogeneous and rapidly changing data, which is known as Big Data. However, failures are inevitable in cloud storage systems as they are composed of large scale hardware components. Improving fault tolerance in cloud storage systems for Big Data applications is a significant challenge. Replication and Erasure coding are the most important data reliability techniques employed in cloud storage systems. Both techniques have their own trade-off in various parameters such as durability, availability, storage overhead, network bandwidth and traffic, energy consumption and recovery performance. This survey explores the challenges involved in employing both techniques in cloud storage systems for Big Data applications with respect to the aforementioned parameters. In this paper, we also introduce a conceptual hybrid technique to further improve reliability, latency, bandwidth usage, and storage efficiency of Big Data applications on cloud computing.
AB - Cloud storage systems are now mature enough to handle a massive volume of heterogeneous and rapidly changing data, which is known as Big Data. However, failures are inevitable in cloud storage systems as they are composed of large scale hardware components. Improving fault tolerance in cloud storage systems for Big Data applications is a significant challenge. Replication and Erasure coding are the most important data reliability techniques employed in cloud storage systems. Both techniques have their own trade-off in various parameters such as durability, availability, storage overhead, network bandwidth and traffic, energy consumption and recovery performance. This survey explores the challenges involved in employing both techniques in cloud storage systems for Big Data applications with respect to the aforementioned parameters. In this paper, we also introduce a conceptual hybrid technique to further improve reliability, latency, bandwidth usage, and storage efficiency of Big Data applications on cloud computing.
KW - big data
KW - cloud computing
KW - fault-tolerant computing
KW - replication (experimental design)
UR - http://handle.westernsydney.edu.au:8081/1959.7/uws:43120
U2 - 10.1016/j.jnca.2017.08.011
DO - 10.1016/j.jnca.2017.08.011
M3 - Article
SN - 1084-8045
VL - 97
SP - 35
EP - 47
JO - Journal of Network and Computer Applications
JF - Journal of Network and Computer Applications
ER -