3.8 Article

Analysis and Experimental Study of HDFS Performance

Journal

Publisher

ASSOC INFORMATION COMMUNICATION TECHNOLOGY EDUCATION & SCIENCE
DOI: 10.18421/TEM102-38

Keywords

HDFS; Distributed file systems; Distributed and parallel computing; Hadoop cluster

Ask authors/readers for more resources

In the era of big data, the use of distributed systems like Hadoop is becoming increasingly important. This research aims to experimentally explore the factors influencing the performance of HDFS read/write operations.
In the age of big data, the amount of data that people generate and use on a daily basis has far exceeded the storage and processing capabilities of a single computer system. That motivates the use of distributed big data storage and processing system such as Hadoop. It provides a reliable, horizontally-scalable, fault-tolerant and efficient service, based on the Hadoop Distributed File System (HDFS) and MapReduce. The purpose of this research is to experimentally determine whether (and to what extent) the network communication speed, the file replication factor, the files' sizes and their number, and the location of the HDFS client influence the performance of the HDFS read/write operations.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available