Vol.10, No.2, May 2021.                                                                                                                                                                                    ISSN: 2217-8309

                                                                                                                                                                                                                         eISSN: 2217-8333


TEM Journal



Association for Information Communication Technology Education and Science

Analysis and Experimental Study of HDFS Performance


Yordan Kalmukov, Milko Marinov, Tsvetelina Mladenova, Irena Valova


© 2021 Yordan Kalmukov, published by UIKTEN. This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. (CC BY-NC-ND 4.0)


Citation Information: TEM Journal. Volume 10, Issue 2, Pages 806-814, ISSN 2217-8309, DOI: 10.18421/TEM102-38, May 2021.


Received: 24 January 2021.

Revised:   18 March 2021.
Accepted: 23 March 2021.
Published: 27 May 2021.




In the age of big data, the amount of data that people generate and use on a daily basis has far exceeded the storage and processing capabilities of a single computer system. That motivates the use of distributed big data storage and processing system such as Hadoop. It provides a reliable, horizontallyscalable, fault-tolerant and efficient service, based on the Hadoop Distributed File System (HDFS) and MapReduce. The purpose of this research is to experimentally determine whether (and to what extent) the network communication speed, the file replication factor, the files’ sizes and their number, and the location of the HDFS client influence the performance of the HDFS read/write operations.


Keywords –HDFS, Distributed file systems, Distributed and parallel computing, Hadoop cluster.



Full text PDF >  



Copyright © 2021 UIKTEN
Copyright licence: All articles are licenced via Creative Commons CC BY-NC-ND 4.0 licence