Straggler handling approaches in MapReduce framework: a comparative study

Anwar H. Katrawi, Rosni Abdullah, Mohammed Anbar, Ibrahim AlShourbaji, Ammar Kamal Abasi

Abstract


The proliferation of information technology produces a huge amount of data, called big data, that cannot be processed by traditional database systems. These various types of data come from different sources. However, stragglers are a major bottleneck in big data processing, so detecting them early and identifying them accurately can have an important impact on processing performance. This work assesses five straggler identification methods: the Hadoop native scheduler, the LATE Scheduler, Mantri, MonTool, and Dolly. The performance of these techniques was evaluated on three benchmarks: Sort, Grep, and WordCount. The results show that the LATE Scheduler performs best, making it an efficient choice for straggler identification.

Keywords


Dolly; MonTool; Mantri; Hadoop; Spark; Dryad; MapReduce; Distributed Computing; Big Data; Straggler



DOI: http://doi.org/10.11591/ijece.v11i1.pp%25p


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

ISSN 2088-8708, e-ISSN 2722-2578