Capturing Node Resource Status and Classifying Workload for Map Reduce Resource Aware Scheduler

dc.contributor.author: Mude, R.G.
dc.contributor.author: Betta, A.
dc.contributor.author: Debbarma, A.
dc.date.accessioned: 2026-02-06T06:39:44Z
dc.date.issued: 2015
dc.description.abstract: There has been enormous growth in the amount of digital data, and numerous software frameworks have been built to process it. Hadoop MapReduce is one such popular framework, which processes large data sets on commodity hardware. The job scheduler is a key component of Hadoop, responsible for assigning tasks to nodes. The existing MapReduce scheduler assigns tasks to nodes without considering node heterogeneity, workload type, or the amount of available resources. This leads to a node being overburdened by one type of job and reduces the overall throughput. In this paper, we propose a new scheduler which captures the node resource status after every heartbeat, classifies jobs into two types, CPU bound and IO bound, and assigns each task to the node with the lower CPU/IO utilization. The experimental results show an improvement of 15-20% on a heterogeneous cluster and around 10% on a homogeneous cluster with respect to the Hadoop native scheduler. © Springer India 2015.
dc.identifier.citation: Advances in Intelligent Systems and Computing, 2015, Vol.309 AISC, VOLUME 2, p. 247-257
dc.identifier.issn: 21945357
dc.identifier.uri: https://doi.org/10.1007/978-81-322-2009-1_29
dc.identifier.uri: https://idr.nitk.ac.in/handle/123456789/32499
dc.publisher: Springer Verlag service@springer.de
dc.subject: Hadoop
dc.subject: Heterogeneous cluster
dc.subject: Homogeneous cluster
dc.subject: MapReduce
dc.subject: Scheduler
dc.title: Capturing Node Resource Status and Classifying Workload for Map Reduce Resource Aware Scheduler
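
The scheduling idea summarized in the abstract (record each node's resource status at heartbeat time, label a job as CPU bound or IO bound, then send the task to the node with the lowest utilization of the matching resource) can be illustrated with a minimal sketch. This is not the authors' implementation and does not use the Hadoop API. The names NodeStatus, classify_job, and pick_node are hypothetical, and the classification rule (compare CPU time against IO time) is an assumption, since the abstract does not state the criterion the paper uses.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class NodeStatus:
    """Resource snapshot for one node, as it might be reported in a heartbeat."""
    name: str
    cpu_util: float  # fraction of CPU in use, 0.0 to 1.0 (assumed unit)
    io_util: float   # fraction of IO capacity in use, 0.0 to 1.0 (assumed unit)


def classify_job(cpu_seconds: float, io_seconds: float) -> str:
    """Label a job 'cpu' or 'io' by whichever cost dominates.

    Assumption: the paper's actual classification criterion is not given
    in the abstract; this rule is only a stand-in.
    """
    return "cpu" if cpu_seconds >= io_seconds else "io"


def pick_node(job_type: str, nodes: List[NodeStatus]) -> NodeStatus:
    """Return the node with the lowest utilization of the resource the job needs."""
    if not nodes:
        raise ValueError("no nodes available")
    if job_type == "cpu":
        return min(nodes, key=lambda n: n.cpu_util)
    return min(nodes, key=lambda n: n.io_util)
```

With two nodes, one busy on CPU and one busy on IO, a job labelled "cpu" goes to the node with the lower cpu_util and a job labelled "io" goes to the node with the lower io_util. A real scheduler would also need to refresh these snapshots on every heartbeat and account for free task slots, which this sketch leaves out.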