Resource aware scheduling in Hadoop for heterogeneous workloads based on load estimation

dc.contributor.authorKapil, B.S.
dc.contributor.authorSowmya, Kamath S.
dc.date.accessioned2020-03-30T10:23:05Z
dc.date.available2020-03-30T10:23:05Z
dc.date.issued2013
dc.description.abstractCurrently, most cloud based applications require large scale data processing capability. Data to be processed is growing at a rate much faster than available computing power. Hadoop is used to enable distributed processing on large clusters of commodity hardware. In large clusters, the workloads may be heterogeneous in nature, that is, I/O bound, CPU bound or network intensive jobs that demand different types of resources requirement so as to run simultaneously on large cluster. Hadoops job scheduling is based on FIFO where, parallelization based on types of job has not been taken into account for scheduling. In this paper, we propose a new scheduling algorithm for Hadoop based distributed system, based on the classification of workloads to assign a specific category to a particular cluster according to current load of the cluster. The proposed scheduler increases the performance of both CPU and I/O resources in a cluster under heterogeneous workloads, by approximately 12% when compared to Hadoops FIFO scheduler. � 2013 IEEE.en_US
dc.identifier.citation2013 4th International Conference on Computing, Communications and Networking Technologies, ICCCNT 2013, 2013, Vol., , pp.-en_US
dc.identifier.urihttps://idr.nitk.ac.in/jspui/handle/123456789/8942
dc.titleResource aware scheduling in Hadoop for heterogeneous workloads based on load estimationen_US
dc.typeBook chapteren_US

Files