dc.description.abstract |
Nowadays, large computing clusters constantly strive for an optimal tradeo between resource e ciency and performance. In this thesis, we are concerned with the e cient use of system resources and we also aim to improve response time of the tasks. We tackle with the challenges of task scheduling on large heterogeneous clusters executing highly heterogeneous bursty workloads with di erent priorities, resource demands, and performance objectives. Firstly, we propose a scheduling algorithm for tasks with communication needs which improves the response time and resource utilization by controlling communication and computation resources at the same time. Secondly, we propose a novel scheduling framework for exploring various aspects of priority scheduling with heterogeneous workloads while investigating the tradeo between evictions and response time. To better understand the impact of evictions, we rst analyze simple eviction policies and wasted resources associated with evictions by using tracedriven simulations. Furthermore, by exploiting the heterogeneity of the workload, we propose a workload-aware slot con guration and task assignment methodology incorporated with slot-based priority scheduling to improve class-based response time and resource e ciency. Finally, we introduce a task scheduling policy which aims to provide scheduling and execution guarantees for low priorities while preserving the performance bene ts of high priority tasks. The proposed scheduling e ectively handles both prioritization and performance issues of low priorities by utilizing a combination of preemptive and non-preemptive scheduling. |
|