Improving Job Scheduling in GRID Environments with Use of Simple Machine Learning Methods

This paper presents an attempt to improve job scheduling over heterogeneous GRID nodes by employing machine learning methods. Our proposed architecture takes into account the fact that GRID frameworks and their modules are not easy to modify or re-implement. Our aim is therefore to provide a plug-in that can be easily added to existing frameworks, avoiding significant and time-consuming modifications. Furthermore, we assume that the existing scheduling algorithm in the framework should not be completely overridden; its decision should be modified only when historical data suggest that the modification is likely to yield a better result. Finally, we focus on the use of simple, off-the-shelf machine learning methods in a black-box manner with internal parameter optimization. We present three experiments within a simulated environment, performed with synthetic data designed to congest the system. The results show that improvements over the simple scheduling algorithms can be made.
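The plug-in idea described above can be illustrated with a minimal sketch. It is not the architecture evaluated in the paper: the names `base_schedule` and `AdvisoryPlugin`, the parameters `min_samples` and `margin`, and the per-node mean runtime used as a predictor are all hypothetical, chosen only to show a wrapper that keeps the framework's decision unless historical data suggest a clearly better node.

```python
from collections import defaultdict
from statistics import mean


def base_schedule(queue_lengths):
    """Stand-in for the framework's own scheduler: pick the shortest queue."""
    return min(queue_lengths, key=lambda node: (queue_lengths[node], node))


class AdvisoryPlugin:
    """Hypothetical plug-in that overrides the base choice only on evidence."""

    def __init__(self, min_samples=3, margin=0.10):
        self.history = defaultdict(list)  # (job_type, node) -> observed runtimes
        self.min_samples = min_samples    # assumed minimum history per pair
        self.margin = margin              # assumed required relative gain

    def record(self, job_type, node, runtime):
        """Store the observed runtime of a finished job."""
        self.history[(job_type, node)].append(runtime)

    def estimate(self, job_type, node, queue_length):
        """Crude completion-time estimate; None if history is too thin."""
        samples = self.history.get((job_type, node), [])
        if len(samples) < self.min_samples:
            return None
        # Assumption: every queued job costs about the mean observed runtime.
        return mean(samples) * (queue_length + 1)

    def schedule(self, job_type, queue_lengths):
        default = base_schedule(queue_lengths)
        default_est = self.estimate(job_type, default, queue_lengths[default])
        if default_est is None:
            return default  # no evidence: keep the framework's decision
        best, best_est = default, default_est
        for node, length in queue_lengths.items():
            est = self.estimate(job_type, node, length)
            if est is not None and est < best_est:
                best, best_est = node, est
        # Override only if the predicted gain exceeds the safety margin.
        if best_est < (1 - self.margin) * default_est:
            return best
        return default
```

With no recorded history, `schedule` returns whatever `base_schedule` returns. Once enough runtimes are recorded, it switches to another node only when the estimated completion time there is lower by more than `margin`. In this sketch a learned model could replace the mean-based `estimate` without changing the rest of the wrapper.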
