Evaluation and Refinement of a Tuning Tool for Grid Applications

Tuning the performance of grid applications is cumbersome because it is very difficult to decide which code to look at. In our previous work we proposed a tool to recommend tasks in a task graph that look most promising for improvement. In the present work we first show how to extract task graphs from real grid applications with the help of SCALASCA, then we evaluate the tool with the OptSched benchmark suite of synthetic schedules. Finally, we present some extensions to the tool to adapt it to some typical situations occuring in practice.

[1]  Bernd Mohr,et al.  A scalable tool architecture for diagnosing wait states in massively parallel applications , 2009, Parallel Comput..

[2]  W. Schiffmann,et al.  A COMPREHENSIVE TEST BENCH FOR THE EVALUATION OF SCHEDULING HEURISTICS , 2004 .

[3]  Cristina Boeres,et al.  Dynamic self-scheduling for parallel applications with task dependencies , 2008, MGC '08.

[4]  M. Schulz,et al.  Extracting Critical Path Graphs from MPI Applications , 2005, 2005 IEEE International Conference on Cluster Computing.

[5]  Warren Smith,et al.  Scheduling with advanced reservations , 2000, Proceedings 14th International Parallel and Distributed Processing Symposium. IPDPS 2000.

[6]  Wolfram Schiffmann,et al.  Guiding performance tuning for grid schedules , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[7]  Mark J. Clement,et al.  The Performance Impact of Advance Reservation Meta-scheduling , 2000, JSSPP.

[8]  Klara Nahrstedt,et al.  A distributed resource management architecture that supports advance reservations and co-allocation , 1999, 1999 Seventh International Workshop on Quality of Service. IWQoS'99. (Cat. No.98EX354).

[9]  Bernd Mohr,et al.  Scalable Parallel Trace-Based Performance Analysis , 2006, PVM/MPI.

[10]  Ian T. Foster,et al.  MPICH-G2: A Grid-enabled implementation of the Message Passing Interface , 2002, J. Parallel Distributed Comput..

[11]  Felix Wolf,et al.  Scalable timestamp synchronization for event traces of message-passing applications , 2009, Parallel Comput..

[12]  Alexandre C. Sena,et al.  Distributed and dynamic self-scheduling of parallel MPI Grid applications: Research Articles , 2007 .

[13]  Rajkumar Buyya,et al.  A Taxonomy of Workflow Management Systems for Grid Computing , 2005, Proceedings of the 38th Annual Hawaii International Conference on System Sciences.