Network-Aware Replica Optimization in the SCoPE Grid Infrastructure

In a Data Grid, data replication is critical for maximizing overall job throughput. Replication creates copies of data files at different sites according to Replica Optimization strategies, which define when and where replicas should be created or deleted on a per-site basis and which replicas Grid jobs should use. To be effective, these strategies must treat the available network bandwidth as a primary resource, ahead of any consideration of storage or processing power. We present a novel replica management service, integrated with the GlueDomains active network monitoring architecture and designed and implemented within the centralized collective middleware framework of the SCoPE project, that provides network-aware replica optimization for data-intensive applications.
