论文信息 - Minimizing energy dissipation of matrix multiplication kernel on Virtex-II

Minimizing energy dissipation of matrix multiplication kernel on Virtex-II

In this paper, we develop energy-efficient designs for matrix multiplication on FPGAs. To analyze the energy dissipation, we develop a high-level model using domain-specific modeling techniques. In this model, we identify architecture parameters that significantly affect the total energy (system-wide energy) dissipation. Then, we explore design trade-offs by varying these parameters to minimize the system-wide energy. For matrix multiplication, we consider a uniprocessor architecture and a linear array architecture to develop energy-efficient designs. For the uniprocessor architecture, the cache size is a parameter that affects the I/O complexity and the system-wide energy. For the linear array architecture, the amount of storage per processing element is a parameter affecting the system-wide energy. By using maximum amount of storage per processing element and minimum number of multipliers, we obtain a design that minimizes the system-wide energy. We develop several energy-efficient designs for matrix multiplication. For example, for 6×6 matrix multiplication, energy savings of upto 52% for the uniprocessor architecture and 36% for the linear arrary architecture is achieved over an optimized library for Virtex-II FPGA from Xilinx.

Viktor K. Prasanna | Seonil Choi | Ju-Wook Jang

[1] Viktor K. Prasanna,et al. On Synthesizing Optimal Family of Linear Systolic Arrays for Matrix Multiplication , 1991, IEEE Trans. Computers.

[2] Sujit Dey,et al. High-Level Power Analysis and Optimization , 1997 .

[3] Viktor K. Prasanna,et al. Rapid design space exploration of heterogeneous embedded systems using symbolic search and multi-granular simulation , 2002, LCTES/SCOPES '02.

[4] Eike Schmidt,et al. System level optimization and design space exploration for low power , 2001, International Symposium on System Synthesis (IEEE Cat. No.01EX526).

[5] Luca Benini,et al. Regression-based RTL power modeling , 2000, TODE.

[6] Viktor K. Prasanna,et al. Domain-Speci fi c Modeling for Rapid System-Wide Energy Estimation of Recon fi gurable Architectures , 2002 .

[7] Viktor K. Prasanna,et al. A model-based methodology for application specific energy efficient data path design using FPGAs , 2002, Proceedings IEEE International Conference on Application- Specific Systems, Architectures, and Processors.

[8] Abbes Amira,et al. Accelerating Matrix Product on Reconfigurable Hardware for Signal Processing , 2001, FPL.

[9] Viktor K. Prasanna,et al. MILAN: A Model Based Integrated Simulation Framework for Design of Embedded Systems , 2001, OM '01.

[10] H. T. Kung,et al. I/O complexity: The red-blue pebble game , 1981, STOC '81.

[11] Trevor N. Mudge,et al. Power: A First-Class Architectural Design Constraint , 2001, Computer.