Eecient Vlsi Implementation of Iterative Solutions to Sparse Linear Systems

We propose a novel way of solving systems of linear equations with sparse coe cient matrices using iterative methods on a VLSI array. The nonzero entries of the coe cient matrix are mapped onto a processor array of size pe pe, where e is the number of nonzero elements, n is the number of equations and e n. The data transport problem that arises because of this mapping is solved using an e cient routing technique. Preprocessing is carried out on the iteration matrix of the system to compute the routing control-words that are used in the data transfer. This results in O(pe) time for each iteration of the method, with a small constant factor. As compared to existing VLSI methods for solving the problem, the proposed method yields a superior time performance, greater ease of programmability and an area e cient design. We also develop a second implementation of our algorithm that uses a slightly higher number of communication steps, but reduces the number of arithmetic operations to O(log e). The latter algorithm is suitable for many other architectures as well. The algorithm can be implemented in O(log e) time using e processors on a hypercube, shu e-exchange, and cubeconnected-cycles. 1

[1]  Franco P. Preparata,et al.  The cube-connected-cycles: A versatile network for parallel computation , 1979, 20th Annual Symposium on Foundations of Computer Science (sfcs 1979).

[2]  Clark D. Thomborson,et al.  Generalized Connection Networks for Parallel Processor Intercommunication , 1978, IEEE Trans. Computers.

[3]  Leslie G. Valiant,et al.  A fast parallel algorithm for routing in permutation networks , 1981, IEEE Transactions on Computers.

[4]  M. H. Schultz,et al.  Topological properties of hypercubes , 1988, IEEE Trans. Computers.

[5]  Jeffrey D Ullma Computational Aspects of VLSI , 1984 .

[6]  Viktor K. Prasanna,et al.  Permutations on Illiac IV-Type Networks , 1986, IEEE Transactions on Computers.

[7]  J. Gillis,et al.  Matrix Iterative Analysis , 1961 .

[8]  Grazia Lotti,et al.  VLSI implementation of iterative methods for the solution of linear systems , 1985, Integr..

[9]  Sartaj Sahni,et al.  Parallel Algorithms to Set Up the Benes Permutation Network , 1982, IEEE Transactions on Computers.

[10]  Harold S. Stone,et al.  Parallel Processing with the Perfect Shuffle , 1971, IEEE Transactions on Computers.

[11]  U. Schwiegelshohn A shortperiodic two-dimensional systolic sorting algorithm , 1988, [1988] Proceedings. International Conference on Systolic Arrays.

[12]  Omar Wing A content-addressable systolic array for sparse matrix computation , 1985, J. Parallel Distributed Comput..

[13]  Charles Clos,et al.  A study of non-blocking switching networks , 1953 .

[14]  V. Benes On rearrangeable three-stage connecting networks , 1962 .

[15]  Uwe Schwiegelshohn,et al.  Sparse matrix-vector multiplication on a systolic array , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[16]  Francesco Romani,et al.  A compact and modular VLSI design for the solution of general sparse linear systems , 1987, Integr..

[17]  Sartaj Sahni,et al.  Data broadcasting in SIMD computers , 1981, IEEE Transactions on Computers.

[18]  J. Ortega Introduction to Parallel and Vector Solution of Linear Systems , 1988, Frontiers of Computer Science.