Efficient algorithms for multi-dimensional block-cyclic redistribution of arrays
暂无分享,去创建一个
[1] Ken Kennedy,et al. Compilation techniques for block-cyclic distributions , 1994 .
[2] Viktor K. Prasanna,et al. High-performance computing for vision , 1996, Proc. IEEE.
[3] David W. Walker,et al. Redistribution of block‐cyclic data distributions using MPI , 1996 .
[4] Jehoshua Bruck,et al. Efficient algorithms for all-to-all communications in multi-port message-passing systems , 1994, SPAA '94.
[5] Guy L. Steele,et al. The High Performance Fortran Handbook , 1993 .
[6] James Demmel,et al. ScaLAPACK: A Portable Linear Algebra Library for Distributed Memory Computers - Design Issues and Performance , 1995, Proceedings of the 1996 ACM/IEEE Conference on Supercomputing.
[7] Viktor K. Prasanna,et al. Portable Implementation of Real-Time Signal Processing Benchmarks on HPC Platforms , 1998, PARA.
[8] Jack Dongarra,et al. Parallel matrix transpose algorithms on distributed memory concurrent computers , 1993, Proceedings of Scalable Parallel Libraries Conference.
[9] Geoffrey C. Fox,et al. Runtime array redistribution in HPF programs , 1994, Proceedings of IEEE Scalable High Performance Computing Conference.
[10] Viktor K. Prasanna,et al. Efficient Algorithms for Block-Cyclic Redistribution of Arrays , 1999, Algorithmica.
[11] V. K. Prasanna,et al. Communication issues in heterogeneous embedded systems , 1996, Proceedings of the 4th International Workshop on Parallel and Distributed Real-Time Systems.