Ganesh Bikshandi, Gheorghe Almasi, et al.
PGAS 2009
Tiling has proven to be an effective mechanism to develop high performance implementations of algorithms. Tiling can be used to organize computations so that communication costs in parallel programs are reduced and locality in sequential codes or sequential components of parallel programs is enhanced. In this paper, a data type - Hierarchically Tiled Arrays or HTAs - that facilitates the direct manipulation of tiles is introduced. HTA operations are overloaded array operations. We argue that the implementation of HTAs in sequential OO languages transforms these languages into powerful tools for the development of high-performance parallel codes and codes with high degree of locality. To support this claim, we discuss our experiences with the implementation of HTAs for MATLAB and C++ and the rewriting of the NAS benchmarks and a few other programs into HTA-based parallel form. Copyright © 2006 ACM.
Ganesh Bikshandi, Gheorghe Almasi, et al.
PGAS 2009
Huang Chao, Zheng Gengbin, et al.
PPoPP 2006
Ganesh Bikshandi, Basilio B. Fraguela, et al.
LCPC 2004
Guo Jia, Ganesh Bikshandi, et al.
IPDPS 2006