Beomseok Nam, Henrique Andrade, et al.
ACM/IEEE SC 2006
Several illustrations of a general technique called the Algorithm and Architecture approach was presented. The programmer controlled unrolling of loops was demonstrated equivalent to customized vectorization of RISC-type code. Its use was illustrated to show that RS/6000 processors could compute the distribution (-1, 1) at the rate of 3.25 multiply-adds. A linear congruential generators, related to the multiplicative congruential generators was also specified.
Beomseok Nam, Henrique Andrade, et al.
ACM/IEEE SC 2006
Apostol Natsev, Alexander Haubold, et al.
MMSP 2007
Chi-Leung Wong, Zehra Sura, et al.
I-SPAN 2002
Zohar Feldman, Avishai Mandelbaum
WSC 2010