M.F. Cowlishaw
IBM Systems Journal
Several illustrations of a general technique called the Algorithm and Architecture approach was presented. The programmer controlled unrolling of loops was demonstrated equivalent to customized vectorization of RISC-type code. Its use was illustrated to show that RS/6000 processors could compute the distribution (-1, 1) at the rate of 3.25 multiply-adds. A linear congruential generators, related to the multiplicative congruential generators was also specified.
M.F. Cowlishaw
IBM Systems Journal
Rajeev Gupta, Shourya Roy, et al.
ICAC 2006
Sai Zeng, Angran Xiao, et al.
CAD Computer Aided Design
Liqun Chen, Matthias Enzmann, et al.
FC 2005