Automatic parallelization of Simulink applications
Arquimedes Canedo, Takeo Yoshizawa, et al.
CGO 2010
This work presents a static method, implemented in a compiler, for extracting high instruction-level parallelism for the 32-bit QueueCore, a processor based on queue computation. The instructions of a queue processor read and write their operands implicitly, which keeps instructions short and programs free of false dependencies. This characteristic allows maximum parallelism to be exploited and improves code density. Compiling for the QueueCore requires a new approach because the concept of registers disappears. We propose a new, efficient code generation algorithm for the QueueCore. For a set of numerical benchmark programs, our compiler extracts more parallelism than the optimizing compiler for a RISC machine by a factor of 1.38. By using the QueueCore's reduced instruction set, we generate code that is 20% and 26% denser than the code for two embedded RISC processors, respectively. © 2010 Springer Science+Business Media, LLC.
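To make the idea of implicit operands concrete, the following is a minimal sketch of a queue machine, not the QueueCore instruction set or the paper's code generation algorithm. It assumes a hypothetical three-operation instruction set (`add`, `sub`, `mul`) plus a load (`ld`), and a hand-written program for `(a + b) * (c - d)` laid out in level order (leaves first, root last). Every arithmetic instruction takes its operands from the head of a FIFO queue and appends its result to the tail, so no instruction names a register.

```python
from collections import deque
import operator

# Hypothetical mini instruction set for illustration only (not the QueueCore ISA).
OPS = {"add": operator.add, "sub": operator.sub, "mul": operator.mul}


def run_queue_program(program, env):
    """Execute instructions against a FIFO operand queue and return the final value."""
    queue = deque()
    for instr in program:
        if instr[0] == "ld":
            # Loads append a value to the tail of the queue.
            queue.append(env[instr[1]])
        else:
            # Arithmetic reads two operands from the head and writes to the tail.
            lhs = queue.popleft()
            rhs = queue.popleft()
            queue.append(OPS[instr[0]](lhs, rhs))
    return queue.popleft()


# (a + b) * (c - d), emitted level by level: leaves, then add/sub, then mul.
program = [
    ("ld", "a"), ("ld", "b"), ("ld", "c"), ("ld", "d"),
    ("add",), ("sub",),
    ("mul",),
]

result = run_queue_program(program, {"a": 1, "b": 2, "c": 7, "d": 4})
print(result)  # (1 + 2) * (7 - 4) = 9
```

In this sketch the `add` and `sub` instructions sit next to each other and share no named storage, which illustrates why a queue program has no register-induced false dependencies between them.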