Conference paper
Constraint graph analysis of multithreaded programs
Harold W. Cain, Mikko H. Lipasti, et al.
Journal of Instruction-Level Parallelism
Value-based replay eliminates the need for content-addressable memories in the load queue, removing one barrier to scalable out-of-order instruction windows. Instead, correct memory ordering is maintained by simply re-executing certain load instructions in program order. A set of novel filtering heuristics reduces the average additional cache bandwidth demanded by value-based replay to less than 3.5 percent.
Harold W. Cain, Mikko H. Lipasti, et al.
Journal of Instruction-Level Parallelism
Christoph Von Praun, Harold W. Cain, et al.
ISCA 2006
Harold W. Cain, Mikko H. Lipasti
SPLASH 2012
Cǎlin Caşcaval, Colin Blundell, et al.
Queue