A scalable randomized least squares solver for dense overdetermined systems

Chander Iyer; Haim Avron; Georgios Kollias; Yves Ineichen; Christopher Carothers; Petros Drineas

doi:10.1145/2832080.2832083

ScalA 2015

Conference paper

15 Nov 2015

Abstract

We present a fast randomized least-squares solver for distributed-memory platforms. Our solver is based on the Blendenpik algorithm, but employs a batchwise randomized unitary transformation scheme. The batchwise transformation enables our algorithm to scale the distributed-memory vanilla implementation of Blendenpik by up to 3× and provides up to 7.5× speedup over a state-of-the-art scalable least-squares solver built on the classic QR-based algorithm. Experimental evaluations on terabyte-scale matrices demonstrate excellent speedups on up to 16384 cores on a Blue Gene/Q supercomputer.
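For readers unfamiliar with Blendenpik-style solvers, the following is a minimal single-node sketch of the general idea: mix the rows of the matrix with a randomized orthonormal transform, sample a few rows, take the R factor of the sample, and use it as a preconditioner for an iterative least-squares method. It is an illustration only, not the batchwise distributed scheme of the paper. The function names `fwht` and `sketch_precond_lstsq` and the parameter `oversample` are hypothetical, the Walsh-Hadamard transform is one possible choice of mixing transform, and CGLS is swapped in for LSQR to keep the example NumPy-only. The matrix is assumed to have full column rank.

```python
import numpy as np


def fwht(x):
    """Orthonormal fast Walsh-Hadamard transform along axis 0.

    The number of rows must be a power of two.
    """
    m = x.shape[0]
    y = np.array(x, dtype=float).reshape(m, -1)
    h = 1
    while h < m:
        # Butterfly step: combine the two halves of each block of 2*h rows.
        y = y.reshape(m // (2 * h), 2, h, -1)
        top = y[:, 0] + y[:, 1]
        bot = y[:, 0] - y[:, 1]
        y = np.stack((top, bot), axis=1).reshape(m, -1)
        h *= 2
    return y / np.sqrt(m)


def sketch_precond_lstsq(A, b, oversample=4, tol=1e-10, max_iter=200, seed=0):
    """Approximately solve min ||A x - b||_2 for a tall, full-column-rank A.

    Illustrative assumptions: Hadamard mixing, uniform row sampling,
    and CGLS (instead of LSQR) on the right-preconditioned system.
    """
    rng = np.random.default_rng(seed)
    m, n = A.shape

    # Zero-pad the rows to a power of two so the Hadamard transform applies.
    m_pad = 1 << (m - 1).bit_length()
    M = np.zeros((m_pad, n))
    M[:m] = A

    # Random sign flips followed by orthonormal mixing of the rows.
    signs = rng.choice([-1.0, 1.0], size=m_pad)
    M = fwht(signs[:, None] * M)

    # Uniformly sample rows of the mixed matrix and keep only the R factor.
    s = min(m_pad, oversample * n)
    rows = rng.choice(m_pad, size=s, replace=False)
    R = np.linalg.qr(M[rows], mode="r")

    # Right-preconditioned operator B = A R^{-1} and its transpose.
    def apply_B(v):
        return A @ np.linalg.solve(R, v)

    def apply_Bt(v):
        return np.linalg.solve(R.T, A.T @ v)

    # CGLS on min ||B y - b||_2, starting from y = 0.
    y = np.zeros(n)
    r = np.array(b, dtype=float)
    g = apply_Bt(r)
    p = g.copy()
    gg_old = g @ g
    g0_norm = np.sqrt(gg_old)
    if g0_norm == 0.0:
        return y  # b is orthogonal to the range of A, so x = 0 is optimal
    for _ in range(max_iter):
        q = apply_B(p)
        alpha = gg_old / (q @ q)
        y += alpha * p
        r -= alpha * q
        g = apply_Bt(r)
        gg_new = g @ g
        if np.sqrt(gg_new) <= tol * g0_norm:
            break
        p = g + (gg_new / gg_old) * p
        gg_old = gg_new

    # Map back to the original variables: x = R^{-1} y.
    return np.linalg.solve(R, y)
```

Calling `sketch_precond_lstsq(A, b)` on a tall dense matrix returns an approximate least-squares solution that can be compared against `np.linalg.lstsq(A, b, rcond=None)[0]`. Because the sample is used only to build the preconditioner, the iteration still runs on the original `A` and `b`, so the sketch affects convergence speed rather than the solution being sought.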
