On optimizing distributed non-negative Tucker decompositionVenkatesan T. ChakaravarthyShivmaran S. Pandianet al.2019ICS 2019Conference paper
Accelerating reduction and scan using tensor core unitsAbdul DakkakCheng Liet al.2019ICS 2019Conference paper