Shachar Don-Yehiya, Leshem Choshen, et al.
ACL 2025
In this paper, we present several algorithms for performing all-to-many personalized communication on distributed memory parallel machines. We assume that each processor sends a different message (of potentially different size) to a subset of all the processors involved in the collective communication. The algorithms are based on decomposing the communication matrix into a set of partial permutations. We study the effectiveness of our algorithms from both the view of static scheduling and runtime scheduling. © 1995 Academic Press, Inc.
Shachar Don-Yehiya, Leshem Choshen, et al.
ACL 2025
Yannis Belkhiter, Dhaval Salwala, et al.
NFV-SDN 2025
Danila Seliayeu, Quinn Pham, et al.
CASCON 2024
Conrad Albrecht, Jannik Schneider, et al.
CVPR 2025