1 code implementation • 7 Oct 2022 • Osman Asif Malik, Vivek Bharadwaj, Riley Murray
We show how to develop sampling-based alternating least squares (ALS) algorithms for decomposition of tensors into any tensor network (TN) format.
1 code implementation • 15 Mar 2022 • Vivek Bharadwaj, Aydın Buluç, James Demmel
Further, we give two communication-eliding strategies to reduce costs further for FusedMM kernels: either reusing the replication of an input dense matrix for the SDDMM and SpMM in sequence, or fusing the local SDDMM and SpMM kernels.