
Codegen for sparse-dense cross-entropy #919

Closed
wants to merge 17 commits

Conversation

AlexRTer (Collaborator)

This PR adds a codegen pass that lowers the expression
sum(CSRMat * ln(denseLhs @ t(denseRhs)))
to a single fused operator that exploits sparsity.
It can be run by enabling the codegen pipeline (--mlir-codegen) and ensuring that the left-hand side of the element-wise multiplication is a CSRMatrix (currently via --select-matrix-repr).
By computing the sum directly, the pass not only avoids materializing the potentially large dense result of the matrix multiplication, it also computes only those dot products that correspond to non-zero entries of the CSRMatrix. As a result, it uses constant additional memory and reduces runtime significantly.
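The effect of the fusion can be illustrated with a small sketch (plain Python/NumPy, not the actual MLIR lowering; the CSR buffer names indptr/indices/values follow the usual convention and are assumed here):

```python
import numpy as np

def fused_sparse_sum(indptr, indices, values, U, V):
    """sum(S * ln(U @ V.T)) evaluated only at the non-zeros of the CSR matrix S."""
    total = 0.0
    for i in range(len(indptr) - 1):              # rows of the sparse matrix
        for k in range(indptr[i], indptr[i + 1]):
            j = indices[k]                        # column of the k-th non-zero
            # one dot product replaces the full entry (U @ V.T)[i, j]
            total += values[k] * np.log(np.dot(U[i], V[j]))
    return total

# tiny example: a 2x3 sparse matrix with two non-zeros
indptr  = np.array([0, 1, 2])
indices = np.array([2, 0])
values  = np.array([3.0, 2.0])
U = np.random.default_rng(0).random((2, 4))
V = np.random.default_rng(1).random((3, 4))

# reference: materialize the dense product, then reduce
dense = np.zeros((2, 3)); dense[0, 2] = 3.0; dense[1, 0] = 2.0
assert np.isclose(fused_sparse_sum(indptr, indices, values, U, V),
                  np.sum(dense * np.log(U @ V.T)))
```

Note that the sketch never allocates the m-by-n product matrix; only the two dot products behind the non-zeros are computed, which mirrors the constant-memory behavior described above.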

An example script for testing the results (the --explain mlir_codegen flag is optional and prints the generated IR):

// RUN: ./bin/daphne --select-matrix-repr --mlir-codegen --explain mlir_codegen ./fileName.daphne

seed = 1;
sparsity = 1e-6;
sparseRows = 10_000;
sparseCols = 10_000;
hiddenDim = 20;

startGeneratingMatrices = now();
sparseLhs = rand(sparseRows, sparseCols, 0.0, 1.0, sparsity, seed);

DenseU = rand(sparseRows, hiddenDim, 0.0, 1.0, 1.0, seed + 1); // sparsity: 1.0
DenseV = rand(sparseCols, hiddenDim, 0.0, 1.0, 1.0, seed + 2); // sparsity: 1.0
endGeneratingMatrices = now();

startCalc = now();
res = sum(sparseLhs * ln(DenseU @ t(DenseV)));
endCalc = now();

print(res);
print("sparse dim: " + sparseRows + "x" + sparseCols + " (sparsity: " + sparsity + "), dense dim: " + sparseRows + "x" + hiddenDim + "@" + hiddenDim + "x" + sparseCols + "->" + sparseRows + "x" + sparseCols);
print("time to generate matrices: " + as.f64(endGeneratingMatrices - startGeneratingMatrices) * 1e-9 + ", comp. time: " + as.f64(endCalc - startCalc) * 1e-9);

A more thorough description will be given once some tests have been added.

- add linalg transposeOp
- rewrite some EwUnary/Binary Ops to handle matrix and scalar types
- use specialized Ops in EwUnary/Binary (e.g. ipowi, fpowi)
- add canonicalizer pattern for ceil/floor/round for integer types
- rework aggAll codegen
- implement aggDim codegen
- rework ewUnary/ewBinary codegen
- add tests for new codegen (scripts and filecheck)
  - rename some existing tests for better overview
- add some needed kernel instantiations (kernels.json)
- minor refactoring/formatting
- fix some typos
- general minor polishing
… specific pattern

- The pattern sum(sparse * ln(dense @ dense)) with a CSRMatrix and two DenseMatrices is replaced entirely by codegen. This avoids materializing potentially large dense intermediates and fuses the whole pattern into a single operator.
- Added CSRMatrix -> MemRef helper functions/kernels to enable (one-way) interop with MLIR
  - small changes to CSRMatrix.h that enable the helper functions by returning shared pointers to its underlying data arrays
  - slightly modified rewriteToCallKernelOp, kernelCatalogParser, and compilerUtils to handle rank-1 MemRefs
  - Added the needed instantiations to kernels.json
- Modified existing codegen passes to handle inputs that are not DenseMatrices/scalars (the corresponding Ops are now marked as legal if an input does not meet the requirements)
- Todo: add tests
- revert leftover duplicate code from merge
- remove use of format for compatibility with docker containers
- minor formatting/tidying
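For context, a CSR matrix decomposes into exactly three rank-1 buffers, which is the shape of data the MemRef interop described above hands to the generated code (a minimal sketch of the layout; the actual helper/kernel names in the PR may differ):

```python
import numpy as np

def to_csr(dense):
    """Decompose a dense matrix into the three rank-1 CSR buffers."""
    indptr, indices, values = [0], [], []
    for row in dense:
        for j, v in enumerate(row):
            if v != 0:
                indices.append(j)   # column index of each non-zero
                values.append(v)    # the non-zero value itself
        indptr.append(len(values))  # running count: row i owns values[indptr[i]:indptr[i+1]]
    return np.array(indptr), np.array(indices), np.array(values)

M = np.array([[0.0, 5.0, 0.0],
              [7.0, 0.0, 0.0]])
indptr, indices, values = to_csr(M)
# indptr  -> [0, 1, 2]
# indices -> [1, 0]
# values  -> [5.0, 7.0]
```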