tile() function part 0.5 #8

mkatliar · 2024-02-01T16:04:43Z

Partial commit for #5

README.md

include/blast/math/algorithm/Tile.hpp

petlist

Looks great but I lack background on the tiles and register matrices, so nothing really deep from me this time. Mostly questions out of curiosity :)

petlist · 2024-02-04T21:25:31Z

include/blast/math/algorithm/Tile.hpp

+    template <typename ET, StorageOrder SO, typename FF, typename FP>
+    BLAST_ALWAYS_INLINE void tile(std::size_t m, std::size_t n, FF&& f_full, FP&& f_partial)
+    {
+        size_t constexpr TILE_SIZE = TileSize_v<ET>;


What happens if we have matrix of integers, booleans or complex numbers?

Good question. I think it will not compile, because we don't have TileSize_v<ET> defined for those types.

petlist · 2024-02-04T21:28:01Z

include/blast/math/algorithm/Tile.hpp

+        {
+            size_t i = 0;
+
+            // i + 4 * TILE_SIZE != M is to improve performance in case when the remaining number of rows is 4 * TILE_SIZE:


This is a bit magic for me, can't really understand why is it more efficient :/

Where magic 3 and 4 come from?

The magic nombers come from a specific case when you have 16 AVX registers, each storing 4 doubles, and TILE_SIZE == 4. When you have 16 rows left, it is more efficient (based on performance test) to apply a 8-row kernel 2 times than a 12-row kernel and then a 4-row kernel.

This code is tied to a specific architecture and is actually very old. It should be re-written in more general way.

petlist · 2024-02-04T21:31:32Z

include/blast/math/algorithm/Tile.hpp

+            // it is more efficient to apply 2 * TILE_SIZE kernel 2 times than 3 * TILE_SIZE + 1 * TILE_SIZE kernel.
+            for (; i + 3 * TILE_SIZE <= m && i + 4 * TILE_SIZE != m; i += 3 * TILE_SIZE)
+            {
+                RegisterMatrix<ET, 3 * TILE_SIZE, TILE_SIZE, columnMajor> ker;


Why columnMajor here and not SO?

petlist · 2024-02-04T21:37:12Z

include/blast/math/algorithm/Tile.hpp

+            for (; i + 2 * TILE_SIZE <= m; i += 2 * TILE_SIZE)
+            {
+                RegisterMatrix<ET, 2 * TILE_SIZE, TILE_SIZE, columnMajor> ker;
+                f_full(ker, i, j);


Мне тут пояснительная бригада нужна, зачем мы применяем функтор к пустой матрице? А, я понял, это чтобы этими черипичками покрыть в случае четного и нечетного количества строк? Или нет?

petlist · 2024-02-04T21:38:25Z

include/blast/math/algorithm/Tile.hpp

+            if (i < m)
+            {
+                RegisterMatrix<ET, TILE_SIZE, TILE_SIZE, columnMajor> ker;
+                f_partial(ker, i, j, m - i, ker.columns());


TILE_SIZE != ker.columns() ?

petlist · 2024-02-04T21:47:53Z

include/blast/math/dense/Gemm.hpp

@@ -51,83 +54,25 @@ namespace blast
        MatrixPointer<MPC> && StorageOrder_v<MPC> == columnMajor &&
        MatrixPointer<MPD> && StorageOrder_v<MPD> == columnMajor
    )
-    BLAZE_ALWAYS_INLINE void gemm(size_t M, size_t N, size_t K, ST1 alpha, MPA A, MPB B, ST2 beta, MPC C, MPD D)
+    BLAST_ALWAYS_INLINE void gemm(size_t M, size_t N, size_t K, ST1 alpha, MPA A, MPB B, ST2 beta, MPC C, MPD D)


Question, do we need to specify M, N, K? Are not they part of MPA, MPB, MPC?

petlist · 2024-02-04T21:48:42Z

include/blast/math/dense/Gemm.hpp

    {
-        using ET = std::remove_cv_t<ElementType_t<MPA>>;
+        using ET = std::remove_cv_t<ElementType_t<MPD>>;


Does it make a difference?

petlist · 2024-02-04T21:49:36Z

include/blast/math/dense/Gemm.hpp

    {
-        using ET = std::remove_cv_t<ElementType_t<MPA>>;
+        using ET = std::remove_cv_t<ElementType_t<MPD>>;
        size_t constexpr TILE_SIZE = TileSize_v<ET>;

        BLAZE_CONSTRAINT_MUST_BE_SAME_TYPE(std::remove_cv_t<ElementType_t<MPB>>, ET);


Does it make sense to put in require above?

…iagnostics

mkatliar · 2024-02-13T21:39:23Z

Continuous integration broken because of a bug in Blaze: https://bitbucket.org/blaze-lib/blaze/commits/6058199308a8c741279373b4d81debbba5e5e850#comment-14858242

mkatliar · 2024-03-08T19:59:44Z

Blaze bug fixed https://bitbucket.org/blaze-lib/blaze/issues/451/error-compiling-bb-inv-trans-ll-for#comment-66451923, CI passes.

mkatliar added 4 commits January 21, 2024 20:05

Update README.md

d75b59c

Added tile() function

3116abf

gemm() using tile()

e33d7b4

Add BLAZE_BLAS_MODE=0 in bench-blast

c788324

mkatliar changed the title ~~Tile 0.5~~ tile() function part 0.5 Feb 1, 2024

edge -> side

6fd9c95

mkatliar requested a review from petlist February 1, 2024 16:27

petlist reviewed Feb 4, 2024

View reviewed changes

README.md Outdated Show resolved Hide resolved

petlist reviewed Feb 4, 2024

View reviewed changes

include/blast/math/algorithm/Tile.hpp Outdated Show resolved Hide resolved

petlist reviewed Feb 4, 2024

View reviewed changes

mkatliar added 2 commits February 13, 2024 20:20

Fixed spelling errors

0b79d8b

Added -ftemplate-backtrace-limit=0 in cmake.yml workflow for better d…

391e92a

…iagnostics

mkatliar merged commit 153025e into master Mar 8, 2024
1 check passed

mkatliar deleted the tile-0.5 branch March 8, 2024 20:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tile() function part 0.5 #8

tile() function part 0.5 #8

mkatliar commented Feb 1, 2024 •

edited

Loading

petlist left a comment

petlist Feb 4, 2024

mkatliar Feb 13, 2024

petlist Feb 4, 2024

petlist Feb 4, 2024

mkatliar Feb 13, 2024 •

edited

Loading

petlist Feb 4, 2024

petlist Feb 4, 2024

petlist Feb 4, 2024

petlist Feb 4, 2024

petlist Feb 4, 2024

petlist Feb 4, 2024

mkatliar commented Feb 13, 2024

mkatliar commented Mar 8, 2024

tile() function part 0.5 #8

tile() function part 0.5 #8

Conversation

mkatliar commented Feb 1, 2024 • edited Loading

petlist left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mkatliar Feb 13, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mkatliar commented Feb 13, 2024

mkatliar commented Mar 8, 2024

mkatliar commented Feb 1, 2024 •

edited

Loading

mkatliar Feb 13, 2024 •

edited

Loading