New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

SYCL: Reduce most of the compiler warnings #10748

Merged

abhilash1910 merged 24 commits into ggerganov:master from qnixsynapse:refactor

Dec 13, 2024

Contributor

qnixsynapse commented Dec 10, 2024

Most of the warnings are from unused variables, incorrect typecasts and empty functions. I tried to fix them in this PR.

Also added code comments wherever necessary.

SYCL backend tests passing with this change.

I think it would be better if -Werror flag is enabled in CI which can improve the quality of code.

cc: @airMeng @NeoZhangJianyu

qnixsynapse added 2 commits

December 10, 2024 13:13


          Try to reduce some unused and typecast warnings


          Reduce compiler warnings step 2

a708dfc

github-actions bot added ggml SYCL labels

qnixsynapse changed the title ~~Reduce most of the compiler warnings~~ SYCL: Reduce most of the compiler warnings

Rbiessy reviewed

View reviewed changes

Collaborator

Rbiessy left a comment

Thanks for the PR, I overall agree with the suggested changes. I would also be in favor of adding more warning flags.

ggml/src/ggml-sycl/ggml-sycl.cpp Outdated Show resolved Hide resolved

ggml/src/ggml-sycl/mmq.cpp Outdated Show resolved Hide resolved

ggml/src/ggml-sycl/mmq.cpp Outdated Show resolved Hide resolved

ggml/src/ggml-sycl/norm.cpp Outdated Show resolved Hide resolved

ggml/src/ggml-sycl/gemm.hpp Outdated Show resolved Hide resolved

ggml/src/ggml-sycl/common.cpp Outdated Show resolved Hide resolved

qnixsynapse marked this pull request as draft

December 10, 2024 11:09

qnixsynapse added 12 commits

December 10, 2024 17:12


          add a newline at the end of the file

fb2e66e


          Initialize nreduce as size_t

32164aa


          [SYCL] Remove pragma directives from mmq.cpp

71d84a5


          SYCL: mmq add condition to prevent blocks_per_tile_x_row variable fro…

fe5afd4

…m becoming 0


          SYCL softmax: Initialize nreduce as size_t


          ggml-sycl.cpp: fix some trailing whitespaces

4b5470f


          SYCL: remove the unused variables instead of commenting it out

7dda9aa


          SYCL poo2d kernel: set NAN for invalid pooling op

cc7cd62


          Merge branch 'master' into refactor

5a766c1


          SYCL gemm.hpp: remove pragma directives

274842d


          SYCL gemm.hpp: use const cast to properly support dnnl::memory

b0e27ad


          SYCL: wkv6 remove a comment

cb0daca

abhilash1910 reviewed

View reviewed changes

ggml/src/ggml-sycl/ggml-sycl.cpp Outdated Show resolved Hide resolved

abhilash1910 reviewed

View reviewed changes

ggml/src/ggml-sycl/mmq.cpp Outdated Show resolved Hide resolved

Rbiessy reviewed

View reviewed changes

ggml/src/ggml-sycl/ggml-sycl.cpp Outdated Show resolved Hide resolved

ggml/src/ggml-sycl/ggml-sycl.cpp Outdated Show resolved Hide resolved

ggml/src/ggml-sycl/ggml-sycl.cpp Outdated Show resolved Hide resolved

ggml/src/ggml-sycl/wkv6.cpp Outdated Show resolved Hide resolved

ggml/src/ggml-sycl/wkv6.cpp Outdated Show resolved Hide resolved

qnixsynapse added 3 commits

December 11, 2024 15:58


          SYCL: clean comments step 2

8f123ae


          SYCL: clean comments and variables step 3

39b4c47


          SYCL: Use GGML_UNUSED for unused variables

8dfac46

Contributor Author

qnixsynapse commented Dec 12, 2024 •

edited

Loading

Everything is finish I guess. I did not switch to tensor->buffer from tensor->backend for checking backend buffer type in this PR since I have no way to test multi GPU(I did switch to it locally and is working okay for single GPU), nor I fixed any of the SYCL deprecation warnings.

SYCL's clang compiler is not liking anonymous structures inside anonymous unions in ggml-common.h which I have not fixed.

Edit: At some point, I would like to enable -Werror flag in CI mirroring the CUDA backend. Also, mmq kernels need to be fixed and enabled before we can think of adding fp16 softmax and flash attention.

qnixsynapse marked this pull request as ready for review

December 12, 2024 07:08

abhilash1910 reviewed

View reviewed changes

ggml/src/ggml-sycl/ggml-sycl.cpp Show resolved Hide resolved

abhilash1910 reviewed

View reviewed changes

ggml/src/ggml-sycl/softmax.cpp Show resolved Hide resolved


          SYCL: remove extra empty lines and a comment

90fe556

qnixsynapse and others added 3 commits

December 12, 2024 12:58


          Remove TODO

46bcfe4


          cleanup spaces

ffd7c1d


          add a stdout for unsupported op

ba661a4

Collaborator

abhilash1910 commented Dec 12, 2024

Thanks @qnixsynapse for the cleanups. LGTM,
Ping @NeoZhangJianyu @airMeng , @Rbiessy @Alcpz for a look.

abhilash1910 added 2 commits

December 12, 2024 14:48


          use sycl printf over fprintf

524acb4


          remove prints for CI

b828f4a

Rbiessy reviewed

View reviewed changes

ggml/src/ggml-sycl/ggml-sycl.cpp Outdated Show resolved Hide resolved

ggml/src/ggml-sycl/ggml-sycl.cpp Outdated Show resolved Hide resolved


          SYCL ggml-sycl: pool2D use sycl::nan and remove if-else block

6b0848c

Collaborator

slaren commented Dec 12, 2024

I did not switch to tensor->buffer from tensor->backend for checking backend buffer type in this PR since I have no way to test multi GPU

Hopefully this can be addressed soon. tensor->backend as been marked as deprecated for a very long time, and at this point only the SYCL backend uses it. We need to remove it.

Rbiessy approved these changes

View reviewed changes

Collaborator

Rbiessy left a comment

LGTM, thanks for the changes

Contributor Author

qnixsynapse commented Dec 12, 2024

I did not switch to tensor->buffer from tensor->backend for checking backend buffer type in this PR since I have no way to test multi GPU

Hopefully this can be addressed soon. tensor->backend as been marked as deprecated for a very long time, and at this point only the SYCL backend uses it. We need to remove it.

I know. This PR is from an end user actually. I felt bad because this backend isn't receiving the love it deserves.

abhilash1910 merged commit 83ed24a into ggerganov:master

47 checks passed

qnixsynapse deleted the refactor branch

December 13, 2024 09:19

arthw pushed a commit to arthw/llama.cpp that referenced this pull request


          SYCL: Reduce most of the compiler warnings (ggerganov#10748)

a674ffb

* Try to reduce some unused and typecast warnings

* Reduce compiler warnings step 2

* add a newline at the end of the file

* Initialize nreduce as size_t

* [SYCL] Remove pragma directives from mmq.cpp

* SYCL: mmq add condition to prevent blocks_per_tile_x_row variable from becoming 0

* SYCL softmax: Initialize nreduce as size_t

* ggml-sycl.cpp: fix some trailing whitespaces

* SYCL: remove the unused variables instead of commenting it out

* SYCL poo2d kernel: set NAN for invalid pooling op

* SYCL gemm.hpp: remove pragma directives

* SYCL gemm.hpp: use const cast to properly support dnnl::memory

* SYCL: wkv6 remove a comment

* SYCL: clean comments step 2

* SYCL: clean comments and variables step 3

* SYCL: Use GGML_UNUSED for unused variables

* SYCL: remove extra empty lines and a comment

* Remove TODO

* cleanup spaces

* add a stdout for unsupported op

* use sycl printf over fprintf

* remove prints for CI

* SYCL ggml-sycl: pool2D use sycl::nan and remove if-else block

---------

Co-authored-by: Abhilash Majumder <[email protected]>

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ggml SYCL