-
Notifications
You must be signed in to change notification settings - Fork 55
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Issue]: Crash while compiling rocSPARSE #123
Comments
Thanks for raising this issue. I have a couple questions that might help track down the problem:
|
Issue appears to be caused by using rocprim from 6.1 but compiling latest rocsparse. The specific offending commit id in rocsparse is this: 81e4f9527b825195f53c8e3b660f6a699af829b7. Investigating a solution now. As a temporary workaround, compiling latest rocprim first and then compiling latest rocsparse should work. |
A dependency diagram would be helpful... I don't know what depends on what and there's 40 of these repositories before I arrive at my goal. But no, I am installing all of it through cloning and compiling, because I want to see if I can debug problems. However, due to how long it takes, there might be patches/new versions released during the compilation process. (As it takes several days to compile it all, and I'm still debugging the whole process, writing some patches to overcome ubuntu-assumptions, and so on...) |
Fixing PR up now. Ill comment here once it is merged. Regarding dependencies, currently rocSPARSE depends on rocPRIM and (optionally) rocBLAS. While the rocPRIM dependency is mentioned in the docs (see https://rocm.docs.amd.com/projects/rocSPARSE/en/latest/install/Linux_Install_Guide.html#linux-install), I agree we should present this information better as currently I don't think we are clear on how rocSPARSE should work when say using the latest rocSPARSE while also using older versions of rocPRIM (within the same major version). Ill look into improving that. |
Correcting something I said in my previous comment that is wrong: I identified the cause of the compilation failures you are seeing as stemming from using rocPRIM 3.1.0 (this is the version that came with your installation of rocm 6.1) and trying to compile rocSPARSE using the latest develop branch. Specifically there was a change in rocPRIM 3.2.0 that is used by rocSPARSE develop (the develop branch being much further ahead of what was packaged with your rocm 6.1 installation). This then caused compilation failures when using rocPRIM 3.1.0 since this version of rocPRIM obviously does not have those changes. All of this is correct. The part where I made an incorrect statement was regarding how rocSPARSE should work with older versions of rocPRIM. How it works is actually the opposite of what I stated. Given a rocm release with say rocPRIM version 3.1.0 and rocSPARSE version 3.1.0, it should be possible to re-build rocSPARSE 3.1.0 with any future rocPRIM 3.Y.Z version where 3.Y.Z >= 3.1.0 up to the next major version change. This then explains the failure as trying to build rocSPARSE 3.2.0 with rocPRIM 3.1.0 is not supported. Recommendations:
|
Problem Description
rocSPARSE compilation crashes, rather than producing an error or succeeding.
fail.txt
Operating System
Arch linux, kernel 6.9.7-arch1-1
CPU
AMD Threadripper 1950X
GPU
AMD Radeon RX 7900 XTX
ROCm Version
ROCm 6.1.0
ROCm Component
rocSPARSE
Steps to Reproduce
After compiling all prerequisites, try doing the following (or something like it):
(Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support
No response
Additional Information
No response
The text was updated successfully, but these errors were encountered: