AMD Radeon Pro VII - improvement with ROCm 5.0 #249
Replies: 6 comments 2 replies
-
Thanks; I also checked with a wavefront exponent (109M), and what I see is a slight speed-up in single-instance-per-GPU (which is good!), but at the same time about 4% slowdown in duo-instance-per-GPU (which is not good). (this is all relative to ROCm 3.3.0). So there is progress in some situation at least. |
Beta Was this translation helpful? Give feedback.
-
The PRP timing of exponent 113926861 is 847 us/it. |
Beta Was this translation helpful? Give feedback.
-
The PRP timing of exponent 111754183 is 841 us/it. |
Beta Was this translation helpful? Give feedback.
-
Weird enough, same situation but changed mainboard and RAM: |
Beta Was this translation helpful? Give feedback.
-
Improvement with ROCm 5.1 Exponent 111755179 |
Beta Was this translation helpful? Give feedback.
-
Slight improvement with my build of latest commit. ROCm 5.1.1, 802~806 us/it. |
Beta Was this translation helpful? Give feedback.
-
The PRP timing of exponent 114837847 went from 863 us/it to 854 us/it.
Beta Was this translation helpful? Give feedback.
All reactions