Rocm jaxlib v0.4.31 qa cleanup #40

Merged 9 commits on Nov 7, 2024

@hsharsha hsharsha commented Sep 9, 2024

No description provided.

Imported from GitHub PR openxla#13597

To ensure proper synchronization for the asynchronous copy, this CL makes the other stream wait for the completion of the operation (res_3) in the main stream.
  %param_1 = f32[1024]{0} parameter(1)
  %param_0 = f32[1024]{0} parameter(0)
  %res_3 = f32[1024]{0} fusion(%param_1, %param_0), kind=kInput, calls=mul
  %copy-start = (f32[1024]{0:S(5)}, f32[1024]{0}, u32[]) copy-start(f32[1024]{0} %res_3)
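The ordering described above can be illustrated with a small CPU-only sketch. This is not XLA code: `Stream`, `run_offload`, and the `threading.Event` standing in for a GPU event are hypothetical names chosen for illustration, and the "streams" are plain Python threads. It only shows the idea that the copy stream blocks until the main stream has finished producing `res_3`.

```python
import queue
import threading


class Stream:
    """Toy stand-in for a GPU stream: runs enqueued callables in FIFO order."""

    def __init__(self, name):
        self.name = name
        self._queue = queue.Queue()
        self._thread = threading.Thread(target=self._run, daemon=True)
        self._thread.start()

    def enqueue(self, fn):
        self._queue.put(fn)

    def _run(self):
        while True:
            fn = self._queue.get()
            if fn is None:  # sentinel: no more work
                break
            fn()

    def close(self):
        """Drain the remaining work and stop the worker thread."""
        self._queue.put(None)
        self._thread.join()


def run_offload(n=1024):
    param_0 = [2.0] * n
    param_1 = [3.0] * n
    buffers = {}
    log = []
    # Assumption: this Event plays the role of an event recorded on the main stream.
    res_3_done = threading.Event()

    main = Stream("main")
    copy = Stream("copy")

    def fusion_mul():
        # Analogue of %res_3 = fusion(%param_1, %param_0), calls=mul
        buffers["res_3"] = [a * b for a, b in zip(param_1, param_0)]
        log.append("fusion")

    def record_event():
        res_3_done.set()

    def wait_for():
        # Analogue of the inserted wait: block the copy stream until res_3 exists.
        res_3_done.wait()
        log.append("wait-done")

    def copy_start():
        # Analogue of copy-start: without the wait, res_3 might not be ready yet.
        buffers["host_copy"] = list(buffers["res_3"])
        log.append("copy-start")

    # The copy stream is enqueued first on purpose; the wait still enforces the order.
    copy.enqueue(wait_for)
    copy.enqueue(copy_start)
    main.enqueue(fusion_mul)
    main.enqueue(record_event)

    main.close()
    copy.close()
    return buffers, log
```

Calling `run_offload(8)` returns the copied buffer together with a log of the order in which the steps ran. Removing the `wait_for` step makes `copy_start` race against `fusion_mul`, which is the hazard the change addresses.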
Copybara import of the project:

--
5e41cc4 by Jane Liu <[email protected]>:

Activation offloading dependency fix: insert a wait

--
c6c8b3e by Jane Liu <[email protected]>:

Assign streams to copy-start instructions and add a Waitfor thunk

--
9ffa12a by Jane Liu <[email protected]>:

copy-start always uses a new stream; add Waitfor for both D2H and H2D

Merging this change closes openxla#13597

COPYBARA_INTEGRATE_REVIEW=openxla#13597 from zhenying-liu:dependency-fix 9ffa12a
PiperOrigin-RevId: 666842831
Imported from GitHub PR openxla#16190

After the changes in openxla@d86502a to the CUDA async allocator were introduced, `PjrtCApiGpuAllocatorTest.ValidOptionsParsing` started failing on ROCm.
Copybara import of the project:

--
ddc6207 by mmakevic <[email protected]>:

Skip cuda_async option for rocm in ValidOptionsParsing test

--
6d491be by mmakevic <[email protected]>:

Fix formatting issues

Merging this change closes openxla#16190

COPYBARA_INTEGRATE_REVIEW=openxla#16190 from ROCm:ci_pjrt_c_api_test_fix 6d491be
PiperOrigin-RevId: 667883110
Imported from GitHub PR openxla#16037

Copybara import of the project:

--
6303981 by Harsha HS <[email protected]>:

[ROCm] Distinguish between NVIDIA and AMD GPUs in `gpu_device_info_test`

--
246e410 by Harsha HS <[email protected]>:

Simplify build by removing conditionals as per review comment

Merging this change closes openxla#16037

COPYBARA_INTEGRATE_REVIEW=openxla#16037 from ROCm:ci_fix_gpu_device_info_test_20240813 246e410
PiperOrigin-RevId: 663630606
…er_test

Imported from GitHub PR openxla#16213

Copybara import of the project:

--
0dba5ad by Harsha HS <[email protected]>:

[ROCm] Bump ROCm version to 6.2 and fix gemm_rewriter_test

--
81bea23 by Harsha HS <[email protected]>:

Fix clang formatting

Merging this change closes openxla#16213

COPYBARA_INTEGRATE_REVIEW=openxla#16213 from ROCm:ci_fix_gemm_rewriter_test_20240819 81bea23
PiperOrigin-RevId: 666859220
@hsharsha hsharsha merged commit e02959b into rocm-jaxlib-v0.4.31-qa Nov 7, 2024
1 of 4 checks passed

6 participants