forked from openxla/xla
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[xla][gpu] Implement pipelined-p2p-rewriter.
This pass rewrites pipelined point-to-point communication by rotating the SendDone and RecvDone operations in a while-body to the beginning of the next iteration. The SendDone and RecvDone operations for the last iteration are moved to the computation that calls the while-op, after the while-op. Add the pass to the GPU post-scheduler pipeline.

This is another approach to producing the code pattern that pipelines two Send-Recv chains decomposed from a collective-permute with a source-target pair cycle, for performance. The pipelined Send-Recv pattern puts SendDone and RecvDone before Send and Recv in the while-body. If we generate this code pattern too early in the GPU compilation pipeline, copy-insertion may generate copies of Send, so that Send and SendDone use different buffers, which is a correctness problem.

PiperOrigin-RevId: 630121252
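The rotation described above can be sketched with a toy model. This is not the XLA implementation: the `Loop` class, the op names, and the `rotate_done_ops` and `unroll` helpers are all hypothetical, and the model assumes that before the rewrite the calling computation issues a first Recv/Send/RecvDone/SendDone ahead of the while-op. It only illustrates that moving the done operations to the start of the body, and the final pair after the loop, leaves the unrolled order of operations unchanged.

```python
from dataclasses import dataclass, field

# Hypothetical op names standing in for HLO RecvDone / SendDone.
DONE_OPS = ("recv-done", "send-done")


@dataclass
class Loop:
    """Toy model: flat op lists for the caller and the while-body."""
    before: list                                 # caller ops before the while-op
    body: list                                   # while-body ops, in order
    after: list = field(default_factory=list)    # caller ops after the while-op


def rotate_done_ops(loop):
    """Move done ops to the start of the body; the last pair lands after the loop."""
    body_done = [op for op in loop.body if op in DONE_OPS]
    body_rest = [op for op in loop.body if op not in DONE_OPS]
    # Assumption: the caller's own done ops are now executed by the first iteration.
    before_rest = [op for op in loop.before if op not in DONE_OPS]
    return Loop(before=before_rest,
                body=body_done + body_rest,
                after=body_done + loop.after)


def unroll(loop, trip_count):
    """Flatten the loop into the sequence of ops executed for a given trip count."""
    return loop.before + loop.body * trip_count + loop.after


original = Loop(
    before=["recv", "send", "recv-done", "send-done"],
    body=["compute", "recv", "send", "recv-done", "send-done"],
)
rotated = rotate_done_ops(original)

print(rotated.before)  # caller ops before the while-op
print(rotated.body)    # while-body with done ops rotated to the front
print(rotated.after)   # done ops for the last iteration, after the while-op
```

In this model the rewritten body starts with the done operations for the Send and Recv issued by the previous iteration (or by the caller, for the first iteration), which matches the pipelined pattern the commit message describes. Buffer assignment and copy-insertion, the actual motivation for running the pass late, are outside what this sketch models.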
1 parent adf109d, commit a4e712c
Showing 6 changed files with 1,519 additions and 0 deletions.