You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When I run a single NCCL kernel ,the active SMs is 15%,and When I run a single compute kernel ,the active SMs is 100% ,but when I run the compute kernel and the NCCL kernel in parallel, so that they overlap,the active SMs is 85%, how to explain this?
The text was updated successfully, but these errors were encountered:
When I run a single NCCL kernel ,the active SMs is 15%,and When I run a single compute kernel ,the active SMs is 100% ,but when I run the compute kernel and the NCCL kernel in parallel, so that they overlap,the active SMs is 85%, how to explain this?
The text was updated successfully, but these errors were encountered: