Support for cooperative groups #121

Open
andy-shiyi-liu opened this issue Sep 14, 2023 · 0 comments

Comments

@andy-shiyi-liu

I want to trace a kernel that uses cooperative groups. My program looks roughly like this:

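#include <cooperative_groups.h>
#include <cuda.h>
#include <cuda_runtime_api.h>

namespace cg = cooperative_groups;

__global__ void random_access_shared(...) {
  // Handle to the whole grid; sync() on it requires a cooperative launch.
  cg::grid_group grid = cg::this_grid();
  ... do something ...
  // __syncthreads();
  grid.sync();  // grid-wide barrier instead of the block-level one above
  ... do something ...
}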

int main(int argc, char **argv) {
  ... do something ...

  size_t shared_size = 0xffff;

  void *kernelArgs[] = {...};

  dim3 dimBlock(nthread, 1, 1);
  dim3 dimGrid(nblock, 1, 1);

  cudaLaunchCooperativeKernel((void *)random_access_shared, dimGrid, dimBlock, kernelArgs, shared_size, 0);
  
  ... do something ...
}

However, when I trace this kernel with a command like the following, no trace file is produced:

export CUDA_VERSION="11.0"
export CUDA_VISIBLE_DEVICES="0"
export TRACES_FOLDER=/trace/output/folder
CUDA_INJECTION64_PATH=/path/to/tracer_tool.so
LD_PRELOAD=/path/to/tracer_tool.so /path/to/executable.out

Does NVBit support tracing kernels that use cooperative groups? If so, does it need to be used differently for kernels launched with cudaLaunchCooperativeKernel?
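One thing that may be worth ruling out before looking at the tracer: a cooperative launch that fails never runs the kernel, so there would be nothing to trace. The sketch below is a minimal stand-alone check, not the original program. `coop_demo`, `fits_co_resident`, `failed`, and `run_cooperative_demo` are hypothetical names introduced for illustration. It assumes device 0, and it assumes two conditions are relevant here: the grid must be small enough for all blocks to be resident at once, and a dynamic shared memory request above the 48 KB default (0xffff bytes is 65535) needs an explicit opt-in.

```cuda
#include <cooperative_groups.h>
#include <cuda_runtime.h>
#include <cstdio>

namespace cg = cooperative_groups;

// Hypothetical stand-in for random_access_shared: each block publishes its
// index, the whole grid synchronizes, then each block reads its neighbour's.
__global__ void coop_demo(int *buf, int *out) {
  cg::grid_group grid = cg::this_grid();
  if (threadIdx.x == 0) buf[blockIdx.x] = blockIdx.x;
  grid.sync();  // grid-wide barrier; requires a cooperative launch
  if (threadIdx.x == 0) out[blockIdx.x] = buf[(blockIdx.x + 1) % gridDim.x];
}

// Pure host helper: every block of a cooperative grid must be resident at once.
bool fits_co_resident(int blocks, int blocks_per_sm, int num_sms) {
  return blocks > 0 && blocks <= blocks_per_sm * num_sms;
}

// Prints the CUDA error, if any; returns true when the call failed.
static bool failed(cudaError_t err, const char *what) {
  if (err == cudaSuccess) return false;
  fprintf(stderr, "%s: %s\n", what, cudaGetErrorString(err));
  return true;
}

// Returns 0 if the cooperative launch ran to completion, 1 otherwise.
int run_cooperative_demo(int nblock, int nthread, size_t shared_size) {
  cudaDeviceProp prop;
  if (failed(cudaGetDeviceProperties(&prop, 0), "cudaGetDeviceProperties")) return 1;
  if (!prop.cooperativeLaunch) {
    fprintf(stderr, "device 0 does not support cooperative launch\n");
    return 1;
  }

  // Assumption: requests above the 48 KB default need this opt-in.
  if (shared_size > 48 * 1024 &&
      failed(cudaFuncSetAttribute((const void *)coop_demo,
                                  cudaFuncAttributeMaxDynamicSharedMemorySize,
                                  (int)shared_size),
             "cudaFuncSetAttribute")) return 1;

  int blocks_per_sm = 0;
  if (failed(cudaOccupancyMaxActiveBlocksPerMultiprocessor(
                 &blocks_per_sm, coop_demo, nthread, shared_size),
             "occupancy query")) return 1;
  if (!fits_co_resident(nblock, blocks_per_sm, prop.multiProcessorCount)) {
    fprintf(stderr, "grid of %d blocks cannot be co-resident (limit %d)\n",
            nblock, blocks_per_sm * prop.multiProcessorCount);
    return 1;
  }

  int *buf = nullptr, *out = nullptr;
  bool bad = failed(cudaMalloc(&buf, nblock * sizeof(int)), "cudaMalloc buf") ||
             failed(cudaMalloc(&out, nblock * sizeof(int)), "cudaMalloc out");
  if (!bad) {
    void *kernelArgs[] = {&buf, &out};
    bad = failed(cudaLaunchCooperativeKernel((void *)coop_demo,
                                             dim3(nblock, 1, 1),
                                             dim3(nthread, 1, 1),
                                             kernelArgs, shared_size, 0),
                 "cudaLaunchCooperativeKernel") ||
          failed(cudaDeviceSynchronize(), "cudaDeviceSynchronize");
  }
  cudaFree(buf);  // cudaFree(nullptr) is a no-op
  cudaFree(out);
  return bad ? 1 : 0;
}
```

Calling `run_cooperative_demo(nblock, nthread, 0xffff)` from `main` and building with nvcc should show whether the launch itself succeeds outside the tracer. If it reports an error, the missing trace may simply reflect a kernel that never ran. If it succeeds, the question of whether the tracer handles cooperative launches still stands.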
