Is there a way to ask the GPU (or driver) to list the set of active (or dispatched or issued) CUDA kernels on a GPU, without attaching cuda-gdb
to the owning CPU process and suspending it?
I'm imagining something like pstack, where the interface might look like:
> list-cuda-kernels $pid
gpu 0: kernel_foo
gpu 0: kernel_bar
gpu 1: kernel_baz