as for embedding in the binary a post build step that does
ubyte[] ptx_code = import("kernels_cuda620_64.ptx");
should be doable as should invoking ptxas and doing the same.
Then proving a consistent naming convention is used the code can do its magic.
Or the files could just be read from disk.
Any input with your expertise with CUDA will be much appriciated.