Torch device CUDA is available. CUDA_LAUNCH_BLOCKING=1. CUDA warp. CUDA opcode latency. Thread warp CUDA.
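
The first two items can be tied together in a short sketch. It assumes PyTorch may or may not be installed and falls back to the CPU if it is not; the helper name pick_device is hypothetical and only for illustration. CUDA_LAUNCH_BLOCKING=1 makes kernel launches synchronous, so an error is reported at the call that caused it. The variable has to be set before CUDA is initialized, so the sketch sets it before importing torch.

```python
import os

# Set before CUDA is initialized; doing it ahead of the torch import is the safe order.
os.environ["CUDA_LAUNCH_BLOCKING"] = "1"


def pick_device():
    """Return "cuda" if PyTorch reports a usable GPU, otherwise "cpu".

    pick_device is a hypothetical helper name, not part of PyTorch.
    """
    try:
        import torch
    except ImportError:
        # Assumption for this sketch: fall back to the CPU when PyTorch is missing.
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


device = pick_device()
print(f"device={device}, CUDA_LAUNCH_BLOCKING={os.environ['CUDA_LAUNCH_BLOCKING']}")
```

Synchronous launches slow execution down, so this setting is usually kept for debugging sessions rather than left on for timing or latency measurements.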