Kernel comparison with a MMA in CUDA and near-SOTA/cuBLAS performance kernel
Kernel comparison with a MMA in CUDA and near-SOTA/cuBLAS performance kernel The project is hosted in the repository: CUDA-refresh Introduction The kernel is the “kernel” in the concept of CUDA, it directly influence the compute efficiency and it’s the key to take advanage of GPU’s huge amount of computation resource and bandwidth. Here is a simple refresh of the CUDA calculation and memory hierachy and their infleunce to the computation efficiency. ...