728x90
반응형

general matrix-matrix multiplication (GEMM)



GPU Memory Hierarchy, from fastest/smallest (top) to slowest/largest (bottom).




https://medium.com/data-science-collective/learning-triton-one-kernel-at-a-time-matrix-multiplication-44851b4146dd

728x90
Posted by Mr. Slumber
,