sgemm_tcore: Fix address overlap for DMA

Enforce square shapes of tiles in smem.
TODO: the loop bounds still need to be configured correctly.
Hansung Kim
2024-06-18 15:06:07 -07:00
parent 36b02ad595
commit 50b843d8c4
3 changed files with 11 additions and 6 deletions

@@ -19,7 +19,7 @@
// * Combining BM * BK >= (BM*BN) / (TM*TN) == threadblock yields
// BM <= BK*TM*TN
#define BM 64
-#define BN 64
+#define BN 32
#define BK 64
#define WM 16
#define WN 8
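
The comment at the top of this hunk relates tile sizes to the threadblock size. As a rough sketch of that arithmetic, the C fragment below computes the thread count as (BM*BN) / (TM*TN) and checks the condition BM * BK >= threadblock. BM, BN and BK are taken from the hunk, reading the second BN line (32) as the new value. TM and TN are not shown in this hunk, so the values here are hypothetical placeholders, and both helper functions are illustrative names that do not come from the repository.

```c
#include <assert.h>

/* Tile sizes as they appear in the hunk (BN taken as the new value). */
#define BM 64
#define BN 32
#define BK 64

/* HYPOTHETICAL per-thread tile sizes: TM and TN are not shown in this
 * hunk, so these values are placeholders for illustration only. */
#define TM 4
#define TN 4

/* Threads per threadblock, assuming each thread computes a TM x TN
 * sub-tile of the BM x BN output tile. */
static int threads_per_block(int bm, int bn, int tm, int tn) {
    return (bm * bn) / (tm * tn);
}

/* Nonzero when the BM x BK tile holds at least one element per thread,
 * i.e. BM * BK >= (BM*BN) / (TM*TN). Illustrative helper only. */
static int tile_covers_threads(int bm, int bn, int bk, int tm, int tn) {
    return bm * bk >= threads_per_block(bm, bn, tm, tn);
}
```

With the placeholder values, `threads_per_block(BM, BN, TM, TN)` is 2048 / 16 = 128, and the 64 x 64 tile (4096 elements) satisfies the condition. The check only covers the single inequality quoted in the comment; other constraints in the file are outside this hunk.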