Hansung Kim
04643fa64d
sgemm_impl: Refactor dmem_load into one unified logic
...
Replace the confusing logic that had slightly different use of BM/BN/BK
for A and B, into one logic that accepts matrix memory layout as a
proper argument & does compile-time logic to determine the right
dimensions.
TODO: !GMEM_COALESCED_A is not updated yet
2024-08-18 22:05:22 -07:00
..
2024-06-07 18:11:19 -07:00
2023-11-14 05:37:46 -08:00
2023-11-14 22:31:30 -08:00
2023-11-14 05:37:46 -08:00
2023-11-10 02:47:05 -08:00
2023-11-14 05:37:46 -08:00
2024-08-18 16:21:22 -07:00
2024-04-24 21:10:21 -07:00
2024-06-22 01:37:00 -07:00
2023-11-14 05:37:46 -08:00
2023-11-14 05:37:46 -08:00
2023-11-14 05:37:46 -08:00
2023-11-14 05:37:46 -08:00
2023-11-14 05:37:46 -08:00
2024-06-07 18:11:19 -07:00
2024-06-19 17:45:01 -07:00
2024-08-06 02:43:44 -07:00
2024-06-12 22:44:14 -07:00
2024-08-18 22:05:22 -07:00
2024-06-06 15:19:39 -07:00
2023-11-27 02:21:47 -08:00
2023-11-14 05:37:46 -08:00
2024-03-27 15:15:52 -07:00
2024-08-12 15:22:07 -07:00
2023-11-27 02:21:47 -08:00