This website requires JavaScript.
Explore
Help
Sign In
wu-arch
/
kernels
Watch
1
Star
0
Fork
0
You've already forked kernels
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
e6db1a83af83d76c8e469b15e67ef47d970e914d
kernels
/
tests
/
regression
/
sgemm_wg
History
Hansung Kim
fa2b6e2ad0
sgemm_wg: Explicitly limit unroll to reduce stack spilling
...
This needs to be done case-by-case for different BK/TM/TN combinations and examining the assembly.
2024-03-29 02:48:29 -07:00
..
.gitignore
Add sgemm_wg C kernel
2024-02-07 21:31:08 -08:00
common.h
sgemm_wg: Implement blocking over k-dimension
2024-02-16 16:20:57 -08:00
kernel.cpp
sgemm_wg: Explicitly limit unroll to reduce stack spilling
2024-03-29 02:48:29 -07:00
main.cpp
sgemm_wg: Implement software barrier for inter-core synchronization
2024-03-12 15:34:42 -07:00
Makefile
sgemm_wg: Add missing makefile dep to common.h
2024-03-12 15:34:17 -07:00