Hansung Kim | fc8f0c99f0 | Merge branch 'tensor_core' into kernels | 2024-06-07 18:27:02 -07:00
Richard Yan | 7cf59c9480 | dma and demo kernels | 2024-06-07 18:11:19 -07:00
Hansung Kim | 483f975439 | Merge branch 'kernels' into tensor_core | 2024-06-07 16:27:01 -07:00
Hansung Kim | d5adacda30 | Add args.bin to ELF; Change KERNEL_ARG_DEV_MEM_ADDR for sgemm_{wg,gemmini,tcore} | 2024-06-06 15:19:39 -07:00
Richard Yan | 33066af56e | cisc gemmini | 2024-05-08 15:46:20 -07:00
Richard Yan | 1b6ebf86a1 | update gemmini kernels | 2024-05-02 15:16:55 -07:00
Richard Yan | 01f4a69ae9 | dma mvout, double buffering & other opts | 2024-04-28 01:18:51 -07:00
Richard Yan | d21e7b92c7 | internal accumulation, forced rematerialization, better unrolling | 2024-04-25 15:28:12 -07:00
Richard Yan | 4e9855dc33 | highly unrolled a/b load | 2024-04-16 22:19:30 -07:00
Richard Yan | 449d99f0bb | dram gemm kernel | 2024-04-16 17:15:22 -07:00
Richard Yan | 0bb7aeb45b | add gpu+gemmini gemm kernel | 2024-04-15 10:13:37 -07:00