Commit Graph

18 Commits

Author | SHA1 | Message | Date
Hansung Kim | db2789bf23 | Add asm label for cisc compute | 2024-10-02 10:59:14 -07:00
Hansung Kim | 28b2eaec8f | sgemm_gemmini_dma: Fix tile size to (128,64,128) | 2024-09-10 18:29:40 -07:00
Richard Yan | dd3244fba0 | large fp16 kernel | 2024-09-05 16:22:38 -07:00
Richard Yan | 4fddca3d1a | fp16 kernel | 2024-08-06 02:43:44 -07:00
Hansung Kim | 63418a7496 | sgemm_gemmini_dma: Skip mvout to scratchpad. Not necessary either for activation on gmem | 2024-06-19 20:49:44 -07:00
Richard Yan | 12a96d9c16 | Merge branch 'kernels' of https://github.com/hansungk/vortex-private into kernels | 2024-06-19 17:46:24 -07:00
Richard Yan | a1e165724f | skip move to spad | 2024-06-19 17:45:58 -07:00
Hansung Kim | bebdd3353e | Use SWISH in activate_block for tcore and gemmini | 2024-06-19 15:41:50 -07:00
Hansung Kim | ae9e707280 | sgemm_{gemmini_dma,tcore}: Separate activate_block | 2024-06-19 14:50:22 -07:00
Hansung Kim | b586e0f881 | sgemm_gemmini_dma: Update activation to match tcore | 2024-06-18 15:30:12 -07:00
Hansung Kim | 1a44063c5d | sgemm_gemmini_dma: Initial activation kernel with gemmini+DMA. Currently does spurious fmul's in repetition. | 2024-06-17 16:56:29 -07:00
Richard Yan | f37f5d5612 | dual gemmini kernel + quad core vortex | 2024-06-12 02:12:38 -07:00
Richard Yan | 357435bc96 | Merge branch 'kernels' of https://github.com/hansungk/vortex-private into kernels | 2024-06-09 14:35:02 -07:00
Richard Yan | c327474e3b | power specific code for kernel | 2024-06-09 14:34:58 -07:00
Hansung Kim | e4eed20de3 | sgemm_gemmini_dma: Fix device addr for operands | 2024-06-09 13:35:16 -07:00
Hansung Kim | aaf4a89b57 | Fix asm label already defined error | 2024-06-07 19:55:28 -07:00
Hansung Kim | 9e8988df6b | Patch args device address for dma kernel | 2024-06-07 18:32:07 -07:00
Richard Yan | 7cf59c9480 | dma and demo kernels | 2024-06-07 18:11:19 -07:00