Hansung Kim
|
cc7b34ec5b
|
vecaddx: Write args.bin and input.bin
|
2024-03-26 10:44:02 -07:00 |
|
Hansung Kim
|
ff401bdec0
|
Cleanup tests/.gitignore
|
2024-03-24 01:47:00 -07:00 |
|
Hansung Kim
|
8f3474b151
|
Don't clean *.bin
|
2024-03-24 01:45:08 -07:00 |
|
Richard Yan
|
c18267443f
|
matmul kernel switch to proper fence and fsm
|
2024-03-20 15:22:25 -07:00 |
|
Richard Yan
|
94ad1850a9
|
implement correct gemmini fence and loop fsm support
|
2024-03-20 15:18:31 -07:00 |
|
Hansung Kim
|
2036d37840
|
sgemm_wg: Prevent run-ahead using ternary flags; reduce mem accesses
|
2024-03-13 21:35:24 -07:00 |
|
Hansung Kim
|
510a834db5
|
sgemm_wg: Implement software barrier for inter-core synchronization
|
2024-03-12 15:34:42 -07:00 |
|
Hansung Kim
|
fbe872c831
|
sgemm_wg: Add missing makefile dep to common.h
|
2024-03-12 15:34:17 -07:00 |
|
joshua
|
beb3dce46d
|
integer reduction unit
|
2024-03-06 01:39:17 -08:00 |
|
Hansung Kim
|
6f4dfe5a0e
|
sgemm_wg: Implement 2D threadtiling
|
2024-02-29 14:40:54 -08:00 |
|
Hansung Kim
|
a06b2dd20e
|
sgemm_wg: Cleanup & proper unroll
|
2024-02-28 21:17:42 -08:00 |
|
Hansung Kim
|
46f242e520
|
sgemm_wg: Constantify BM/BN/BK/TM, computationally set gridsize and TB/core
|
2024-02-27 22:23:25 -08:00 |
|
Hansung Kim
|
27646bb507
|
sgemm_wg: Implement multiple C per thread with sliding A/B blocks
|
2024-02-27 22:06:01 -08:00 |
|
Hansung Kim
|
2b1b5fe537
|
convolution: Fix write_operand_file after upstream merge
|
2024-02-27 15:45:22 -08:00 |
|
Hansung Kim
|
f1e7407d3a
|
sgemm_wg: Run multiple threadblock per core
|
2024-02-27 15:44:04 -08:00 |
|
Richard Yan
|
5b1c527186
|
Merge branch 'kernels' of https://github.com/hansungk/vortex into kernels
|
2024-02-24 00:27:23 -08:00 |
|
Richard Yan
|
914864206a
|
MMIO gemmini matmul kernel
|
2024-02-24 00:27:16 -08:00 |
|
Hansung Kim
|
d2da0d3394
|
sgemm_wg: Parameterize threadblock dimensions
|
2024-02-17 18:05:59 -08:00 |
|
Hansung Kim
|
301f1ca260
|
sgemm_wg: Implement blocking over k-dimension
|
2024-02-16 16:20:57 -08:00 |
|
Hansung Kim
|
5f79e8a3f1
|
sgemm_wg: reference matmul in cpu
|
2024-02-12 22:29:38 -08:00 |
|
Hansung Kim
|
6b420aceb6
|
sgemm_wg: write simple C=A*A matmul
|
2024-02-12 22:22:28 -08:00 |
|
Hansung Kim
|
a43d5eb1a7
|
Merge remote-tracking branch 'upstream/master' into kernels
|
2024-02-12 20:50:32 -08:00 |
|
Hansung Kim
|
6a1a506b64
|
sgemm_wg: save args and input bin
|
2024-02-12 20:49:08 -08:00 |
|
Hansung Kim
|
f586ab28df
|
vecadd: save operand to file
|
2024-02-12 20:49:08 -08:00 |
|
Hansung Kim
|
5a216ef9ca
|
flops: unroll by 16
|
2024-02-12 20:49:08 -08:00 |
|
Richard Yan
|
c258557999
|
Merge branch 'kernels' of https://github.com/hansungk/vortex into kernels
|
2024-02-08 17:05:37 -08:00 |
|
Richard Yan
|
12bdab8043
|
update gemmini matmul kernel
|
2024-02-08 17:00:19 -08:00 |
|
Hansung Kim
|
ad8bf9b223
|
Add sgemm_wg C kernel
|
2024-02-07 21:31:08 -08:00 |
|
Hansung Kim
|
b5bfa7d4b9
|
Fix bogus spad address
|
2024-02-01 14:05:13 -08:00 |
|
Hansung Kim
|
0462a91953
|
Update mmio kernel to do single gemm
|
2024-02-01 13:52:29 -08:00 |
|
Hansung Kim
|
7f6f1d605f
|
Add bare mmio kernel
|
2024-01-24 16:24:19 -08:00 |
|
Hansung Kim
|
a5c50b60c8
|
Add 'smemcoherence' kernel
|
2024-01-02 20:28:40 -08:00 |
|
Hansung Kim
|
edb385f138
|
Increase smem read loop iter
|
2024-01-01 14:26:19 -08:00 |
|
Hansung Kim
|
ee1ed315e2
|
Write out operand files in sharedmem kernel
|
2024-01-01 14:26:10 -08:00 |
|
Hansung Kim
|
1586ea93ef
|
Add new 'sharedmem' kernel
|
2024-01-01 13:56:46 -08:00 |
|
Blaise Tine
|
bd18b03cc3
|
minor update
|
2023-12-31 15:29:04 -08:00 |
|
Blaise Tine
|
e7f8b40d93
|
minor update
|
2023-12-31 11:46:41 -08:00 |
|
Blaise Tine
|
031d24e695
|
minor updates
|
2023-12-30 00:52:44 -08:00 |
|
Blaise Tine
|
36f5dd87fe
|
minor update
|
2023-12-28 12:22:22 -08:00 |
|
Blaise Tine
|
e217bc2c23
|
adding tracking for SFU stalls
|
2023-12-28 12:12:11 -08:00 |
|
Blaise Tine
|
e04e026a14
|
profiling update
minor updates
|
2023-12-18 04:43:44 -08:00 |
|
Blaise Tine
|
c6845a4c8d
|
profiling timing optimization
minor update
minor update
minor update
|
2023-12-18 04:43:10 -08:00 |
|
Hansung Kim
|
dd289ef964
|
Write operand to file in matmul kernel
|
2023-11-29 11:30:54 -08:00 |
|
Blaise Tine
|
9dc5793046
|
minor update
|
2023-11-27 02:21:47 -08:00 |
|
Blaise Tine
|
1271c9c03f
|
minor update
|
2023-11-27 02:12:12 -08:00 |
|
Blaise Tine
|
2f1171ca76
|
minor update
|
2023-11-27 02:04:22 -08:00 |
|
Blaise Tine
|
43154cf738
|
minor updates
|
2023-11-16 23:41:59 -08:00 |
|
Blaise Tine
|
ede5e1c311
|
minor update
|
2023-11-15 00:28:26 -08:00 |
|
Blaise Tine
|
61e3442ef8
|
adding opencl convolution benchmark
|
2023-11-14 22:31:30 -08:00 |
|
Blaise Tine
|
4e7a536918
|
adding tensor regression test.
|
2023-11-14 05:37:46 -08:00 |
|