Files
kernels/tests/kernel/tensor
Hansung Kim 34d0956cd5 tensor: Attempt row-major mapping for C store (WIP)
Doesn't work because 1x2 jagged mapping is required to achieve
throughput for storing the bigger C matrix (2x4, vs. 2x2 in A).
2024-10-02 15:14:55 -07:00
..
2024-08-19 17:37:26 -07:00
2024-07-31 11:55:28 -07:00
2024-03-27 00:26:04 -07:00

tensor

Unittest kernel for the tensor core.

Build

$ python3 generate_matrix.py
$ make

The generated ELF binary is run standalone; the argument and input matrix binary are hardcoded into the binary.