This is the same kernel as vecadd but repeated in a for-loop many times so that the runtime overhead at the startup is amortized.
2.4 KiB
2.4 KiB
This is the same kernel as vecadd but repeated in a for-loop many times so that the runtime overhead at the startup is amortized.