kernels/tests/regression/flash_attention/kernel.cpp
Hansung Kim d3de1b674a flash: Compute exponents using prev/next/this rowmax values
maybe there is a better way than storing all three in sharedmem?
2024-08-15 22:10:02 -07:00

14 KiB