Files
kernels/tests/regression/flash_attention/flash_impl.hpp
Hansung Kim a17edac875 flash: Fix barrier stall with DEBUG
Verified for up to P_expected on 2nd iter; O_before_PV is partially
correct
2024-09-09 17:02:05 -07:00

16 KiB