Enforce square shapes of tiles in smem. TODO need to configure loop bounds correctly.
FIXME; only tested with WARP_SPECIALIZED == 0.