AMSS-NCKU/AMSS_NCKU_source/bssn_class.C at 42b9cf1ad9859e86f0e2a29e25dec26d5785b71b

64-BitBrainstorm_2026/AMSS-NCKU

Files

CGH0S7 42b9cf1ad9 Optimize MPI Sync with merged transfers, caching, and async overlap

Phase 1: Merge N+1 transfer() calls into a single transfer() per
Sync(PatchList), reducing N+1 MPI_Waitall barriers to 1 via new
Sync_merged() that collects all intra-patch and inter-patch grid
segment lists into combined per-rank arrays.

Phase 2: Cache grid segment lists and reuse grow-only communication
buffers across RK4 substeps via SyncCache struct. Caches are per-level
and per-variable-list (predictor/corrector), invalidated on regrid.
Eliminates redundant build_ghost_gsl/build_owned_gsl0/build_gstl
rebuilds and malloc/free cycles between regrids.

Phase 3: Split Sync into async Sync_start/Sync_finish to overlap
Cartesian ghost zone exchange (MPI_Isend/Irecv) with Shell patch
synchronization. Uses MPI tag 2 to avoid conflicts with SH->Synch()
which uses transfer() with tag 1.

Also updates makefile.inc paths and flags for local build environment.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

2026-02-09 21:03:37 +08:00

337 KiB

Raw Blame History

View Raw

337 KiB Raw Blame History

337 KiB

Raw Blame History