|
|
39450228f5
|
Accelerate Shell-Patch interpolation fast paths
|
2026-05-08 13:26:16 +08:00 |
|
|
|
dcc83bafcb
|
Support 2nd and 8th order CUDA AMR paths
|
2026-05-07 20:31:26 +08:00 |
|
|
|
c4d8d41b25
|
Cover Z4C CUDA AMR restrict prolong
|
2026-05-07 19:49:09 +08:00 |
|
|
|
0076b3ca18
|
Optimize 6th-order CUDA AMR stencils
|
2026-05-07 19:22:37 +08:00 |
|
|
|
5525465cad
|
Support CUDA finite-difference order selection
|
2026-05-07 16:28:02 +08:00 |
|
|
|
cb911dec06
|
Add EM GPU fast paths and defaults
|
2026-05-07 12:18:56 +08:00 |
|
|
|
dd0e20d8c7
|
Fix BSSN-EScalar CUDA boundary and scalar KO
|
2026-05-06 15:44:35 +08:00 |
|
|
|
ae64a22178
|
Complete BSSN-EScalar CUDA resident transfers
|
2026-05-05 23:57:42 +08:00 |
|
|
|
85fe29cc2e
|
Optimize BSSN-EScalar CUDA path
|
2026-05-05 10:47:46 +08:00 |
|
|
|
b1974ef146
|
Stabilize device AMR restrict across regrid
|
2026-04-30 20:01:18 +08:00 |
|
|
|
be9033f449
|
Add optional CUDA surface interpolation
|
2026-04-30 19:21:19 +08:00 |
|
|
|
8486532920
|
Add resident BSSN GPU point interpolation
|
2026-04-30 11:39:15 +08:00 |
|
|
|
18e9c9cc50
|
Optimize BSSN CUDA resident AMR prolong path
|
2026-04-30 10:58:15 +08:00 |
|
|
|
1ee229a91f
|
Add keyed BSSN CUDA resident banks
|
2026-04-29 19:44:19 +08:00 |
|
|
|
68eab03bac
|
Add opt-in BSSN CUDA resident AMR path
|
2026-04-29 19:15:37 +08:00 |
|
|
|
090d8657ae
|
Optimize BSSN CUDA state transfers
|
2026-04-29 18:34:31 +08:00 |
|
|
|
22c1e7168b
|
Optimize BSSN CUDA resident state and CUDA-aware MPI
|
2026-04-29 17:05:10 +08:00 |
|
|
|
bb20c9a876
|
Fix ADM Constraint Violation Analysis
|
2026-04-15 19:19:16 +08:00 |
|
|
|
8fe60ea703
|
Add zero matter handling and interpolation for resident state in CUDA BSSN
|
2026-04-15 00:25:53 +08:00 |
|
|
|
9ab7e7c7f9
|
Fuse phases 5 and 6 for Gamma_rhs computation and optimize phases 8 and 9 for efficiency
|
2026-04-14 23:23:04 +08:00 |
|
|
|
f9119e8a2a
|
Add resident-GA mode switch and simplify sync logic
|
2026-04-14 21:09:27 +08:00 |
|
|
|
726d743376
|
Fuse Ricci assembly and optimize trK/Aij gauge kernels
|
2026-04-14 19:20:12 +08:00 |
|
|
|
af344bf1e5
|
Add Phase-10 Ricci kernels and batch launch flow
|
2026-04-14 19:00:22 +08:00 |
|
|
|
7191fc0b96
|
Move resident sync comm buffers into StepAllocation pool
|
2026-04-13 21:04:44 +08:00 |
|
|
|
b3ec244cf9
|
Add batched first/second derivative kernels for CUDA RHS
|
2026-04-13 20:51:08 +08:00 |
|
|
|
e952ee8e91
|
Batch GA/BH subset sync with indexed GPU pack/unpack buffers
|
2026-04-13 20:40:09 +08:00 |
|
|
|
c5d1268dd1
|
Batch patch-boundary copy and gate CPU BC in GPU substeps
|
2026-04-13 11:52:17 +08:00 |
|
|
|
4bdfc90f22
|
Pass pointer tables as kernel args and skip redundant symbol uploads
|
2026-04-13 11:19:00 +08:00 |
|
|
|
c49a4e00c9
|
Batch symbd_pack/lopsided/kodiss over all state variables
|
2026-04-13 11:02:55 +08:00 |
|
|
|
1b3c0b80d2
|
Refactor CUDA step buffers to remove loop-time allocations
|
2026-04-13 10:33:03 +08:00 |
|
|
|
636e35bfd8
|
Add direct CUDA resident-state sync path and profiling hooks
|
2026-04-13 00:57:05 +08:00 |
|
|
|
7f2a391dd2
|
Cache matter fields in StepContext across RK4 substeps
|
2026-04-12 22:19:45 +08:00 |
|
|
|
4fa12a2009
|
Integrate CUDA support into RK4 substep execution
|
2026-04-12 22:11:44 +08:00 |
|
|
|
86a683de26
|
Replace legacy ABEGPU stack with ABE_CUDA backend
|
2026-04-12 21:19:14 +08:00 |
|