| Commit | Message | Date |
| --- | --- | --- |
| d702aa06b9 | Trim GPU restrict sync overhead | 2026-04-12 19:45:34 +08:00 |
| c8977d8356 | Optimize GPU RK4 stage sync path | 2026-04-12 18:36:05 +08:00 |
| d9287ea530 | Fix GPU RK4 boundary and sync correctness | 2026-04-12 12:13:47 +08:00 |
| b78874ef21 | Refine stable GPU AMR staging path | 2026-04-10 23:37:36 +08:00 |
| a089041c3b | Stabilize GPU AMR prolong/restrict paths | 2026-04-10 21:57:58 +08:00 |
| c578a15ecd | Fix GPU interpolation cache lifetime leaks | 2026-04-10 10:29:04 +08:00 |
| e1a0bff43c | Reduce redundant GPU host buffer preparation | 2026-04-09 21:20:45 +08:00 |
| cf3c6d6218 | Stabilize GPU buffer lifecycle around regrid | 2026-04-09 20:48:06 +08:00 |
| b0dd069a2b | Register GPU transfer buffers as pinned host memory | 2026-04-09 18:36:10 +08:00 |
| 5b00d49070 | Reduce staged GPU host-device copies | 2026-04-09 16:44:08 +08:00 |
| 42e851d19a | Cache repeated interpolation plans | 2026-04-09 15:21:01 +08:00 |
| 06fa643365 | Refine batched CUDA interpolation kernel | 2026-04-09 15:06:11 +08:00 |
| c47349b7a9 | Add batched CUDA patch interpolation path | 2026-04-09 14:56:01 +08:00 |
| ad999e4c5a | Add guarded GPU prolong3 path scaffold | 2026-04-09 14:28:36 +08:00 |
| e1e3b4a448 | Reduce GPU RK4 transfer overhead | 2026-04-09 12:11:40 +08:00 |
| 49409645c0 | Stabilize GPU output path and MPI sync | 2026-04-09 10:57:49 +08:00 |
| 4e3946a4f0 | Persist GPU RK4 stage caches | 2026-04-08 20:59:15 +08:00 |
| a0af9b8804 | Trim GPU main-path transfer overhead | 2026-04-08 20:16:25 +08:00 |
| 01ac1f9250 | Cache GPU main-path device buffers | 2026-04-08 19:43:17 +08:00 |
| ea470737db | Add runnable GPU main-path prototype | 2026-04-08 19:14:37 +08:00 |