| 42e851d19a | Cache repeated interpolation plans | 2026-04-09 15:21:01 +08:00 |
| 06fa643365 | Refine batched CUDA interpolation kernel | 2026-04-09 15:06:11 +08:00 |
| c47349b7a9 | Add batched CUDA patch interpolation path | 2026-04-09 14:56:01 +08:00 |
| ad999e4c5a | Add guarded GPU prolong3 path scaffold | 2026-04-09 14:28:36 +08:00 |
| e1e3b4a448 | Reduce GPU RK4 transfer overhead | 2026-04-09 12:11:40 +08:00 |
| 49409645c0 | Stabilize GPU output path and MPI sync | 2026-04-09 10:57:49 +08:00 |
| 4e3946a4f0 | Persist GPU RK4 stage caches | 2026-04-08 20:59:15 +08:00 |
| a0af9b8804 | Trim GPU main-path transfer overhead | 2026-04-08 20:16:25 +08:00 |
| 01ac1f9250 | Cache GPU main-path device buffers | 2026-04-08 19:43:17 +08:00 |
| ea470737db | Add runnable GPU main-path prototype | 2026-04-08 19:14:37 +08:00 |