Fix BSSN C gauge RHS parity

Fix lower-order C lopsided boundary fallbacks
Fix eighth-order C derivative and lopsided stencils
3 changed files with 13 additions and 3 deletions
--- a/AMSS_NCKU_source/TwoPunctures.C
+++ b/AMSS_NCKU_source/TwoPunctures.C
@@ -27,7 +27,7 @@ using namespace std;
 #endif

 #include "TwoPunctures.h"
-#include <mkl_cblas.h>
+#include <cblas.h>

 TwoPunctures::TwoPunctures(double mp, double mm, double b,
                           double P_plusx, double P_plusy, double P_plusz,
--- a/AMSS_NCKU_source/bssn_rhs_c.C
+++ b/AMSS_NCKU_source/bssn_rhs_c.C
@@ -1075,6 +1075,10 @@ int f_compute_rhs_bssn(int *ex, double &T,
        }
        #endif

+        #if (GAUGE == 2 || GAUGE == 3 || GAUGE == 4 || GAUGE == 5)
+        fderivs(ex,chi,dtSfx_rhs,dtSfy_rhs,dtSfz_rhs,X,Y,Z,SYM,SYM,SYM,Symmetry,Lev);
+        #endif
+
        for (int i = 0; i < all; i += 1) {
            #if (GAUGE == 0)
            betax_rhs[i] = FF * dtSfx[i];
@@ -1160,11 +1164,17 @@ int f_compute_rhs_bssn(int *ex, double &T,
        lopsided_kodis(ex,X,Y,Z,gyz,gyz_rhs,betax,betay,betaz,Symmetry,SAA,eps);
        lopsided_kodis(ex,X,Y,Z,betaz,betaz_rhs,betax,betay,betaz,Symmetry,SSA,eps);
        lopsided_kodis(ex,X,Y,Z,dzz,gzz_rhs,betax,betay,betaz,Symmetry,SSS,eps);
+        #if (GAUGE == 0 || GAUGE == 2 || GAUGE == 3 || GAUGE == 6 || GAUGE == 7)
        lopsided_kodis(ex,X,Y,Z,dtSfx,dtSfx_rhs,betax,betay,betaz,Symmetry,ASS,eps);
+        #endif
        lopsided_kodis(ex,X,Y,Z,Axx,Axx_rhs,betax,betay,betaz,Symmetry,SSS,eps);
+        #if (GAUGE == 0 || GAUGE == 2 || GAUGE == 3 || GAUGE == 6 || GAUGE == 7)
        lopsided_kodis(ex,X,Y,Z,dtSfy,dtSfy_rhs,betax,betay,betaz,Symmetry,SAS,eps);
+        #endif
        lopsided_kodis(ex,X,Y,Z,Axy,Axy_rhs,betax,betay,betaz,Symmetry,AAS,eps);
+        #if (GAUGE == 0 || GAUGE == 2 || GAUGE == 3 || GAUGE == 6 || GAUGE == 7)
        lopsided_kodis(ex,X,Y,Z,dtSfz,dtSfz_rhs,betax,betay,betaz,Symmetry,SSA,eps);
+        #endif
        lopsided_kodis(ex,X,Y,Z,Axz,Axz_rhs,betax,betay,betaz,Symmetry,ASA,eps);
        lopsided_kodis(ex,X,Y,Z,Ayy,Ayy_rhs,betax,betay,betaz,Symmetry,SSS,eps);
        lopsided_kodis(ex,X,Y,Z,Ayz,Ayz_rhs,betax,betay,betaz,Symmetry,SAA,eps);
--- a/AMSS_NCKU_source/gaussj.C
+++ b/AMSS_NCKU_source/gaussj.C
@@ -17,8 +17,8 @@ using namespace std;
 #include <math.h>
 #endif

-// Intel oneMKL LAPACK interface
-#include <mkl_lapacke.h>
+// LAPACKE interface (AOCL for AOCC, oneMKL for Intel)
+#include <lapacke.h>
 /* Linear equation solution using Intel oneMKL LAPACK.
 a[0..n-1][0..n-1] is the input matrix. b[0..n-1] is input
 containing the right-hand side vectors. On output a is
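
Review note on the bssn_rhs_c.C hunks: the two new preprocessor guards cover different sets of GAUGE values. The first gates the fderivs call on chi (outputs written to dtSfx_rhs/dtSfy_rhs/dtSfz_rhs), the second gates the three lopsided_kodis calls on dtSfx/dtSfy/dtSfz. The sketch below is only a reading aid, not code from the PR: the helper names takes_chi_derivs and dissipates_dtSf are hypothetical, and only the two conditions are copied from the diff.

```cpp
// Sketch only: helper names are hypothetical. The two conditions are copied
// verbatim from the #if guards this diff adds to bssn_rhs_c.C.
#include <cassert>

// Mirrors: #if (GAUGE == 2 || GAUGE == 3 || GAUGE == 4 || GAUGE == 5)
// True when the new fderivs(ex,chi,dtSfx_rhs,...) call is compiled in.
constexpr bool takes_chi_derivs(int gauge) {
    return gauge == 2 || gauge == 3 || gauge == 4 || gauge == 5;
}

// Mirrors: #if (GAUGE == 0 || GAUGE == 2 || GAUGE == 3 || GAUGE == 6 || GAUGE == 7)
// True when lopsided_kodis is applied to dtSfx/dtSfy/dtSfz.
constexpr bool dissipates_dtSf(int gauge) {
    return gauge == 0 || gauge == 2 || gauge == 3 || gauge == 6 || gauge == 7;
}

// Compile-time spot checks of how the two guards overlap.
static_assert(takes_chi_derivs(2) && dissipates_dtSf(2), "GAUGE 2: both guards active");
static_assert(takes_chi_derivs(3) && dissipates_dtSf(3), "GAUGE 3: both guards active");
static_assert(takes_chi_derivs(4) && !dissipates_dtSf(4), "GAUGE 4: fderivs only");
static_assert(!takes_chi_derivs(0) && dissipates_dtSf(0), "GAUGE 0: lopsided_kodis only");
static_assert(!takes_chi_derivs(1) && !dissipates_dtSf(1), "GAUGE 1: neither guard active");
```

Read this way, GAUGE 2 and 3 take both paths, 4 and 5 take only the fderivs call, 0, 6 and 7 take only the lopsided_kodis calls, and 1 takes neither.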
SHA1        Author  Date (Message indented below each entry)

b83baf8bc0  CGH0S7  2026-05-15 18:04:54 +08:00
    Fix BSSN C gauge RHS parity

7ca0433c65  CGH0S7  2026-05-14 21:36:42 +08:00
    Fix lower-order C lopsided boundary fallbacks

1d7d853691  CGH0S7  2026-05-14 20:40:28 +08:00
    Fix eighth-order C derivative and lopsided stencils

d5d8cda25a  CGH0S7  2026-05-14 16:02:31 +08:00
    Fix C derivative ghost-buffer indexing across FD orders

57c93ecb91  CGH0S7  2026-05-14 15:20:30 +08:00
    Fix fourth-order C lopsided and KO stencil indexing

b25d5f89dc  CGH0S7  2026-05-14 14:09:34 +08:00
    Fix shell C kernel symbol names for Fortran linkage (fderivs_sh_ etc.)

    Shell C functions must export Fortran-compatible symbols with trailing
    underscore so bssn_rhs_ss.f90 and getnp4.f90 can link when WithShell is
    active and USE_CXX_SHELL_KERNELS=1 replaces Fortran diff_new_sh.o.

    Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

8e8a93bad0  CGH0S7  2026-05-14 11:38:10 +08:00
    Add C kernel for BSSN-EM (Maxwell/electromagnetic field) RHS computation

    New bssn_em_rhs_c.C computes EM field RHS (E,B,Kpsi,Kphi) and
    stress-energy tensor, then calls the C BSSN RHS kernel with source terms.
    Replaces empart.f90 when USE_CXX_EM_KERNEL=1. Supports all ghost_width
    orders via existing derivative kernels. Controlled by USE_CXX_EM_KERNEL
    switch (default 0, experimental).

    Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

d46418f1c3  CGH0S7  2026-05-14 11:37:45 +08:00
    Add C implementations of shell-patch derivative kernels (WithShell support)

    New files provide C equivalents of Fortran diff_new_sh.f90 and
    kodiss_sh.f90:
    - fderivs_sh_c.C: first derivatives in shell (rho, sigma, R) coords
    - fdderivs_sh_c.C: second derivatives in shell coords
    - fderivs_shc_c.C: shell first derivs + chain rule to Cartesian
    - fdderivs_shc_c.C: shell second derivs + chain rule to Cartesian
    - kodiss_sh_c.C: Kreiss-Oliger dissipation on shell patches

    Also add symmetry_stbd() C implementation and shell fh indexing to
    share_func.h. Controlled by USE_CXX_SHELL_KERNELS switch (default 0,
    experimental).

    Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

fd18380a42  CGH0S7  2026-05-14 11:36:33 +08:00
    Add full FD order support (2nd/4th/6th/8th) to C derivative kernels via ghost_width dispatch

    Wrap each C kernel in #if (ghost_width == N) blocks matching Fortran
    stencil coefficients from diff_new.f90, kodiss.f90, and lopsidediff.f90.
    Add fast-path indexing for ord=1,4,5 in share_func.h.

    Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

5956a952a0  CGH0S7  2026-05-12 15:31:37 +08:00
    Migrate build system from Intel oneAPI to AMD AOCC/AOCL toolchain

    - Add TOOLCHAIN=aocc option with flang/clang++/mpicxx compilers
    - Replace Intel flags (-xHost/-fma/-ipo/-qopenmp) with AOCC flags
      (-march=znver5/-ffast-math/-flto/-fopenmp) targeting EPYC 9755
    - Replace Intel oneMKL with AMD AOCL (BLIS + libFLAME + amdlibm)
    - Replace Intel TBBMALLOC with system jemalloc
    - Change MKL-specific headers to standard CBLAS/LAPACKE
      (TwoPunctures.C, gaussj.C)
    - Guard TBBMALLOC to Intel toolchain only

    Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
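
Review note on the header change (mkl_cblas.h to cblas.h in TwoPunctures.C): the diff makes the switch unconditionally. If both toolchains need to keep building from one source tree, one possible alternative is to choose the header at compile time. The sketch below is hypothetical and not part of this PR. The BLAS_HEADER macro and blas_header() function are made-up names, and nothing in this diff establishes whether a given oneMKL installation exposes the plain cblas.h name. It relies on the `__has_include` preprocessor operator (C++17, also available as an extension in GCC and Clang).

```cpp
// Hypothetical sketch, not part of the PR: pick whichever CBLAS header the
// active toolchain provides. BLAS_HEADER and blas_header() are made-up names.
#include <cassert>
#include <cstring>

#if defined(__has_include)
#  if __has_include(<cblas.h>)
#    include <cblas.h>        // standard name, as used after this PR
#    define BLAS_HEADER "cblas.h"
#  elif __has_include(<mkl_cblas.h>)
#    include <mkl_cblas.h>    // MKL-specific name, as used before this PR
#    define BLAS_HEADER "mkl_cblas.h"
#  endif
#endif

// Neither header was found, or the compiler lacks __has_include.
#ifndef BLAS_HEADER
#  define BLAS_HEADER "none"
#endif

// Reports which header the preprocessor selected, for build diagnostics.
inline const char *blas_header() { return BLAS_HEADER; }
```

A build log could print blas_header() once at startup to confirm which library interface a given binary was compiled against.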