Changes: 1. Enhanced compiler flags in makefile.inc: - Added -march=native -mtune=native for CPU-specific optimizations - Added -funroll-loops for loop unrolling - Added -qopt-prefetch for aggressive prefetching - Added -qopt-report=5 for optimization analysis 2. Added Intel vectorization directives to diff_new.f90: - Added !DIR$ SIMD and !DIR$ IVDEP to critical loops - Targets fderivs and related finite difference functions - Forces compiler to vectorize inner loops Expected improvement: 15-30% speedup in computation loops Mathematical equivalence: Preserved (compiler optimizations only) Precision: IEEE 754 maintained
2.6 KiB
Executable File
2.6 KiB
Executable File