CGH0S7
|
039dce4d65
|
Add aggressive compiler optimizations and vectorization directives

Changes:
1. Enhanced compiler flags in makefile.inc:
   - Added -march=native -mtune=native for CPU-specific optimizations
   - Added -funroll-loops for loop unrolling
   - Added -qopt-prefetch for aggressive prefetching
   - Added -qopt-report=5 for optimization analysis
2. Added Intel vectorization directives to diff_new.f90:
   - Added !DIR$ SIMD and !DIR$ IVDEP to critical loops
   - Targets fderivs and related finite difference functions
   - Forces compiler to vectorize inner loops

Expected improvement: 15-30% speedup in computation loops
Mathematical equivalence: Preserved (compiler optimizations only)
Precision: IEEE 754 maintained
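
Illustration of the directive placement described in item 2. This is a minimal sketch, not the actual code in diff_new.f90: the program name, the routine ddx_centered, the second-order centered stencil, and the grid size are all assumptions chosen for the example, and the real fderivs interface may differ. Compilers that do not recognise the Intel directives treat the !DIR$ lines as ordinary comments, so the sketch should build with any Fortran compiler.

```fortran
! Hypothetical sketch: NOT the fderivs routine from diff_new.f90.
program simd_directive_sketch
  implicit none
  integer, parameter :: dp = kind(1.0d0)
  integer, parameter :: n = 1024
  real(dp) :: f(n), dfdx(n), h, err
  integer :: i

  ! Sample f(x) = sin(x) on a uniform grid over [0, 1].
  h = 1.0_dp / real(n - 1, dp)
  do i = 1, n
    f(i) = sin(real(i - 1, dp) * h)
  end do

  call ddx_centered(f, dfdx, n, h)

  ! Compare interior points with the exact derivative cos(x).
  err = 0.0_dp
  do i = 2, n - 1
    err = max(err, abs(dfdx(i) - cos(real(i - 1, dp) * h)))
  end do
  print *, 'max interior error:', err

contains

  ! Assumed stencil: second-order centered difference in the interior,
  ! first-order one-sided differences at the two end points.
  subroutine ddx_centered(f, dfdx, n, h)
    integer, intent(in) :: n
    real(dp), intent(in) :: f(n), h
    real(dp), intent(out) :: dfdx(n)
    real(dp) :: inv2h
    integer :: i

    inv2h = 0.5_dp / h

    ! IVDEP tells the compiler to assume there is no loop-carried
    ! dependence; SIMD asks it to vectorize the loop that follows.
    ! Both are safe here because each iteration writes only dfdx(i)
    ! and reads only f, which is a separate intent(in) array.
    !DIR$ IVDEP
    !DIR$ SIMD
    do i = 2, n - 1
      dfdx(i) = (f(i + 1) - f(i - 1)) * inv2h
    end do

    dfdx(1) = (f(2) - f(1)) / h
    dfdx(n) = (f(n) - f(n - 1)) / h
  end subroutine ddx_centered

end program simd_directive_sketch
```

The directives only assert that the loop is free of dependences, they do not check it. A loop that accumulates into a scalar or reads elements written by earlier iterations would need its own review before being annotated this way. The OpenMP `!$omp simd` construct is a portable way to express the same request.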
|
2026-01-19 10:17:31 +08:00 |
|