Baseline optimization flags:
C programs: -openmp -O3 -IPF_fp_relaxed -ipo -ansi -ansi_alias (ONESTEP)
Fortran programs: -openmp -O3 -IPF_fp_relaxed -ipo (ONESTEP)
OpenMP runtime library libguide.a statically linked
Extra Flags:
331.art_l: -DINTS_PER_CACHELINE=32 -DDBLS_PER_CACHELINE=16
User environment:
OMP_NUM_THREADS 64
limit stacksize 256000
KMP_STACKSIZE 124M
KMP_LIBRARY TURNAROUND
OMP_DYNAMIC FALSE
KMP_SCHEDULE static,balanced
Peak optimization flags:
311.wupwise_l: basepeak=true
313.swim_l: basepeak=true
315.mgrid_l: basepeak=true
317.applu_l: basepeak=true
321.equake_l: basepeak=true
325.apsi_l: basepeak=true
327.gafort_l: basepeak=true
329.fma3d_l: basepeak=true
331.art_l: basepeak=true
Required alternate sources:
Adds a critical region around the update of a linked list in a parallel loop.
Approved src.alt available as ompl-purdue1-20040324.tar.gz
Used for 331.art_l, base and peak.
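
The kind of change described above can be sketched as follows. This is a minimal illustration only, not the contents of the approved src.alt: the node type, the function names, and the critical-section name list_update are all hypothetical. It shows a parallel loop in which each iteration pushes a node onto a shared linked list, with the pointer update wrapped in an OpenMP critical region so that only one thread modifies the list head at a time.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical node type; not taken from the 331.art_l source. */
typedef struct node {
    int value;
    struct node *next;
} node;

/* Build a list of `count` nodes from a parallel loop.
   `head` is shared between threads, so its update is protected
   by a named critical region. Node order is unspecified. */
node *build_list_parallel(int count)
{
    node *head = NULL;
    int i;

    assert(count >= 0);

    #pragma omp parallel for
    for (i = 0; i < count; i++) {
        node *n = malloc(sizeof *n);
        if (n == NULL)
            continue;               /* sketch: skip on allocation failure */
        n->value = i;

        /* Without this region, two threads could read the same
           `head` and one of the insertions would be lost. */
        #pragma omp critical(list_update)
        {
            n->next = head;
            head = n;
        }
    }
    return head;
}

/* Count the nodes in the list. */
int list_length(const node *head)
{
    int len = 0;
    for (; head != NULL; head = head->next)
        len++;
    return len;
}

/* Release every node in the list. */
void free_list(node *head)
{
    while (head != NULL) {
        node *next = head->next;
        free(head);
        head = next;
    }
}
```

When compiled without OpenMP support the pragmas are ignored and the loop runs serially with the same result. With OpenMP enabled, the critical region serializes only the two pointer assignments, so the allocation and initialization of each node still proceed in parallel.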
For all benchmarks, threads were bound to cores using the following submit command:
dplace -x2 -cNTM1,0 $command,
where NTM1 is the number of threads minus 1.
This binds threads in order of creation, beginning with the master
thread on core NTM1, the first slave thread on core NTM1-1, and so on.
The -x2 flag instructs dplace to skip placement of the lightweight
OpenMP monitor thread, which is created prior to the slave threads.
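
The placement order described above can be written as a small function. This is a sketch under stated assumptions rather than a description of dplace internals: the function name core_for_thread and its parameters are hypothetical, and the index counts only the threads that dplace actually places (0 for the master, 1 for the first slave thread, and so on), with the skipped monitor thread left out of the count.

```c
#include <assert.h>

/* Map a placed thread to its core, following the order in the notes:
   the master goes on core NTM1 = num_threads - 1, and each later
   thread goes on the next lower core, ending at core 0.
   placed_index: 0 = master, 1 = first slave thread, ... */
int core_for_thread(int placed_index, int num_threads)
{
    assert(num_threads > 0);
    assert(placed_index >= 0 && placed_index < num_threads);
    return (num_threads - 1) - placed_index;
}
```

With OMP_NUM_THREADS set to 64, NTM1 is 63, so core_for_thread(0, 64) gives 63 for the master thread, core_for_thread(1, 64) gives 62 for the first slave thread, and core_for_thread(63, 64) gives 0 for the last one.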
For a description of SGI's compiler flags, portability flags, and
system parameters used to generate this result, please refer to the
SGI-20041118-Linux-Intel8.1-IPF.txt file in the flags directory.