Baseline optimization flags:
C programs: -openmp -O3 -IPF_fp_relaxed -ipo -ansi_alias (ONESTEP)
Fortran programs: -openmp -O3 -IPF_fp_relaxed -ipo (ONESTEP)
OpenMP runtime library libguide.a statically linked
Extra Flags:
331.art_l: -DINTS_PER_CACHELINE=32 -DDBLS_PER_CACHELINE=16
User environment:
OMP_NUM_THREADS 128
limit stacksize 128000
KMP_STACKSIZE 124M
KMP_LIBRARY TURNAROUND
OMP_DYNAMIC FALSE
KMP_SCHEDULE static,balanced
Required alternate sources:
Add critical region around update of linked list in parallel loop.
Approved src.alt available as ompl-purdue1-20040324.tar.gz
Used for 331.art_l, base.
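The kind of change described above can be sketched in C as follows. This is a minimal illustration only, not the approved src.alt patch: the node type, the function names, and the critical-section name list_update are all hypothetical and are not taken from the 331.art_l source.

```c
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical node type; not taken from the 331.art_l source. */
typedef struct node {
    int value;
    struct node *next;
} node_t;

/* Release every node of a list. */
void free_list(node_t *head)
{
    while (head != NULL) {
        node_t *next = head->next;
        free(head);
        head = next;
    }
}

/* Count the nodes of a list. */
int list_length(const node_t *head)
{
    int count = 0;
    for (; head != NULL; head = head->next)
        count++;
    return count;
}

/* Push one node per loop iteration onto a shared list.
   Returns the list head, or NULL if n <= 0 or an allocation failed. */
node_t *build_list(int n)
{
    node_t *head = NULL;
    int failed = 0;
    int i;

    #pragma omp parallel for shared(head, failed)
    for (i = 0; i < n; i++) {
        node_t *item = malloc(sizeof *item);
        if (item == NULL) {
            /* Record the failure under the same lock as the list update. */
            #pragma omp critical(list_update)
            failed = 1;
            continue;
        }
        item->value = i;

        /* Without this critical region, two threads could read the same
           head and one of the two insertions would be lost. */
        #pragma omp critical(list_update)
        {
            item->next = head;
            head = item;
        }
    }

    if (failed) {
        free_list(head);
        return NULL;
    }
    return head;
}
```

Only the pointer update is serialized; allocation and initialization of each node stay outside the critical region so that they can proceed in parallel. The order of nodes in the resulting list depends on thread scheduling.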
For all benchmarks, threads were bound to cores using the following submit command:
dplace -x2 -e -cNTM1,0 $command,
where NTM1 is the number of threads minus 1.
This binds threads in order of creation, beginning with the master
thread on core NTM1, the first slave thread on core NTM1-1, and so on.
The -x2 flag instructs dplace to skip placement of the lightweight
OpenMP monitor thread, which is created prior to the slave threads.
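The binding order described above amounts to a simple reverse mapping from creation order to core number. The sketch below states it in C, assuming that creation index 0 is the master thread and that the skipped monitor thread is not counted. The function name core_for_thread is hypothetical and is not part of dplace.

```c
/* Core index for the thread created at position creation_index
   (0 = master thread) when threads are bound in order of creation
   starting at core NTM1 = num_threads - 1 and counting down.
   The OpenMP monitor thread is assumed to be skipped and not counted.
   Returns -1 for out-of-range input. */
int core_for_thread(int creation_index, int num_threads)
{
    if (num_threads <= 0 || creation_index < 0 ||
        creation_index >= num_threads)
        return -1;
    return (num_threads - 1) - creation_index;
}
```

With OMP_NUM_THREADS set to 128, NTM1 is 127, so this mapping places the master thread on core 127, the first slave thread on core 126, and the last slave thread on core 0.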
System was configured to 64 chips by disabling extra chips at PROM.
For a description of SGI's compiler flags, portability flags, and
system parameters used to generate this result, please refer to the
SGI-20060719-Linux-Intel9.0-IPF.txt file in the flags directory.