Base Notes:
C: -fast -xopenmp -xalias_level=std -xipo=2
-xprefetch_level=3 -xcode=abs44 -m64 -lmtmalloc
-g -xpagesize=4m -Xc -xprofile
f90: -fast -openmp -xcode=abs44 -m64 -xipo=2 -autopar
-fma=fused -g -xpagesize=4m -xprofile
ONESTEP=yes
Extra art allowed flags:
331.art_l: -DINTS_PER_CACHELINE=16 -DDBLS_PER_CACHELINE=8
Peak Notes:
ONESTEP=yes
311.wupwise_l: -fast -openmp -xunroll=4 -autopar -m64 -xcode=abs44
-xipo=2 -fma=fused -xpagesize=512k
-Qoption iropt -Athr,-Apf:l2subblock=256,-Apf:ipa=9
-xprefetch=latx:3 -Qoption iropt -Rloop_dist
-xprofile
313.swim_l: -fast -openmp -m64 -xipo=2 -autopar -fma=fused
-xpagesize=512k -xprefetch=latx:3 -xprofile
315.mgrid_l: -fast -openmp -xipo=2 -xprefetch_level=3
-m64 -xcode=abs44 -xpagesize=512k -xprefetch=latx:4.8
-Qoption iropt -Apf:l2subblock=256 -xprofile
317.applu_l: -fast -xipo=2 -openmp -xautopar -m64
-fma=fused -xpagesize=4m -xprefetch=latx:2.8
-Qoption iropt -Rloop_dist -xunroll=3 -xprofile
321.equake_l: -fast -xopenmp -xprefetch_level=3 -xpagesize=64K
-xprefetch=latx:2 -xipo=2 -lmtmalloc
-W2,-Apf:l2subblock=256 -m64 -xprofile
325.apsi_l: -fast -openmp -m64 -xipo=2 -autopar -fma=fused
-xpagesize=4m -xprefetch=latx:3.4
-Qoption iropt -Rloop_dist -xprofile
327.gafort_l: -fast -openmp -xcode=abs44 -m64 -xipo=2
-autopar -fma=fused -xpagesize=512k -xprofile
329.fma3d_l: -fast -openmp -autopar -xipo=2 -fma=fused -m64
-unroll=6 -xprefetch=latx:4 -xpagesize=4m
-xprofile
331.art_l: -fast -xopenmp -xipo=2 -xprefetch_level=3 -m64
-xprefetch=latx:3 -xprofile
Alternate Source for Base and Peak:
315.mgrid_l: intel, correct an OpenMP coding standard problem.
Available as SPEC OMP alternative source:
ompl2001-mgrid-20071113.tar.gz
329.fam3d_l: sqrt.init, avoid a potential race condition.
Available as SPEC OMP alternative source:
ompl2001-fma3dsqrtinit-20070912.tar.gz
Alternate Source for Peak:
325.apsi_l: ompl.dd, change initial data distribution for WORK array.
Available as SPEC OMP alternative source:
ompl2001-dd-20040128.tar.gz
Feedback optimization (-xprofile) is done as follows,
unless otherwise noted:
fdo_pre0: rm -rf `pwd`/feedback.profile
PASS1: -xprofile=collect:./feedback
PASS2: -xprofile=use:./feedback
Base and Peak User Environment Settings:
unlimit stacksize (in /bin/csh)
setenv SUNW_MP_PROCBIND " 1 2 4 6 8 10 12 14
16 18 20 22 24 26 28 30 32 34 36 38 40 42 44 46
48 50 52 54 56 58 60 62 64 66 68 70 72 74 76 78
80 82 84 86 88 90 92 94 96 98 100 102 104 106 108
110 112 114 116 118 120 122 124 126 "
setenv SUNW_MP_THR_IDLE SPIN
setenv OMP_DYNAMIC FALSE
For a description of Sun Studio 12 Compiler flags, portability flags
and system parameters used to generate this result, please refer to
SUN-20080714-Studio-Solaris-sparc.txt file in the flags directory.
This result was measured on Sun SPARC Enterprise M8000.
The Sun SPARC Enterprise M8000 and the Fujitsu SPARC Enterprise
M8000 are electrically equivalent.
"CMU" = CPU/Memory Unit; each holds 2 or 4 CPU chips.
|