Compilers: Intel Compilers for C++ and Fortran, Version 9.1
for IA32/EM64T-based applications in Linux_x64
Operating system: SuSE Linux Enterprise Server 9, SP3 (x86_64)
Selecting one of the following will take you directly to that section:
optimize for speed, but disable some optimizations which increase code size for a small speed benefit. Includes inline expansion except for intrinsic functions, global optimizations, string pooling optimizations. On Linux platforms, -O1 sets the following:
-unroll0, -fno-builtin -mno-ieee-fp -fomit-frame-pointer -ffunction-sections
This is the default level of optimization. Optimizes for speed. The -O2 option includes O1 optimizations and in addition enables inlining of intrinsics and more speed optimizations.
Builds on -01 and -02 optimizations by enabling high-level optimization. This level does not guarantee higher performance unless loop and memory access transformation take place. In conjunction with -axK/-xK and -axW/-xW, this switch causes the compiler to perform more aggressive data dependency analysis than for -O2. This may result in longer compilation times. On Linux platforms, -O3 sets the following:
-fp
enable single-file IP optimizations (within files, same as -Ob2)
Multi-file ip optimizations that includes:
- inline function expansion
- interprocedural constant propogation
- dead code elimination
- propagation of function characteristics
- passing arguments in registers
- loop-invariant code motion
The -fast option enhances execution speed across the entire program by including the following options that can improve run-time performance:
-O3 (maximum speed and high-level optimizations)
-ipo (enables interprocedural optimizations across files)
-xP (generate code specialized for Intel Pentium 4 processor and compatible Intel processors with Streaming SIMD Extensions 3)
-no-prec-div (disable -prec-div) where -prec-div improves precision of FP divides (some speed impact)
-static (statically link libraries during compilation)
To override one of the options set by /fast, specify that option after the -fast option on the command line. The options set by /fast may change from release to release.
-L
when used as an EXTRA_CLIB or EXTRA_CXXLIB variable,
results in linking with MicroQuill's SmartHeap 8.1 (32-bit) library
for Linux. This is a library that optimizes calls to new, delete, malloc and free.
The -xN and -xP options target your program to run on Intel Pentium 4 and compatible Intel processors. The resulting code might contain unconditional use of features that are not supported on other processors. Programs, where the function main() is compiled with this option, will detect non compatible processors and generate an error message during execution. This option also enables new optimizations in addition to Intel processor specific optimizations.
These options also enable advanced data layout and code restructuring optimizations to improve memory accesses for Intel processors.
Allows the compiler to perform optimizations in FP divides that may impact precision while improving speed.
Instrument program for profiling for the first phase of two-phase profile guided otimization. This instrumentation gathers information about a program's execution paths and data values but does not gather information from hardware performance counters. The profile instrumentation also gathers data for optimizations which are unique to profile-feedback optimization.
Instructs the compiler to produce a profile-optimized
executable and merges available dynamic information (.dyn)
files into a pgopti.dpi file. If you perform multiple
executions of the instrumented program, -prof_use merges
the dynamic information files again and overwrites the
previous pgopti.dpi file.
Without any other options, the current directory is
searched for .dyn files
This option instructs the compiler to analyze the program to
determine if there are 64-bit pointers which can be safely
shrunk into 32-bit pointers. In order for this option to be
effective the compiler must be able to optimize using the
-ipo option, and must be able to analyze all library/external
calls the program makes. This option imposes the following
restrictions on the program:
The program cannot malloc any objects greater than 2**31 bytes
in size.
If the program does not satisfy this restriction, unpredictable
behavior may occur.
Enables improved floating-point consistency. Floating-point operations are not reordered and the result of each floating-point operation is stored in the target variable rather than being kept in the floating- point processor for use in a subsequent calculation. This is the same as specifying -fltconsistency or -mp.
The default, -mno-ieee-fp, provides better accuracy and run-time per- formance at the expense of less consistent floating-point results.
Enable using EBP as general purpose register.
Separate functions for the linker (COMDAT). Same as -fdata-sections.
Tells the compiler the maximum number of times to unroll loops.
Disables inline expansion of all intrinsic functions.
Disable using EBP as general purpose register.
This option prevents linking with shared libraries. It causes the executable to link all libraries statically.
For mixed-language benchmarks, tell the compiler to ignore the main routine in Fortran
/opt/intel/fc/9.1.045/bin/ifort invokes the 32-bit Intel Fortran compiler.
Also used to invoke linker for 32-bit programs
in Fortran and C/Fortran mixtures.
Invoke the 64-bit Intel Fortran compiler.
Also used to invoke linker for 64-bit programs
in Fortran and C/Fortran mixtures.
/opt/intel/cc/9.1.049/bin/icc invokes the 32-bit Intel C compiler.
Also used to invoke linker for 32-bit C programs.
/opt/intel/cce/9.1.049/bin/icc invokes the 64-bit Intel C compiler.
Also used to invoke linker for 64-bit C programs.
/opt/intel/cc/9.1.049/bin/icpc invokes the 32-bit Intel C++ compiler.
Also used to invoke linker for 32-bit C++ programs.
Invoke the Intel C compiler.
Also used to invoke linker for C programs.
Invoke the Intel C++ compiler.
Also used to invoke linker for C++ programs.
-I/opt/intel/cc/9.1.049/include is the include path of 32-bit Intel C/C++ compiler
-I/opt/intel/cce/9.1.049/include is the include path of 64-bit Intel C/C++ compiler
-L/opt/intel/cc/9.1.049/lib is the library path of 32-bit Intel C/C++ compiler
-L/opt/intel/cce/9.1.049/lib is the library path of 64-bit Intel C/C++ compiler
-I/opt/intel/fc/9.1.045/include is the include path of 32-bit Intel Fortran compiler
-L/opt/intel/fc/9.1.045/lib is the library path of 32-bit Intel Fortran compiler
Platform settings
One or more of the following settings may have been set. If so, the "Platform Notes" section of the report will say so; and you can read below to find out more about what these settings mean.
Power Regulator for ProLiant support (Default=HP Dynamic Power Savings Mode)
Values for this BIOS setting can be:
Adjacent Sector Prefetch (Default = Enabled):
This BIOS option allows the enabling/disabling of a processor mechanism to fetch the adjacent cache line within an 128-byte sector that contains the data needed due to a cache line miss.
In some limited cases, setting this option to Disabled may improve performance. In the majority of cases, the default value of Enabled provides better performance. Users should only disable this option after performing application benchmarking to verify improved performance in their environment.
submit= MYMASK=`printf '0x%x' \$((1<<\$SPECCOPYNUM))`; /usr/bin/taskset \$MYMASK $command
When running multiple copies of benchmarks, the SPEC config file feature submit is sometimes used to cause individual jobs to be bound to specific processors. This specific submit command is used for Linux. The description of the elements of the command are:
ulimit -s
Sets the stack size to n kbytes, or unlimited to allow the stack size to grow without limit.