OMP2012 Result Flag Description

Base Optimization Flags

C benchmarks

- -O3
- OPTIMIZE
- Enables O2 optimizations plus more aggressive optimizations, such as prefetching, scalar replacement, and loop and memory access transformations. Enables optimizations for maximum speed, such as:
  - Loop unrolling, including instruction scheduling
  - Code replication to eliminate branches
  - Padding the size of certain power-of-two arrays to allow more efficient cache use.
  On IA-32 and Intel EM64T processors, when O3 is used with options -ax or -x (Linux) or with options /Qax or /Qx (Windows), the compiler performs more aggressive data dependency analysis than for O2, which may result in longer compilation times.
  The O3 optimizations may not cause higher performance unless loop and memory access transformations take place. The optimizations may slow down code in some cases compared to O2 optimizations.
  The O3 option is recommended for applications that have loops that heavily use floating-point calculations and process large data sets. On IA-32 Windows platforms, -O3 sets the following:
  
  /GF (/Qvc7 and above), /Gf (/Qvc6 and below), and /Ob2
- Includes:
  - -GF
  - -Gf
  - -Ob_n
  - -O2
    - -Oi-
    - -Gs
    - -Oy
    - -Gy
    - -Os
    - -GF
    - -Gf
    - -Ob_n
    - -Og
    - -O1
      
      -unroll_n
      
      -Oi-
      
      -Op-
      
      -Oy
      
      -Gy
      
      -Os
      
      -GF
      
      -Gf
      
      -Ob_n
      
      -Og
      
      -Ofast
    - -Ofast
  - -Ofast
- -qopenmp
- OPTIMIZE
- Enable the compiler to generate multi-threaded code based on the OpenMP* directives(New option.)
- -ipo1
- OPTIMIZE
- -ipo[n]
  Multi-file ip optimizations that includes:
  - inline function expansion
  - interprocedural constant propogation
  - dead code elimination
  - propagation of function characteristics
  - passing arguments in registers
  - loop-invariant code motion
  (n - number of multi-file objects)
- -xCORE-AVX512
- OPTIMIZE
- Code is optimized for Intel(R) processors with support for AVX512 instructions. The resulting code may contain unconditional use of features that are not supported on other processors. This option also enables new optimizations in addition to Intel processor-specific optimizations including advanced data layout and code restructuring optimizations to improve memory accesses for Intel processors.
  
  Do not use this option if you are executing a program on a processor that is not an Intel processor. If you use this option on a non-compatible processor to compile the main program (in Fortran) or the function main() in C/C++, the program will display a fatal run-time error if they are executed on unsupported processors.
- -qopt-zmm-usage=high
- OPTIMIZE
- Defines a level of zmm registers usage.
  -qopt-zmm-usage=keywoard Specifies the level of zmm register usage. You can specify one of the following:
  
  low - Tells the compiler that the compiled program is unlikely to benefit from zmm register usage. It specifies that the compiler should avoid using zmm register unless it can prove the gain from their usage.
  
  high - Tells the compiler to generate zmm code without restrictions
- -shared-intel
- OPTIMIZE
- Link Intel provided libraries dynamically
- -ansi-alias
- COPTIMIZE
- Enable/disable(DEFAULT) use of ANSI aliasing rules in optimizations; user asserts that the program adheres to these rules.

C++ benchmarks

- -O3
- OPTIMIZE
- Enables O2 optimizations plus more aggressive optimizations, such as prefetching, scalar replacement, and loop and memory access transformations. Enables optimizations for maximum speed, such as:
  - Loop unrolling, including instruction scheduling
  - Code replication to eliminate branches
  - Padding the size of certain power-of-two arrays to allow more efficient cache use.
  On IA-32 and Intel EM64T processors, when O3 is used with options -ax or -x (Linux) or with options /Qax or /Qx (Windows), the compiler performs more aggressive data dependency analysis than for O2, which may result in longer compilation times.
  The O3 optimizations may not cause higher performance unless loop and memory access transformations take place. The optimizations may slow down code in some cases compared to O2 optimizations.
  The O3 option is recommended for applications that have loops that heavily use floating-point calculations and process large data sets. On IA-32 Windows platforms, -O3 sets the following:
  
  /GF (/Qvc7 and above), /Gf (/Qvc6 and below), and /Ob2
- Includes:
  - -GF
  - -Gf
  - -Ob_n
  - -O2
    - -Oi-
    - -Gs
    - -Oy
    - -Gy
    - -Os
    - -GF
    - -Gf
    - -Ob_n
    - -Og
    - -O1
      
      -unroll_n
      
      -Oi-
      
      -Op-
      
      -Oy
      
      -Gy
      
      -Os
      
      -GF
      
      -Gf
      
      -Ob_n
      
      -Og
      
      -Ofast
    - -Ofast
  - -Ofast
- -qopenmp
- OPTIMIZE
- Enable the compiler to generate multi-threaded code based on the OpenMP* directives(New option.)
- -ipo1
- OPTIMIZE
- -ipo[n]
  Multi-file ip optimizations that includes:
  - inline function expansion
  - interprocedural constant propogation
  - dead code elimination
  - propagation of function characteristics
  - passing arguments in registers
  - loop-invariant code motion
  (n - number of multi-file objects)
- -xCORE-AVX512
- OPTIMIZE
- Code is optimized for Intel(R) processors with support for AVX512 instructions. The resulting code may contain unconditional use of features that are not supported on other processors. This option also enables new optimizations in addition to Intel processor-specific optimizations including advanced data layout and code restructuring optimizations to improve memory accesses for Intel processors.
  
  Do not use this option if you are executing a program on a processor that is not an Intel processor. If you use this option on a non-compatible processor to compile the main program (in Fortran) or the function main() in C/C++, the program will display a fatal run-time error if they are executed on unsupported processors.
- -qopt-zmm-usage=high
- OPTIMIZE
- Defines a level of zmm registers usage.
  -qopt-zmm-usage=keywoard Specifies the level of zmm register usage. You can specify one of the following:
  
  low - Tells the compiler that the compiled program is unlikely to benefit from zmm register usage. It specifies that the compiler should avoid using zmm register unless it can prove the gain from their usage.
  
  high - Tells the compiler to generate zmm code without restrictions
- -shared-intel
- OPTIMIZE
- Link Intel provided libraries dynamically
- -ansi-alias
- CXXOPTIMIZE
- Enable/disable(DEFAULT) use of ANSI aliasing rules in optimizations; user asserts that the program adheres to these rules.

Fortran benchmarks

- -O3
- OPTIMIZE
- Enables O2 optimizations plus more aggressive optimizations, such as prefetching, scalar replacement, and loop and memory access transformations. Enables optimizations for maximum speed, such as:
  - Loop unrolling, including instruction scheduling
  - Code replication to eliminate branches
  - Padding the size of certain power-of-two arrays to allow more efficient cache use.
  On IA-32 and Intel EM64T processors, when O3 is used with options -ax or -x (Linux) or with options /Qax or /Qx (Windows), the compiler performs more aggressive data dependency analysis than for O2, which may result in longer compilation times.
  The O3 optimizations may not cause higher performance unless loop and memory access transformations take place. The optimizations may slow down code in some cases compared to O2 optimizations.
  The O3 option is recommended for applications that have loops that heavily use floating-point calculations and process large data sets. On IA-32 Windows platforms, -O3 sets the following:
  
  /GF (/Qvc7 and above), /Gf (/Qvc6 and below), and /Ob2
- Includes:
  - -GF
  - -Gf
  - -Ob_n
  - -O2
    - -Oi-
    - -Gs
    - -Oy
    - -Gy
    - -Os
    - -GF
    - -Gf
    - -Ob_n
    - -Og
    - -O1
      
      -unroll_n
      
      -Oi-
      
      -Op-
      
      -Oy
      
      -Gy
      
      -Os
      
      -GF
      
      -Gf
      
      -Ob_n
      
      -Og
      
      -Ofast
    - -Ofast
  - -Ofast
- -qopenmp
- OPTIMIZE
- Enable the compiler to generate multi-threaded code based on the OpenMP* directives(New option.)
- -ipo1
- OPTIMIZE
- -ipo[n]
  Multi-file ip optimizations that includes:
  - inline function expansion
  - interprocedural constant propogation
  - dead code elimination
  - propagation of function characteristics
  - passing arguments in registers
  - loop-invariant code motion
  (n - number of multi-file objects)
- -xCORE-AVX512
- OPTIMIZE
- Code is optimized for Intel(R) processors with support for AVX512 instructions. The resulting code may contain unconditional use of features that are not supported on other processors. This option also enables new optimizations in addition to Intel processor-specific optimizations including advanced data layout and code restructuring optimizations to improve memory accesses for Intel processors.
  
  Do not use this option if you are executing a program on a processor that is not an Intel processor. If you use this option on a non-compatible processor to compile the main program (in Fortran) or the function main() in C/C++, the program will display a fatal run-time error if they are executed on unsupported processors.
- -qopt-zmm-usage=high
- OPTIMIZE
- Defines a level of zmm registers usage.
  -qopt-zmm-usage=keywoard Specifies the level of zmm register usage. You can specify one of the following:
  
  low - Tells the compiler that the compiled program is unlikely to benefit from zmm register usage. It specifies that the compiler should avoid using zmm register unless it can prove the gain from their usage.
  
  high - Tells the compiler to generate zmm code without restrictions
- -shared-intel
- OPTIMIZE
- Link Intel provided libraries dynamically
- -align array64byte
- FOPTIMIZE
- specify how data items are aligned
  keywords: all (same as -align), none (same as -noalign),
  [no]commons, [no]dcommons,
  [no]qcommons, [no]zcommons,
  rec1byte, rec2byte, rec4byte,
  rec8byte, rec16byte, rec32byte,
  array8byte, array16byte, array32byte,
  array64byte, array128byte, array256byte,
  [no]records, [no]sequence

Implicitly Included Flags

This section contains descriptions of flags that were included implicitly by other flags, but which do not have a permanent home at SPEC.

Shell, Environment, and Other Software Settings

This result has been formatted using multiple flags files. The "sw environment" from each of them appears next.

Sw environment from Supermicro-ic2022.linux64-oneAPI

SPEC OMP2012 Flag Description for the Intel(R) C/C++ Compiler for IA32 and Intel 64 applications and Intel(R) Fortran Compiler for IA32 and Intel 64 applications

Open MP Tuning Flags

Sw environment from Supermicro-Platform-Settings-V1.2-SPR-revF

SPEC CPU2017 Platform Settings for Supermicro Systems

One or more of the following settings may have been applied to the testbed. If so, the "Platform Notes" section of the report will say so; and you can read below to find out more about what these settings mean.

Syntax
KMP_AFFINITY=[<modifier>,...]<type>[,<permute>][,<offset>]

Argument	Default	Description
modifier	noverbose respect granularity=core	Optional. String consisting of keyword and specifier. granularity=<specifier> takes the following specifiers: fine, thread, and core norespect noverbose nowarnings proclist={<proc-list>} respect verbose warnings
type	none	Required string. Indicates the thread affinity to use. compact disabled explicit none scatter logical (deprecated; instead use compact, but omit any permute value) physical (deprecated; instead use scatter, possibly with an offset value) The logical and physical types are deprecated but supported for backward compatibility.
permute	0	Optional. Positive integer value. Not valid with type values of explicit, none, or disabled.
offset	0	Optional. Positive integer value. Not valid with type values of explicit, none, or disabled.

LD_LIBRARY_PATH=<directories> (linker)
LD_LIBRARY_PATH controls the search order for both the compile-time and run-time linkers. Usually, it can be defaulted; but testers may sometimes choose to explicitly set it (as documented in the notes in the submission), in order to ensure that the correct versions of libraries are picked up.

STACKSIZE=<n>
Set the size of the stack (temporary storage area) for each slave thread of a multithreaded program.

ulimit -s <n>
Sets the stack size to n kbytes, or "unlimited" to allow the stack size to grow without limit.

Operating System Tuning Parameters

Firmware / BIOS / Microcode Settings

For questions about the meanings of these flags, please contact the tester.
For other inquiries, please contact webmaster@spec.org
Copyright 2012-2023 Standard Performance Evaluation Corporation
Tested with SPEC OMP2012 v1.1.
Report generated on Thu Dec 14 09:35:33 2023 by SPEC OMP2012 flags formatter v538.

	Indicates that the flag description came from the user flags file.
	Indicates that the flag description came from the suite-wide flags file.
	Indicates that the flag description came from a per-benchmark flags file.

OMP2012 Flag DescriptionSupermicro SuperServer SYS-221H-TN24R (INTEL XEON PLATINUM 8592+)

Base Compiler Invocation

Base Portability Flags

Base Optimization Flags

Implicitly Included Flags

Sw environment from Supermicro-ic2022.linux64-oneAPI

SPEC OMP2012 Flag Description for the Intel(R) C/C++ Compiler for IA32 and Intel 64 applications and Intel(R) Fortran Compiler for IA32 and Intel 64 applications

Syntax

Default

Description

Affinity Types

type = none (default)

type = compact

type = disabled

type = explicit

type = scatter

Deprecated Types: logical and physical

Permute and offset combinations

Modifier Values for Affinity Types

modifier = noverbose (default)

modifier = verbose

Execution modes

Serial

Turnaround

Throughput

Sw environment from Supermicro-Platform-Settings-V1.2-SPR-revF

SPEC CPU2017 Platform Settings for Supermicro Systems

OMP2012 Flag Description
Supermicro SuperServer SYS-221H-TN24R (INTEL XEON PLATINUM 8592+)