OMP2012 Result Flag Description

Base Optimization Flags

C benchmarks

- -O3
- COPTIMIZE, OPTIMIZE
- All level 1 and 2 optimizations are performed. In addition, this level enables more aggressive code hoisting and scalar replacement optimizations that may or may not be profitable.
- -tp=zen
- COPTIMIZE, OPTIMIZE
- Specify the type(s) of the target processors(s). The PGI compilers produce code specifically targeted to the type of processor on which the compilation is performed. In particular, the default is to use all supported instructions wherever possible when compiling on a given system. The default target processor is auto-selected depending on the processor on which the compilation is performed. You can specify a target processor to compile for a different processor type, such as to select a more generic processor, allowing the code to run on more system types. Specifying two or more target processors enables unified binary code generation, where two or more versions of each function may be generated, each version optimized for the specific instruction set available in each target processor. Executables created on a given system without the -tp flag may not be usable on previous generation systems. For example, executables created on an Intel Skylake processor may use instructions that are not available on earlier Intel Sandy Bridge systems. The following list contains the possible suboptions for -?tp and the processors that each suboption is intended to target. px - generate code that is usable on any x86-64 processor-based system. bulldozer - generate code for AMD Bulldozer and compatible processors. piledriver - generate code that is usable on any AMD Piledriver processor-based system. zen - generate code that is usable on any AMD Zen processor-based system (Epyc, Ryzen). sandybridge - generate code for Intel Sandy Bridge and compatible processors. haswell - generate code that is usable on any Intel Haswell processor-based system. knl - generate code that is usable on any Intel Knights Landing processor-based system. skylake - generate code that is usable on an Intel Skylake Xeon processor-based system.
- -mp
- COPTIMIZE, OPTIMIZE
- Instructs the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives, and to generate an executable file which will utilize multiple processors in a shared-memory parallel system. Use the -mp option to instruct the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives and to generate an executable file which utilizes multiple processors in a shared-memory parallel system. The suboptions are one or more of the following: align Forces loop iterations to be allocated to OpenMP processes using an algorithm that maximizes alignment of vector sub-sections in loops that are both parallelized and vectorized for SSE. This allocation can improve performance in program units that include many such loops. It can also result in load-balancing problems that significantly decrease performance in program units with relatively short loops that contain a large amount of work in each iteration. The numa suboption uses libnuma on systems where it is available. allcores Instructs the compiler to target all available cores. You specify this suboption at link time. bind Instructs the compiler to bind threads to cores. You specify this suboption at link time. [no]numa Uses [does not use] libnuma on systems where it is available. For a detailed description of this programming model and the associated directives, refer to Section 9, Using OpenMP of the PGI Compiler User's Guide. To set this option in PVF, use the Fortran | Language | Enable OpenMP Directives property, described in Enable OpenMP Directives.
- -m64
- COPTIMIZE, OPTIMIZE
- Use the 64-bit compiler for the default processor type.
  Description Use this option to specify the 64-bit compiler as the default processor type.
- -fast
- COPTIMIZE, OPTIMIZE
- This options creates a generally optimal set of flags for targets that support SIMD capability. It incorporates optimization options to enable use of vector streaming SIMD instructions (64-bit targets) and enable vectorization with SIMD instructions, cache aligned and flushz.
- -Mmovnt
- COPTIMIZE, OPTIMIZE
- The –Mmovnt option instructs the compiler to generate non-temporal stores and prefetch instructions, even when it cannot determine that this is beneficial.
- -Mfprelaxed
- COPTIMIZE, OPTIMIZE
- Instructs the compiler to use [not use] relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy.
- -Masmkeyword
- COPTIMIZE
- Instructs the compiler to allow the asm keyword in C source files. The syntax of the asm statement is as follows: asm("statement"); Where statement is a legal assembly-language statement. The quote marks are required. Note:The current default is to support gcc's extended asm, where the syntax of extended asm includes asm strings. The -M[no]asmkeyword switch is useful only if the target device is a Pentium 3 or older cpu type (-tp piii|p6|k7|athlon|athlonxp|px).
- -Mnosingle
- COPTIMIZE
- Instructs the compiler to convert float parameters to double parameters in non-prototyped functions.
- -Mschar
- COPTIMIZE
- Specifies signed char characters. The compiler treats "plain" char declarations as signed char.
- -Mnom128
- COPTIMIZE
- instructs the compiler to recognize [ignore] __m128, __m128d, and __m128i datatypes. floating-point constants as float data types, instead of double data types. This option can improve the performance of single-precision code.

C++ benchmarks

- -O3
- CXXOPTIMIZE, OPTIMIZE
- All level 1 and 2 optimizations are performed. In addition, this level enables more aggressive code hoisting and scalar replacement optimizations that may or may not be profitable.
- -tp=zen
- CXXOPTIMIZE, OPTIMIZE
- Specify the type(s) of the target processors(s). The PGI compilers produce code specifically targeted to the type of processor on which the compilation is performed. In particular, the default is to use all supported instructions wherever possible when compiling on a given system. The default target processor is auto-selected depending on the processor on which the compilation is performed. You can specify a target processor to compile for a different processor type, such as to select a more generic processor, allowing the code to run on more system types. Specifying two or more target processors enables unified binary code generation, where two or more versions of each function may be generated, each version optimized for the specific instruction set available in each target processor. Executables created on a given system without the -tp flag may not be usable on previous generation systems. For example, executables created on an Intel Skylake processor may use instructions that are not available on earlier Intel Sandy Bridge systems. The following list contains the possible suboptions for -?tp and the processors that each suboption is intended to target. px - generate code that is usable on any x86-64 processor-based system. bulldozer - generate code for AMD Bulldozer and compatible processors. piledriver - generate code that is usable on any AMD Piledriver processor-based system. zen - generate code that is usable on any AMD Zen processor-based system (Epyc, Ryzen). sandybridge - generate code for Intel Sandy Bridge and compatible processors. haswell - generate code that is usable on any Intel Haswell processor-based system. knl - generate code that is usable on any Intel Knights Landing processor-based system. skylake - generate code that is usable on an Intel Skylake Xeon processor-based system.
- -mp
- CXXOPTIMIZE, OPTIMIZE
- Instructs the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives, and to generate an executable file which will utilize multiple processors in a shared-memory parallel system. Use the -mp option to instruct the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives and to generate an executable file which utilizes multiple processors in a shared-memory parallel system. The suboptions are one or more of the following: align Forces loop iterations to be allocated to OpenMP processes using an algorithm that maximizes alignment of vector sub-sections in loops that are both parallelized and vectorized for SSE. This allocation can improve performance in program units that include many such loops. It can also result in load-balancing problems that significantly decrease performance in program units with relatively short loops that contain a large amount of work in each iteration. The numa suboption uses libnuma on systems where it is available. allcores Instructs the compiler to target all available cores. You specify this suboption at link time. bind Instructs the compiler to bind threads to cores. You specify this suboption at link time. [no]numa Uses [does not use] libnuma on systems where it is available. For a detailed description of this programming model and the associated directives, refer to Section 9, Using OpenMP of the PGI Compiler User's Guide. To set this option in PVF, use the Fortran | Language | Enable OpenMP Directives property, described in Enable OpenMP Directives.
- -m64
- CXXOPTIMIZE, OPTIMIZE
- Use the 64-bit compiler for the default processor type.
  Description Use this option to specify the 64-bit compiler as the default processor type.
- -fast
- CXXOPTIMIZE, OPTIMIZE
- This options creates a generally optimal set of flags for targets that support SIMD capability. It incorporates optimization options to enable use of vector streaming SIMD instructions (64-bit targets) and enable vectorization with SIMD instructions, cache aligned and flushz.
- -Mmovnt
- CXXOPTIMIZE, OPTIMIZE
- The –Mmovnt option instructs the compiler to generate non-temporal stores and prefetch instructions, even when it cannot determine that this is beneficial.
- -Mfprelaxed
- CXXOPTIMIZE, OPTIMIZE
- Instructs the compiler to use [not use] relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy.
- -Mnoasmkeyword
- CXXOPTIMIZE
- Instructs the compiler to allow the asm keyword in C source files. The syntax of the asm statement is as follows: asm("statement"); Where statement is a legal assembly-language statement. The quote marks are required. Note:The current default is to support gcc's extended asm, where the syntax of extended asm includes asm strings. The -M[no]asmkeyword switch is useful only if the target device is a Pentium 3 or older cpu type (-tp piii|p6|k7|athlon|athlonxp|px).

Fortran benchmarks

350.md

- -O3
- FOPTIMIZE, OPTIMIZE
- All level 1 and 2 optimizations are performed. In addition, this level enables more aggressive code hoisting and scalar replacement optimizations that may or may not be profitable.
- -tp=zen
- FOPTIMIZE, OPTIMIZE
- Specify the type(s) of the target processors(s). The PGI compilers produce code specifically targeted to the type of processor on which the compilation is performed. In particular, the default is to use all supported instructions wherever possible when compiling on a given system. The default target processor is auto-selected depending on the processor on which the compilation is performed. You can specify a target processor to compile for a different processor type, such as to select a more generic processor, allowing the code to run on more system types. Specifying two or more target processors enables unified binary code generation, where two or more versions of each function may be generated, each version optimized for the specific instruction set available in each target processor. Executables created on a given system without the -tp flag may not be usable on previous generation systems. For example, executables created on an Intel Skylake processor may use instructions that are not available on earlier Intel Sandy Bridge systems. The following list contains the possible suboptions for -?tp and the processors that each suboption is intended to target. px - generate code that is usable on any x86-64 processor-based system. bulldozer - generate code for AMD Bulldozer and compatible processors. piledriver - generate code that is usable on any AMD Piledriver processor-based system. zen - generate code that is usable on any AMD Zen processor-based system (Epyc, Ryzen). sandybridge - generate code for Intel Sandy Bridge and compatible processors. haswell - generate code that is usable on any Intel Haswell processor-based system. knl - generate code that is usable on any Intel Knights Landing processor-based system. skylake - generate code that is usable on an Intel Skylake Xeon processor-based system.
- -mp
- FOPTIMIZE, OPTIMIZE
- Instructs the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives, and to generate an executable file which will utilize multiple processors in a shared-memory parallel system. Use the -mp option to instruct the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives and to generate an executable file which utilizes multiple processors in a shared-memory parallel system. The suboptions are one or more of the following: align Forces loop iterations to be allocated to OpenMP processes using an algorithm that maximizes alignment of vector sub-sections in loops that are both parallelized and vectorized for SSE. This allocation can improve performance in program units that include many such loops. It can also result in load-balancing problems that significantly decrease performance in program units with relatively short loops that contain a large amount of work in each iteration. The numa suboption uses libnuma on systems where it is available. allcores Instructs the compiler to target all available cores. You specify this suboption at link time. bind Instructs the compiler to bind threads to cores. You specify this suboption at link time. [no]numa Uses [does not use] libnuma on systems where it is available. For a detailed description of this programming model and the associated directives, refer to Section 9, Using OpenMP of the PGI Compiler User's Guide. To set this option in PVF, use the Fortran | Language | Enable OpenMP Directives property, described in Enable OpenMP Directives.
- -m64
- FOPTIMIZE, OPTIMIZE
- Use the 64-bit compiler for the default processor type.
  Description Use this option to specify the 64-bit compiler as the default processor type.
- -fast
- FOPTIMIZE, OPTIMIZE
- This options creates a generally optimal set of flags for targets that support SIMD capability. It incorporates optimization options to enable use of vector streaming SIMD instructions (64-bit targets) and enable vectorization with SIMD instructions, cache aligned and flushz.
- -Mmovnt
- FOPTIMIZE, OPTIMIZE
- The –Mmovnt option instructs the compiler to generate non-temporal stores and prefetch instructions, even when it cannot determine that this is beneficial.
- -Mfprelaxed
- FOPTIMIZE, OPTIMIZE
- Instructs the compiler to use [not use] relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy.
- -Mallocatable=95
- FOPTIMIZE
- Controls whether Fortran 95 or Fortran 2003 semantics are used in allocatable array assignments. The default behavior is to use Fortran 95 semantics; the 03 option instructs the compiler to use Fortran 2003 semantics.
- -Mdefaultunit
- FOPTIMIZE
- Instructs the compiler to treat "*" as a synonym for standard input for reading and standard output for writing.
- -Mnostride0
- FOPTIMIZE
- Instructs the compiler to perform certain optimizations and to disallow for stride 0 array references.
- -Mnoiomutex
- FOPTIMIZE
- Instructs the compiler not to generate critical section calls around Fortran I/O statements.
- -Mcray=pointer
- FOPTIMIZE
- (Fortran only) Force Cray Fortran (CF77) compatibility with respect to the listed options. Possible values of option include:
  pointer
  
  for purposes of optimization, it is assumed that pointer-based variables do not overlay the storage of any other ariable.

362.fma3d

- -O3
- FOPTIMIZE, OPTIMIZE
- All level 1 and 2 optimizations are performed. In addition, this level enables more aggressive code hoisting and scalar replacement optimizations that may or may not be profitable.
- -tp=zen
- FOPTIMIZE, OPTIMIZE
- Specify the type(s) of the target processors(s). The PGI compilers produce code specifically targeted to the type of processor on which the compilation is performed. In particular, the default is to use all supported instructions wherever possible when compiling on a given system. The default target processor is auto-selected depending on the processor on which the compilation is performed. You can specify a target processor to compile for a different processor type, such as to select a more generic processor, allowing the code to run on more system types. Specifying two or more target processors enables unified binary code generation, where two or more versions of each function may be generated, each version optimized for the specific instruction set available in each target processor. Executables created on a given system without the -tp flag may not be usable on previous generation systems. For example, executables created on an Intel Skylake processor may use instructions that are not available on earlier Intel Sandy Bridge systems. The following list contains the possible suboptions for -?tp and the processors that each suboption is intended to target. px - generate code that is usable on any x86-64 processor-based system. bulldozer - generate code for AMD Bulldozer and compatible processors. piledriver - generate code that is usable on any AMD Piledriver processor-based system. zen - generate code that is usable on any AMD Zen processor-based system (Epyc, Ryzen). sandybridge - generate code for Intel Sandy Bridge and compatible processors. haswell - generate code that is usable on any Intel Haswell processor-based system. knl - generate code that is usable on any Intel Knights Landing processor-based system. skylake - generate code that is usable on an Intel Skylake Xeon processor-based system.
- -mp
- FOPTIMIZE, OPTIMIZE
- Instructs the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives, and to generate an executable file which will utilize multiple processors in a shared-memory parallel system. Use the -mp option to instruct the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives and to generate an executable file which utilizes multiple processors in a shared-memory parallel system. The suboptions are one or more of the following: align Forces loop iterations to be allocated to OpenMP processes using an algorithm that maximizes alignment of vector sub-sections in loops that are both parallelized and vectorized for SSE. This allocation can improve performance in program units that include many such loops. It can also result in load-balancing problems that significantly decrease performance in program units with relatively short loops that contain a large amount of work in each iteration. The numa suboption uses libnuma on systems where it is available. allcores Instructs the compiler to target all available cores. You specify this suboption at link time. bind Instructs the compiler to bind threads to cores. You specify this suboption at link time. [no]numa Uses [does not use] libnuma on systems where it is available. For a detailed description of this programming model and the associated directives, refer to Section 9, Using OpenMP of the PGI Compiler User's Guide. To set this option in PVF, use the Fortran | Language | Enable OpenMP Directives property, described in Enable OpenMP Directives.
- -m64
- FOPTIMIZE, OPTIMIZE
- Use the 64-bit compiler for the default processor type.
  Description Use this option to specify the 64-bit compiler as the default processor type.
- -fast
- FOPTIMIZE, OPTIMIZE
- This options creates a generally optimal set of flags for targets that support SIMD capability. It incorporates optimization options to enable use of vector streaming SIMD instructions (64-bit targets) and enable vectorization with SIMD instructions, cache aligned and flushz.
- -Mmovnt
- FOPTIMIZE, OPTIMIZE
- The –Mmovnt option instructs the compiler to generate non-temporal stores and prefetch instructions, even when it cannot determine that this is beneficial.
- -Mfprelaxed
- FOPTIMIZE, OPTIMIZE
- Instructs the compiler to use [not use] relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy.
- -Mallocatable=95
- FOPTIMIZE
- Controls whether Fortran 95 or Fortran 2003 semantics are used in allocatable array assignments. The default behavior is to use Fortran 95 semantics; the 03 option instructs the compiler to use Fortran 2003 semantics.
- -Mdefaultunit
- FOPTIMIZE
- Instructs the compiler to treat "*" as a synonym for standard input for reading and standard output for writing.
- -Mnostride0
- FOPTIMIZE
- Instructs the compiler to perform certain optimizations and to disallow for stride 0 array references.
- -Mnoiomutex
- FOPTIMIZE
- Instructs the compiler not to generate critical section calls around Fortran I/O statements.
- -Mcray=pointer
- FOPTIMIZE
- (Fortran only) Force Cray Fortran (CF77) compatibility with respect to the listed options. Possible values of option include:
  pointer
  
  for purposes of optimization, it is assumed that pointer-based variables do not overlay the storage of any other ariable.
- -Mnoupcase
- FOPTIMIZE
- Instructs the compiler to convert all identifiers to lower case. This selection affects the linking process. If you compile and link the same source code using-Mupcase on one occasion and -Mnoupcase on another, you may get two different executables, depending on whether the source contains uppercase letters. The standard libraries are compiled using -Mnoupcase.
- -pgf90libs
- FOPTIMIZE
- Use this option to instruct the compiler to append PGF90/PGF95/PGFORTRAN runtime libraries to the link line.

Peak Optimization Flags

C benchmarks

352.nab

- -O3
- COPTIMIZE, OPTIMIZE
- All level 1 and 2 optimizations are performed. In addition, this level enables more aggressive code hoisting and scalar replacement optimizations that may or may not be profitable.
- -tp=zen
- COPTIMIZE, OPTIMIZE
- Specify the type(s) of the target processors(s). The PGI compilers produce code specifically targeted to the type of processor on which the compilation is performed. In particular, the default is to use all supported instructions wherever possible when compiling on a given system. The default target processor is auto-selected depending on the processor on which the compilation is performed. You can specify a target processor to compile for a different processor type, such as to select a more generic processor, allowing the code to run on more system types. Specifying two or more target processors enables unified binary code generation, where two or more versions of each function may be generated, each version optimized for the specific instruction set available in each target processor. Executables created on a given system without the -tp flag may not be usable on previous generation systems. For example, executables created on an Intel Skylake processor may use instructions that are not available on earlier Intel Sandy Bridge systems. The following list contains the possible suboptions for -?tp and the processors that each suboption is intended to target. px - generate code that is usable on any x86-64 processor-based system. bulldozer - generate code for AMD Bulldozer and compatible processors. piledriver - generate code that is usable on any AMD Piledriver processor-based system. zen - generate code that is usable on any AMD Zen processor-based system (Epyc, Ryzen). sandybridge - generate code for Intel Sandy Bridge and compatible processors. haswell - generate code that is usable on any Intel Haswell processor-based system. knl - generate code that is usable on any Intel Knights Landing processor-based system. skylake - generate code that is usable on an Intel Skylake Xeon processor-based system.
- -mp
- COPTIMIZE, OPTIMIZE
- Instructs the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives, and to generate an executable file which will utilize multiple processors in a shared-memory parallel system. Use the -mp option to instruct the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives and to generate an executable file which utilizes multiple processors in a shared-memory parallel system. The suboptions are one or more of the following: align Forces loop iterations to be allocated to OpenMP processes using an algorithm that maximizes alignment of vector sub-sections in loops that are both parallelized and vectorized for SSE. This allocation can improve performance in program units that include many such loops. It can also result in load-balancing problems that significantly decrease performance in program units with relatively short loops that contain a large amount of work in each iteration. The numa suboption uses libnuma on systems where it is available. allcores Instructs the compiler to target all available cores. You specify this suboption at link time. bind Instructs the compiler to bind threads to cores. You specify this suboption at link time. [no]numa Uses [does not use] libnuma on systems where it is available. For a detailed description of this programming model and the associated directives, refer to Section 9, Using OpenMP of the PGI Compiler User's Guide. To set this option in PVF, use the Fortran | Language | Enable OpenMP Directives property, described in Enable OpenMP Directives.
- -m64
- COPTIMIZE, OPTIMIZE
- Use the 64-bit compiler for the default processor type.
  Description Use this option to specify the 64-bit compiler as the default processor type.
- -fast
- COPTIMIZE, OPTIMIZE
- This options creates a generally optimal set of flags for targets that support SIMD capability. It incorporates optimization options to enable use of vector streaming SIMD instructions (64-bit targets) and enable vectorization with SIMD instructions, cache aligned and flushz.
- -Mmovnt
- COPTIMIZE, OPTIMIZE
- The –Mmovnt option instructs the compiler to generate non-temporal stores and prefetch instructions, even when it cannot determine that this is beneficial.
- -Mfprelaxed
- COPTIMIZE, OPTIMIZE
- Instructs the compiler to use [not use] relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy.
- -Masmkeyword
- COPTIMIZE
- Instructs the compiler to allow the asm keyword in C source files. The syntax of the asm statement is as follows: asm("statement"); Where statement is a legal assembly-language statement. The quote marks are required. Note:The current default is to support gcc's extended asm, where the syntax of extended asm includes asm strings. The -M[no]asmkeyword switch is useful only if the target device is a Pentium 3 or older cpu type (-tp piii|p6|k7|athlon|athlonxp|px).
- -Mnosingle
- COPTIMIZE
- Instructs the compiler to convert float parameters to double parameters in non-prototyped functions.
- -Mschar
- COPTIMIZE
- Specifies signed char characters. The compiler treats "plain" char declarations as signed char.
- -Mnom128
- COPTIMIZE
- instructs the compiler to recognize [ignore] __m128, __m128d, and __m128i datatypes. floating-point constants as float data types, instead of double data types. This option can improve the performance of single-precision code.
- -alias=ansi
- COPTIMIZE
- select optimizations based on type-based pointer alias rules in C and C++.
  ansi
  
  Enable optimizations using ANSI C type-based pointer disambiguation.
  
  traditional
  
  Disable type-based pointer disambiguation.

372.smithwa

- -O3
- COPTIMIZE, OPTIMIZE
- All level 1 and 2 optimizations are performed. In addition, this level enables more aggressive code hoisting and scalar replacement optimizations that may or may not be profitable.
- -tp=zen
- COPTIMIZE, OPTIMIZE
- Specify the type(s) of the target processors(s). The PGI compilers produce code specifically targeted to the type of processor on which the compilation is performed. In particular, the default is to use all supported instructions wherever possible when compiling on a given system. The default target processor is auto-selected depending on the processor on which the compilation is performed. You can specify a target processor to compile for a different processor type, such as to select a more generic processor, allowing the code to run on more system types. Specifying two or more target processors enables unified binary code generation, where two or more versions of each function may be generated, each version optimized for the specific instruction set available in each target processor. Executables created on a given system without the -tp flag may not be usable on previous generation systems. For example, executables created on an Intel Skylake processor may use instructions that are not available on earlier Intel Sandy Bridge systems. The following list contains the possible suboptions for -?tp and the processors that each suboption is intended to target. px - generate code that is usable on any x86-64 processor-based system. bulldozer - generate code for AMD Bulldozer and compatible processors. piledriver - generate code that is usable on any AMD Piledriver processor-based system. zen - generate code that is usable on any AMD Zen processor-based system (Epyc, Ryzen). sandybridge - generate code for Intel Sandy Bridge and compatible processors. haswell - generate code that is usable on any Intel Haswell processor-based system. knl - generate code that is usable on any Intel Knights Landing processor-based system. skylake - generate code that is usable on an Intel Skylake Xeon processor-based system.
- -mp
- COPTIMIZE, OPTIMIZE
- Instructs the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives, and to generate an executable file which will utilize multiple processors in a shared-memory parallel system. Use the -mp option to instruct the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives and to generate an executable file which utilizes multiple processors in a shared-memory parallel system. The suboptions are one or more of the following: align Forces loop iterations to be allocated to OpenMP processes using an algorithm that maximizes alignment of vector sub-sections in loops that are both parallelized and vectorized for SSE. This allocation can improve performance in program units that include many such loops. It can also result in load-balancing problems that significantly decrease performance in program units with relatively short loops that contain a large amount of work in each iteration. The numa suboption uses libnuma on systems where it is available. allcores Instructs the compiler to target all available cores. You specify this suboption at link time. bind Instructs the compiler to bind threads to cores. You specify this suboption at link time. [no]numa Uses [does not use] libnuma on systems where it is available. For a detailed description of this programming model and the associated directives, refer to Section 9, Using OpenMP of the PGI Compiler User's Guide. To set this option in PVF, use the Fortran | Language | Enable OpenMP Directives property, described in Enable OpenMP Directives.
- -m64
- COPTIMIZE, OPTIMIZE
- Use the 64-bit compiler for the default processor type.
  Description Use this option to specify the 64-bit compiler as the default processor type.
- -fast
- COPTIMIZE, OPTIMIZE
- This options creates a generally optimal set of flags for targets that support SIMD capability. It incorporates optimization options to enable use of vector streaming SIMD instructions (64-bit targets) and enable vectorization with SIMD instructions, cache aligned and flushz.
- -Mmovnt
- COPTIMIZE, OPTIMIZE
- The –Mmovnt option instructs the compiler to generate non-temporal stores and prefetch instructions, even when it cannot determine that this is beneficial.
- -Mfprelaxed
- COPTIMIZE, OPTIMIZE
- Instructs the compiler to use [not use] relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy.
- -Masmkeyword
- COPTIMIZE
- Instructs the compiler to allow the asm keyword in C source files. The syntax of the asm statement is as follows: asm("statement"); Where statement is a legal assembly-language statement. The quote marks are required. Note:The current default is to support gcc's extended asm, where the syntax of extended asm includes asm strings. The -M[no]asmkeyword switch is useful only if the target device is a Pentium 3 or older cpu type (-tp piii|p6|k7|athlon|athlonxp|px).
- -Mnosingle
- COPTIMIZE
- Instructs the compiler to convert float parameters to double parameters in non-prototyped functions.
- -Mschar
- COPTIMIZE
- Specifies signed char characters. The compiler treats "plain" char declarations as signed char.
- -Mnom128
- COPTIMIZE
- instructs the compiler to recognize [ignore] __m128, __m128d, and __m128i datatypes. floating-point constants as float data types, instead of double data types. This option can improve the performance of single-precision code.

C++ benchmarks

- -O3
- CXXOPTIMIZE, OPTIMIZE
- All level 1 and 2 optimizations are performed. In addition, this level enables more aggressive code hoisting and scalar replacement optimizations that may or may not be profitable.
- -tp=zen
- CXXOPTIMIZE, OPTIMIZE
- Specify the type(s) of the target processors(s). The PGI compilers produce code specifically targeted to the type of processor on which the compilation is performed. In particular, the default is to use all supported instructions wherever possible when compiling on a given system. The default target processor is auto-selected depending on the processor on which the compilation is performed. You can specify a target processor to compile for a different processor type, such as to select a more generic processor, allowing the code to run on more system types. Specifying two or more target processors enables unified binary code generation, where two or more versions of each function may be generated, each version optimized for the specific instruction set available in each target processor. Executables created on a given system without the -tp flag may not be usable on previous generation systems. For example, executables created on an Intel Skylake processor may use instructions that are not available on earlier Intel Sandy Bridge systems. The following list contains the possible suboptions for -?tp and the processors that each suboption is intended to target. px - generate code that is usable on any x86-64 processor-based system. bulldozer - generate code for AMD Bulldozer and compatible processors. piledriver - generate code that is usable on any AMD Piledriver processor-based system. zen - generate code that is usable on any AMD Zen processor-based system (Epyc, Ryzen). sandybridge - generate code for Intel Sandy Bridge and compatible processors. haswell - generate code that is usable on any Intel Haswell processor-based system. knl - generate code that is usable on any Intel Knights Landing processor-based system. skylake - generate code that is usable on an Intel Skylake Xeon processor-based system.
- -mp
- CXXOPTIMIZE, OPTIMIZE
- Instructs the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives, and to generate an executable file which will utilize multiple processors in a shared-memory parallel system. Use the -mp option to instruct the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives and to generate an executable file which utilizes multiple processors in a shared-memory parallel system. The suboptions are one or more of the following: align Forces loop iterations to be allocated to OpenMP processes using an algorithm that maximizes alignment of vector sub-sections in loops that are both parallelized and vectorized for SSE. This allocation can improve performance in program units that include many such loops. It can also result in load-balancing problems that significantly decrease performance in program units with relatively short loops that contain a large amount of work in each iteration. The numa suboption uses libnuma on systems where it is available. allcores Instructs the compiler to target all available cores. You specify this suboption at link time. bind Instructs the compiler to bind threads to cores. You specify this suboption at link time. [no]numa Uses [does not use] libnuma on systems where it is available. For a detailed description of this programming model and the associated directives, refer to Section 9, Using OpenMP of the PGI Compiler User's Guide. To set this option in PVF, use the Fortran | Language | Enable OpenMP Directives property, described in Enable OpenMP Directives.
- -m64
- CXXOPTIMIZE, OPTIMIZE
- Use the 64-bit compiler for the default processor type.
  Description Use this option to specify the 64-bit compiler as the default processor type.
- -fast
- CXXOPTIMIZE, OPTIMIZE
- This options creates a generally optimal set of flags for targets that support SIMD capability. It incorporates optimization options to enable use of vector streaming SIMD instructions (64-bit targets) and enable vectorization with SIMD instructions, cache aligned and flushz.
- -Mmovnt
- CXXOPTIMIZE, OPTIMIZE
- The –Mmovnt option instructs the compiler to generate non-temporal stores and prefetch instructions, even when it cannot determine that this is beneficial.
- -Mfprelaxed
- CXXOPTIMIZE, OPTIMIZE
- Instructs the compiler to use [not use] relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy.
- -Mnoasmkeyword
- CXXOPTIMIZE
- Instructs the compiler to allow the asm keyword in C source files. The syntax of the asm statement is as follows: asm("statement"); Where statement is a legal assembly-language statement. The quote marks are required. Note:The current default is to support gcc's extended asm, where the syntax of extended asm includes asm strings. The -M[no]asmkeyword switch is useful only if the target device is a Pentium 3 or older cpu type (-tp piii|p6|k7|athlon|athlonxp|px).
- -Mnosingle
- CXXOPTIMIZE
- Instructs the compiler to convert float parameters to double parameters in non-prototyped functions.
- -alias=ansi
- CXXOPTIMIZE
- select optimizations based on type-based pointer alias rules in C and C++.
  ansi
  
  Enable optimizations using ANSI C type-based pointer disambiguation.
  
  traditional
  
  Disable type-based pointer disambiguation.

Fortran benchmarks

350.md

- -O3
- FOPTIMIZE, OPTIMIZE
- All level 1 and 2 optimizations are performed. In addition, this level enables more aggressive code hoisting and scalar replacement optimizations that may or may not be profitable.
- -tp=zen
- FOPTIMIZE, OPTIMIZE
- Specify the type(s) of the target processors(s). The PGI compilers produce code specifically targeted to the type of processor on which the compilation is performed. In particular, the default is to use all supported instructions wherever possible when compiling on a given system. The default target processor is auto-selected depending on the processor on which the compilation is performed. You can specify a target processor to compile for a different processor type, such as to select a more generic processor, allowing the code to run on more system types. Specifying two or more target processors enables unified binary code generation, where two or more versions of each function may be generated, each version optimized for the specific instruction set available in each target processor. Executables created on a given system without the -tp flag may not be usable on previous generation systems. For example, executables created on an Intel Skylake processor may use instructions that are not available on earlier Intel Sandy Bridge systems. The following list contains the possible suboptions for -?tp and the processors that each suboption is intended to target. px - generate code that is usable on any x86-64 processor-based system. bulldozer - generate code for AMD Bulldozer and compatible processors. piledriver - generate code that is usable on any AMD Piledriver processor-based system. zen - generate code that is usable on any AMD Zen processor-based system (Epyc, Ryzen). sandybridge - generate code for Intel Sandy Bridge and compatible processors. haswell - generate code that is usable on any Intel Haswell processor-based system. knl - generate code that is usable on any Intel Knights Landing processor-based system. skylake - generate code that is usable on an Intel Skylake Xeon processor-based system.
- -mp
- FOPTIMIZE, OPTIMIZE
- Instructs the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives, and to generate an executable file which will utilize multiple processors in a shared-memory parallel system. Use the -mp option to instruct the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives and to generate an executable file which utilizes multiple processors in a shared-memory parallel system. The suboptions are one or more of the following: align Forces loop iterations to be allocated to OpenMP processes using an algorithm that maximizes alignment of vector sub-sections in loops that are both parallelized and vectorized for SSE. This allocation can improve performance in program units that include many such loops. It can also result in load-balancing problems that significantly decrease performance in program units with relatively short loops that contain a large amount of work in each iteration. The numa suboption uses libnuma on systems where it is available. allcores Instructs the compiler to target all available cores. You specify this suboption at link time. bind Instructs the compiler to bind threads to cores. You specify this suboption at link time. [no]numa Uses [does not use] libnuma on systems where it is available. For a detailed description of this programming model and the associated directives, refer to Section 9, Using OpenMP of the PGI Compiler User's Guide. To set this option in PVF, use the Fortran | Language | Enable OpenMP Directives property, described in Enable OpenMP Directives.
- -m64
- FOPTIMIZE, OPTIMIZE
- Use the 64-bit compiler for the default processor type.
  Description Use this option to specify the 64-bit compiler as the default processor type.
- -fast
- FOPTIMIZE, OPTIMIZE
- This options creates a generally optimal set of flags for targets that support SIMD capability. It incorporates optimization options to enable use of vector streaming SIMD instructions (64-bit targets) and enable vectorization with SIMD instructions, cache aligned and flushz.
- -Mmovnt
- FOPTIMIZE, OPTIMIZE
- The –Mmovnt option instructs the compiler to generate non-temporal stores and prefetch instructions, even when it cannot determine that this is beneficial.
- -Mfprelaxed
- FOPTIMIZE, OPTIMIZE
- Instructs the compiler to use [not use] relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy.
- -Mallocatable=95
- FOPTIMIZE
- Controls whether Fortran 95 or Fortran 2003 semantics are used in allocatable array assignments. The default behavior is to use Fortran 95 semantics; the 03 option instructs the compiler to use Fortran 2003 semantics.
- -Mdefaultunit
- FOPTIMIZE
- Instructs the compiler to treat "*" as a synonym for standard input for reading and standard output for writing.
- -Mnostride0
- FOPTIMIZE
- Instructs the compiler to perform certain optimizations and to disallow for stride 0 array references.
- -Mnoiomutex
- FOPTIMIZE
- Instructs the compiler not to generate critical section calls around Fortran I/O statements.
- -Mcray=pointer
- FOPTIMIZE
- (Fortran only) Force Cray Fortran (CF77) compatibility with respect to the listed options. Possible values of option include:
  pointer
  
  for purposes of optimization, it is assumed that pointer-based variables do not overlay the storage of any other ariable.
- -Mbackslash
- FOPTIMIZE
- Instructs the compiler to treat the backslash as a normal character, and not as an escape character in quoted strings.
- -Mdlines
- FOPTIMIZE
- Instructs the compiler to treat lines containing "D" in column 1 as executable statements (ignoring the "D").

351.bwaves

- -O3
- FOPTIMIZE, OPTIMIZE
- All level 1 and 2 optimizations are performed. In addition, this level enables more aggressive code hoisting and scalar replacement optimizations that may or may not be profitable.
- -tp=zen
- FOPTIMIZE, OPTIMIZE
- Specify the type(s) of the target processors(s). The PGI compilers produce code specifically targeted to the type of processor on which the compilation is performed. In particular, the default is to use all supported instructions wherever possible when compiling on a given system. The default target processor is auto-selected depending on the processor on which the compilation is performed. You can specify a target processor to compile for a different processor type, such as to select a more generic processor, allowing the code to run on more system types. Specifying two or more target processors enables unified binary code generation, where two or more versions of each function may be generated, each version optimized for the specific instruction set available in each target processor. Executables created on a given system without the -tp flag may not be usable on previous generation systems. For example, executables created on an Intel Skylake processor may use instructions that are not available on earlier Intel Sandy Bridge systems. The following list contains the possible suboptions for -?tp and the processors that each suboption is intended to target. px - generate code that is usable on any x86-64 processor-based system. bulldozer - generate code for AMD Bulldozer and compatible processors. piledriver - generate code that is usable on any AMD Piledriver processor-based system. zen - generate code that is usable on any AMD Zen processor-based system (Epyc, Ryzen). sandybridge - generate code for Intel Sandy Bridge and compatible processors. haswell - generate code that is usable on any Intel Haswell processor-based system. knl - generate code that is usable on any Intel Knights Landing processor-based system. skylake - generate code that is usable on an Intel Skylake Xeon processor-based system.
- -mp
- FOPTIMIZE, OPTIMIZE
- Instructs the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives, and to generate an executable file which will utilize multiple processors in a shared-memory parallel system. Use the -mp option to instruct the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives and to generate an executable file which utilizes multiple processors in a shared-memory parallel system. The suboptions are one or more of the following: align Forces loop iterations to be allocated to OpenMP processes using an algorithm that maximizes alignment of vector sub-sections in loops that are both parallelized and vectorized for SSE. This allocation can improve performance in program units that include many such loops. It can also result in load-balancing problems that significantly decrease performance in program units with relatively short loops that contain a large amount of work in each iteration. The numa suboption uses libnuma on systems where it is available. allcores Instructs the compiler to target all available cores. You specify this suboption at link time. bind Instructs the compiler to bind threads to cores. You specify this suboption at link time. [no]numa Uses [does not use] libnuma on systems where it is available. For a detailed description of this programming model and the associated directives, refer to Section 9, Using OpenMP of the PGI Compiler User's Guide. To set this option in PVF, use the Fortran | Language | Enable OpenMP Directives property, described in Enable OpenMP Directives.
- -m64
- FOPTIMIZE, OPTIMIZE
- Use the 64-bit compiler for the default processor type.
  Description Use this option to specify the 64-bit compiler as the default processor type.
- -fast
- FOPTIMIZE, OPTIMIZE
- This options creates a generally optimal set of flags for targets that support SIMD capability. It incorporates optimization options to enable use of vector streaming SIMD instructions (64-bit targets) and enable vectorization with SIMD instructions, cache aligned and flushz.
- -Mmovnt
- FOPTIMIZE, OPTIMIZE
- The –Mmovnt option instructs the compiler to generate non-temporal stores and prefetch instructions, even when it cannot determine that this is beneficial.
- -Mfprelaxed
- FOPTIMIZE, OPTIMIZE
- Instructs the compiler to use [not use] relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy.
- -fastsse
- OPTIMIZE
- Generally optimal set of flags for targets that include SSE/SSE2 capability.
- -Mallocatable=95
- FOPTIMIZE
- Controls whether Fortran 95 or Fortran 2003 semantics are used in allocatable array assignments. The default behavior is to use Fortran 95 semantics; the 03 option instructs the compiler to use Fortran 2003 semantics.
- -Mdefaultunit
- FOPTIMIZE
- Instructs the compiler to treat "*" as a synonym for standard input for reading and standard output for writing.
- -Mnostride0
- FOPTIMIZE
- Instructs the compiler to perform certain optimizations and to disallow for stride 0 array references.
- -Mnoiomutex
- FOPTIMIZE
- Instructs the compiler not to generate critical section calls around Fortran I/O statements.
- -Mcray=pointer
- FOPTIMIZE
- (Fortran only) Force Cray Fortran (CF77) compatibility with respect to the listed options. Possible values of option include:
  pointer
  
  for purposes of optimization, it is assumed that pointer-based variables do not overlay the storage of any other ariable.
- -Mfixed
- FOPTIMIZE
- Instructs the compiler to assume input source files are in FORTRAN 77-style fixed form format.
- -pgf77libs
- FOPTIMIZE
- Use this option to instruct the compiler to append PGF77 runtime libraries to the link line.

357.bt331

- -O3
- FOPTIMIZE, OPTIMIZE
- All level 1 and 2 optimizations are performed. In addition, this level enables more aggressive code hoisting and scalar replacement optimizations that may or may not be profitable.
- -tp=zen
- FOPTIMIZE, OPTIMIZE
- Specify the type(s) of the target processors(s). The PGI compilers produce code specifically targeted to the type of processor on which the compilation is performed. In particular, the default is to use all supported instructions wherever possible when compiling on a given system. The default target processor is auto-selected depending on the processor on which the compilation is performed. You can specify a target processor to compile for a different processor type, such as to select a more generic processor, allowing the code to run on more system types. Specifying two or more target processors enables unified binary code generation, where two or more versions of each function may be generated, each version optimized for the specific instruction set available in each target processor. Executables created on a given system without the -tp flag may not be usable on previous generation systems. For example, executables created on an Intel Skylake processor may use instructions that are not available on earlier Intel Sandy Bridge systems. The following list contains the possible suboptions for -?tp and the processors that each suboption is intended to target. px - generate code that is usable on any x86-64 processor-based system. bulldozer - generate code for AMD Bulldozer and compatible processors. piledriver - generate code that is usable on any AMD Piledriver processor-based system. zen - generate code that is usable on any AMD Zen processor-based system (Epyc, Ryzen). sandybridge - generate code for Intel Sandy Bridge and compatible processors. haswell - generate code that is usable on any Intel Haswell processor-based system. knl - generate code that is usable on any Intel Knights Landing processor-based system. skylake - generate code that is usable on an Intel Skylake Xeon processor-based system.
- -mp
- FOPTIMIZE, OPTIMIZE
- Instructs the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives, and to generate an executable file which will utilize multiple processors in a shared-memory parallel system. Use the -mp option to instruct the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives and to generate an executable file which utilizes multiple processors in a shared-memory parallel system. The suboptions are one or more of the following: align Forces loop iterations to be allocated to OpenMP processes using an algorithm that maximizes alignment of vector sub-sections in loops that are both parallelized and vectorized for SSE. This allocation can improve performance in program units that include many such loops. It can also result in load-balancing problems that significantly decrease performance in program units with relatively short loops that contain a large amount of work in each iteration. The numa suboption uses libnuma on systems where it is available. allcores Instructs the compiler to target all available cores. You specify this suboption at link time. bind Instructs the compiler to bind threads to cores. You specify this suboption at link time. [no]numa Uses [does not use] libnuma on systems where it is available. For a detailed description of this programming model and the associated directives, refer to Section 9, Using OpenMP of the PGI Compiler User's Guide. To set this option in PVF, use the Fortran | Language | Enable OpenMP Directives property, described in Enable OpenMP Directives.
- -m64
- FOPTIMIZE, OPTIMIZE
- Use the 64-bit compiler for the default processor type.
  Description Use this option to specify the 64-bit compiler as the default processor type.
- -fast
- FOPTIMIZE, OPTIMIZE
- This options creates a generally optimal set of flags for targets that support SIMD capability. It incorporates optimization options to enable use of vector streaming SIMD instructions (64-bit targets) and enable vectorization with SIMD instructions, cache aligned and flushz.
- -Mmovnt
- FOPTIMIZE, OPTIMIZE
- The –Mmovnt option instructs the compiler to generate non-temporal stores and prefetch instructions, even when it cannot determine that this is beneficial.
- -Mfprelaxed
- FOPTIMIZE, OPTIMIZE
- Instructs the compiler to use [not use] relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy.
- -fastsse
- OPTIMIZE
- Generally optimal set of flags for targets that include SSE/SSE2 capability.
- -Mallocatable=95
- FOPTIMIZE
- Controls whether Fortran 95 or Fortran 2003 semantics are used in allocatable array assignments. The default behavior is to use Fortran 95 semantics; the 03 option instructs the compiler to use Fortran 2003 semantics.
- -Mdefaultunit
- FOPTIMIZE
- Instructs the compiler to treat "*" as a synonym for standard input for reading and standard output for writing.
- -Mnostride0
- FOPTIMIZE
- Instructs the compiler to perform certain optimizations and to disallow for stride 0 array references.
- -Mnoiomutex
- FOPTIMIZE
- Instructs the compiler not to generate critical section calls around Fortran I/O statements.
- -Mcray=pointer
- FOPTIMIZE
- (Fortran only) Force Cray Fortran (CF77) compatibility with respect to the listed options. Possible values of option include:
  pointer
  
  for purposes of optimization, it is assumed that pointer-based variables do not overlay the storage of any other ariable.

360.ilbdc

- -O3
- FOPTIMIZE, OPTIMIZE
- All level 1 and 2 optimizations are performed. In addition, this level enables more aggressive code hoisting and scalar replacement optimizations that may or may not be profitable.
- -tp=zen
- FOPTIMIZE, OPTIMIZE
- Specify the type(s) of the target processors(s). The PGI compilers produce code specifically targeted to the type of processor on which the compilation is performed. In particular, the default is to use all supported instructions wherever possible when compiling on a given system. The default target processor is auto-selected depending on the processor on which the compilation is performed. You can specify a target processor to compile for a different processor type, such as to select a more generic processor, allowing the code to run on more system types. Specifying two or more target processors enables unified binary code generation, where two or more versions of each function may be generated, each version optimized for the specific instruction set available in each target processor. Executables created on a given system without the -tp flag may not be usable on previous generation systems. For example, executables created on an Intel Skylake processor may use instructions that are not available on earlier Intel Sandy Bridge systems. The following list contains the possible suboptions for -?tp and the processors that each suboption is intended to target. px - generate code that is usable on any x86-64 processor-based system. bulldozer - generate code for AMD Bulldozer and compatible processors. piledriver - generate code that is usable on any AMD Piledriver processor-based system. zen - generate code that is usable on any AMD Zen processor-based system (Epyc, Ryzen). sandybridge - generate code for Intel Sandy Bridge and compatible processors. haswell - generate code that is usable on any Intel Haswell processor-based system. knl - generate code that is usable on any Intel Knights Landing processor-based system. skylake - generate code that is usable on an Intel Skylake Xeon processor-based system.
- -mp
- FOPTIMIZE, OPTIMIZE
- Instructs the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives, and to generate an executable file which will utilize multiple processors in a shared-memory parallel system. Use the -mp option to instruct the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives and to generate an executable file which utilizes multiple processors in a shared-memory parallel system. The suboptions are one or more of the following: align Forces loop iterations to be allocated to OpenMP processes using an algorithm that maximizes alignment of vector sub-sections in loops that are both parallelized and vectorized for SSE. This allocation can improve performance in program units that include many such loops. It can also result in load-balancing problems that significantly decrease performance in program units with relatively short loops that contain a large amount of work in each iteration. The numa suboption uses libnuma on systems where it is available. allcores Instructs the compiler to target all available cores. You specify this suboption at link time. bind Instructs the compiler to bind threads to cores. You specify this suboption at link time. [no]numa Uses [does not use] libnuma on systems where it is available. For a detailed description of this programming model and the associated directives, refer to Section 9, Using OpenMP of the PGI Compiler User's Guide. To set this option in PVF, use the Fortran | Language | Enable OpenMP Directives property, described in Enable OpenMP Directives.
- -m64
- FOPTIMIZE, OPTIMIZE
- Use the 64-bit compiler for the default processor type.
  Description Use this option to specify the 64-bit compiler as the default processor type.
- -fast
- FOPTIMIZE, OPTIMIZE
- This options creates a generally optimal set of flags for targets that support SIMD capability. It incorporates optimization options to enable use of vector streaming SIMD instructions (64-bit targets) and enable vectorization with SIMD instructions, cache aligned and flushz.
- -Mmovnt
- FOPTIMIZE, OPTIMIZE
- The –Mmovnt option instructs the compiler to generate non-temporal stores and prefetch instructions, even when it cannot determine that this is beneficial.
- -Mfprelaxed
- FOPTIMIZE, OPTIMIZE
- Instructs the compiler to use [not use] relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy.
- -Mallocatable=95
- FOPTIMIZE, OPTIMIZE
- Controls whether Fortran 95 or Fortran 2003 semantics are used in allocatable array assignments. The default behavior is to use Fortran 95 semantics; the 03 option instructs the compiler to use Fortran 2003 semantics.
- -Mdefaultunit
- FOPTIMIZE, OPTIMIZE
- Instructs the compiler to treat "*" as a synonym for standard input for reading and standard output for writing.
- -Mnostride0
- FOPTIMIZE, OPTIMIZE
- Instructs the compiler to perform certain optimizations and to disallow for stride 0 array references.
- -Mnoiomutex
- FOPTIMIZE, OPTIMIZE
- Instructs the compiler not to generate critical section calls around Fortran I/O statements.
- -Mcray=pointer
- FOPTIMIZE, OPTIMIZE
- (Fortran only) Force Cray Fortran (CF77) compatibility with respect to the listed options. Possible values of option include:
  pointer
  
  for purposes of optimization, it is assumed that pointer-based variables do not overlay the storage of any other ariable.
- -Mnoupcase
- OPTIMIZE
- Instructs the compiler to convert all identifiers to lower case. This selection affects the linking process. If you compile and link the same source code using-Mupcase on one occasion and -Mnoupcase on another, you may get two different executables, depending on whether the source contains uppercase letters. The standard libraries are compiled using -Mnoupcase.

362.fma3d

- -O3
- FOPTIMIZE, OPTIMIZE
- All level 1 and 2 optimizations are performed. In addition, this level enables more aggressive code hoisting and scalar replacement optimizations that may or may not be profitable.
- -tp=zen
- FOPTIMIZE, OPTIMIZE
- Specify the type(s) of the target processors(s). The PGI compilers produce code specifically targeted to the type of processor on which the compilation is performed. In particular, the default is to use all supported instructions wherever possible when compiling on a given system. The default target processor is auto-selected depending on the processor on which the compilation is performed. You can specify a target processor to compile for a different processor type, such as to select a more generic processor, allowing the code to run on more system types. Specifying two or more target processors enables unified binary code generation, where two or more versions of each function may be generated, each version optimized for the specific instruction set available in each target processor. Executables created on a given system without the -tp flag may not be usable on previous generation systems. For example, executables created on an Intel Skylake processor may use instructions that are not available on earlier Intel Sandy Bridge systems. The following list contains the possible suboptions for -?tp and the processors that each suboption is intended to target. px - generate code that is usable on any x86-64 processor-based system. bulldozer - generate code for AMD Bulldozer and compatible processors. piledriver - generate code that is usable on any AMD Piledriver processor-based system. zen - generate code that is usable on any AMD Zen processor-based system (Epyc, Ryzen). sandybridge - generate code for Intel Sandy Bridge and compatible processors. haswell - generate code that is usable on any Intel Haswell processor-based system. knl - generate code that is usable on any Intel Knights Landing processor-based system. skylake - generate code that is usable on an Intel Skylake Xeon processor-based system.
- -mp
- FOPTIMIZE, OPTIMIZE
- Instructs the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives, and to generate an executable file which will utilize multiple processors in a shared-memory parallel system. Use the -mp option to instruct the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives and to generate an executable file which utilizes multiple processors in a shared-memory parallel system. The suboptions are one or more of the following: align Forces loop iterations to be allocated to OpenMP processes using an algorithm that maximizes alignment of vector sub-sections in loops that are both parallelized and vectorized for SSE. This allocation can improve performance in program units that include many such loops. It can also result in load-balancing problems that significantly decrease performance in program units with relatively short loops that contain a large amount of work in each iteration. The numa suboption uses libnuma on systems where it is available. allcores Instructs the compiler to target all available cores. You specify this suboption at link time. bind Instructs the compiler to bind threads to cores. You specify this suboption at link time. [no]numa Uses [does not use] libnuma on systems where it is available. For a detailed description of this programming model and the associated directives, refer to Section 9, Using OpenMP of the PGI Compiler User's Guide. To set this option in PVF, use the Fortran | Language | Enable OpenMP Directives property, described in Enable OpenMP Directives.
- -m64
- FOPTIMIZE, OPTIMIZE
- Use the 64-bit compiler for the default processor type.
  Description Use this option to specify the 64-bit compiler as the default processor type.
- -fast
- FOPTIMIZE, OPTIMIZE
- This options creates a generally optimal set of flags for targets that support SIMD capability. It incorporates optimization options to enable use of vector streaming SIMD instructions (64-bit targets) and enable vectorization with SIMD instructions, cache aligned and flushz.
- -Mmovnt
- FOPTIMIZE, OPTIMIZE
- The –Mmovnt option instructs the compiler to generate non-temporal stores and prefetch instructions, even when it cannot determine that this is beneficial.
- -Mfprelaxed
- FOPTIMIZE, OPTIMIZE
- Instructs the compiler to use [not use] relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy.
- -Mallocatable=95
- FOPTIMIZE
- Controls whether Fortran 95 or Fortran 2003 semantics are used in allocatable array assignments. The default behavior is to use Fortran 95 semantics; the 03 option instructs the compiler to use Fortran 2003 semantics.
- -Mdefaultunit
- FOPTIMIZE
- Instructs the compiler to treat "*" as a synonym for standard input for reading and standard output for writing.
- -Mnostride0
- FOPTIMIZE
- Instructs the compiler to perform certain optimizations and to disallow for stride 0 array references.
- -Mnoiomutex
- FOPTIMIZE
- Instructs the compiler not to generate critical section calls around Fortran I/O statements.
- -Mcray=pointer
- FOPTIMIZE
- (Fortran only) Force Cray Fortran (CF77) compatibility with respect to the listed options. Possible values of option include:
  pointer
  
  for purposes of optimization, it is assumed that pointer-based variables do not overlay the storage of any other ariable.
- -Mnoupcase
- FOPTIMIZE
- Instructs the compiler to convert all identifiers to lower case. This selection affects the linking process. If you compile and link the same source code using-Mupcase on one occasion and -Mnoupcase on another, you may get two different executables, depending on whether the source contains uppercase letters. The standard libraries are compiled using -Mnoupcase.
- -pgf90libs
- FOPTIMIZE
- Use this option to instruct the compiler to append PGF90/PGF95/PGFORTRAN runtime libraries to the link line.

363.swim

- -O3
- FOPTIMIZE, OPTIMIZE
- All level 1 and 2 optimizations are performed. In addition, this level enables more aggressive code hoisting and scalar replacement optimizations that may or may not be profitable.
- -tp=zen
- FOPTIMIZE, OPTIMIZE
- Specify the type(s) of the target processors(s). The PGI compilers produce code specifically targeted to the type of processor on which the compilation is performed. In particular, the default is to use all supported instructions wherever possible when compiling on a given system. The default target processor is auto-selected depending on the processor on which the compilation is performed. You can specify a target processor to compile for a different processor type, such as to select a more generic processor, allowing the code to run on more system types. Specifying two or more target processors enables unified binary code generation, where two or more versions of each function may be generated, each version optimized for the specific instruction set available in each target processor. Executables created on a given system without the -tp flag may not be usable on previous generation systems. For example, executables created on an Intel Skylake processor may use instructions that are not available on earlier Intel Sandy Bridge systems. The following list contains the possible suboptions for -?tp and the processors that each suboption is intended to target. px - generate code that is usable on any x86-64 processor-based system. bulldozer - generate code for AMD Bulldozer and compatible processors. piledriver - generate code that is usable on any AMD Piledriver processor-based system. zen - generate code that is usable on any AMD Zen processor-based system (Epyc, Ryzen). sandybridge - generate code for Intel Sandy Bridge and compatible processors. haswell - generate code that is usable on any Intel Haswell processor-based system. knl - generate code that is usable on any Intel Knights Landing processor-based system. skylake - generate code that is usable on an Intel Skylake Xeon processor-based system.
- -mp
- FOPTIMIZE, OPTIMIZE
- Instructs the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives, and to generate an executable file which will utilize multiple processors in a shared-memory parallel system. Use the -mp option to instruct the compiler to interpret user-inserted OpenMP shared-memory parallel programming directives and to generate an executable file which utilizes multiple processors in a shared-memory parallel system. The suboptions are one or more of the following: align Forces loop iterations to be allocated to OpenMP processes using an algorithm that maximizes alignment of vector sub-sections in loops that are both parallelized and vectorized for SSE. This allocation can improve performance in program units that include many such loops. It can also result in load-balancing problems that significantly decrease performance in program units with relatively short loops that contain a large amount of work in each iteration. The numa suboption uses libnuma on systems where it is available. allcores Instructs the compiler to target all available cores. You specify this suboption at link time. bind Instructs the compiler to bind threads to cores. You specify this suboption at link time. [no]numa Uses [does not use] libnuma on systems where it is available. For a detailed description of this programming model and the associated directives, refer to Section 9, Using OpenMP of the PGI Compiler User's Guide. To set this option in PVF, use the Fortran | Language | Enable OpenMP Directives property, described in Enable OpenMP Directives.
- -m64
- FOPTIMIZE, OPTIMIZE
- Use the 64-bit compiler for the default processor type.
  Description Use this option to specify the 64-bit compiler as the default processor type.
- -fast
- FOPTIMIZE, OPTIMIZE
- This options creates a generally optimal set of flags for targets that support SIMD capability. It incorporates optimization options to enable use of vector streaming SIMD instructions (64-bit targets) and enable vectorization with SIMD instructions, cache aligned and flushz.
- -Mmovnt
- FOPTIMIZE, OPTIMIZE
- The –Mmovnt option instructs the compiler to generate non-temporal stores and prefetch instructions, even when it cannot determine that this is beneficial.
- -Mfprelaxed
- FOPTIMIZE, OPTIMIZE
- Instructs the compiler to use [not use] relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy.
- -Mallocatable=95
- FOPTIMIZE
- Controls whether Fortran 95 or Fortran 2003 semantics are used in allocatable array assignments. The default behavior is to use Fortran 95 semantics; the 03 option instructs the compiler to use Fortran 2003 semantics.
- -Mdefaultunit
- FOPTIMIZE
- Instructs the compiler to treat "*" as a synonym for standard input for reading and standard output for writing.
- -Mnostride0
- FOPTIMIZE
- Instructs the compiler to perform certain optimizations and to disallow for stride 0 array references.
- -Mnoiomutex
- FOPTIMIZE
- Instructs the compiler not to generate critical section calls around Fortran I/O statements.
- -Mcray=pointer
- FOPTIMIZE
- (Fortran only) Force Cray Fortran (CF77) compatibility with respect to the listed options. Possible values of option include:
  pointer
  
  for purposes of optimization, it is assumed that pointer-based variables do not overlay the storage of any other ariable.

371.applu331

- Same as 363.swim

Shell, Environment, and Other Software Settings

Open MP Tuning Flags

Syntax
KMP_AFFINITY=[<modifier>,...]<type>[,<permute>][,<offset>]

Argument	Default	Description
modifier	noverbose respect granularity=core	Optional. String consisting of keyword and specifier. granularity=<specifier> takes the following specifiers: fine, thread, and core norespect noverbose nowarnings proclist={<proc-list>} respect verbose warnings
type	none	Required string. Indicates the thread affinity to use. compact disabled explicit none scatter logical (deprecated; instead use compact, but omit any permute value) physical (deprecated; instead use scatter, possibly with an offset value) The logical and physical types are deprecated but supported for backward compatibility.
permute	0	Optional. Positive integer value. Not valid with type values of explicit, none, or disabled.
offset	0	Optional. Positive integer value. Not valid with type values of explicit, none, or disabled.

For questions about the meanings of these flags, please contact the tester.
For other inquiries, please contact webmaster@spec.org
Copyright 2012-2021 Standard Performance Evaluation Corporation
Tested with SPEC OMP2012 v1.1.
Report generated on Mon Mar 15 11:08:19 2021 by SPEC OMP2012 flags formatter v538.

	Indicates that the flag description came from the user flags file.
	Indicates that the flag description came from the suite-wide flags file.
	Indicates that the flag description came from a per-benchmark flags file.

OMP2012 Flag DescriptionLenovo Global Technology ThinkSystem SR665(AMD EYPC 7763 CPU, 2.45GHz)

Base Compiler Invocation

Peak Compiler Invocation

Base Portability Flags

Peak Portability Flags

Base Optimization Flags

Peak Optimization Flags

Syntax

Default

Description

Affinity Types

type = none (default)

type = compact

type = disabled

type = explicit

type = scatter

Deprecated Types: logical and physical

Permute and offset combinations

Modifier Values for Affinity Types

modifier = noverbose (default)

modifier = verbose

Execution modes

Serial

Turnaround

Throughput

OMP2012 Flag Description
Lenovo Global Technology ThinkSystem SR665(AMD EYPC 7763 CPU, 2.45GHz)