accel2023 Result Flag Description

Base Optimization Flags

C benchmarks

- -Ofast
- mpicc, mpicxx, mpifort, mpiicc, mpiicpc, mpiifort, mpiicx, mpiicpx, mpiifx, icx, icpx, ifx
- OPTIMIZE
- Enable -O3 -no-prec-div -fp-model fast=2 optimizations.
- -O3
- OPTIMIZE
- Optimize for maximum speed and enable more aggressive optimizations that may not improve performance on some programs.
- -xsapphirerapids
- OPTIMIZE
- May generate instructions for processors that support the specified Intel processor or microarchitecture code name. Optimizes for the specified Intel processor or microarchitecture code name.
- -mprefer-vector-width=512
- OPTIMIZE
- Specifies preferred 512b vector width for auto-vectorization. Defaults to 'none' which allows target specific decisions.
- -qopt-multiple-gather-scatter-by-shuffles
- OPTIMIZE
- Enables or disables the optimization for multiple adjacent gather/scatter type vector memory references.
- -flto
- OPTIMIZE
- Enable LTO (Link Time Optimization) in 'full' mode.
- -ffast-math
- OPTIMIZE
- Allow aggressive, lossy floating-point optimizations.
- -fiopenmp
- mpicc, mpicxx, mpifort, mpiicc, mpiicpc, mpiifort, mpiicx, mpiicpx, mpiifx, icx, icpx, ifx
- OPTIMIZE
- Enable the compiler to generate multi-threaded code based on the OpenMP* directives. Similar behavior was granted by -qopenmp in previous versions.
- -qopt-dynamic-align
- OPTIMIZE
- Enable peeling to optimize alignment for vectorization.
- -fvec-peel-loops
- OPTIMIZE
- Enables peel loop vectorization.
- -qopt-streaming-stores always
- OPTIMIZE
- Specifies whether streaming stores are generated:
  
  always - enables generation of streaming stores under the assumption that the application is memory bound
  
  auto - compiler decides when streaming stores are used (DEFAULT)
  
  never - disables generation of streaming stores
- -Xclang
- COPTIMIZE
- Pass argument to clang -cc1
- -fopenmp-declare-target-scalar-defaultmap-firstprivate
- COPTIMIZE
- Assume that a scalar declare target variable with implicit data-mapping referenced in a 'target' construct has the same value in the host and device environment
- -fimf-precision=low
- COPTIMIZE
- -fimf-precision=value[:funclist]
  defines the accuracy (precision) for math library functions
  value - defined as one of the following values
  high - equivalent to max-error = 0.6
  medium - equivalent to max-error = 4 (DEFAULT)
  low - equivalent to accuracy-bits = 11 (single precision); accuracy-bits = 26 (double precision)
  funclist - optional comma separated list of one or more math library functions to which the attribute should be applied
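
The -qopt-dynamic-align and -fvec-peel-loops entries above both refer to loop peeling. The sketch below shows the arithmetic behind the idea, under stated assumptions: a 64-byte boundary (matching the 512-bit width requested by -mprefer-vector-width=512), 8-byte elements, and a start address that is a multiple of the element size. The function name and parameters are hypothetical, and a real compiler's decisions are more involved than this.

```python
def split_for_alignment(start_addr, n_elems, elem_size=8, align=64):
    """Split a loop of n_elems iterations into (peel, body, remainder).

    peel      - scalar iterations run until the address reaches an
                align-byte boundary
    body      - iterations covered by full, aligned vectors
    remainder - leftover scalar iterations after the vector body
    Assumes start_addr is a multiple of elem_size.
    """
    lanes = align // elem_size               # elements per full vector
    misalign = start_addr % align            # bytes past the last boundary
    peel = ((align - misalign) % align) // elem_size
    peel = min(peel, n_elems)                # short loops may never align
    body = ((n_elems - peel) // lanes) * lanes
    remainder = n_elems - peel - body
    return peel, body, remainder


# An array that starts 16 bytes past a 64-byte boundary:
print(split_for_alignment(4112, 100))        # (6, 88, 6)
# An array that is already aligned needs no peel iterations:
print(split_for_alignment(4096, 100))        # (0, 96, 4)
```

The peel count is what lets the main vector body use aligned loads and stores; the remainder is handled by a scalar or masked epilogue.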

Fortran benchmarks

- -Ofast
- mpicc, mpicxx, mpifort, mpiicc, mpiicpc, mpiifort, mpiicx, mpiicpx, mpiifx, icx, icpx, ifx
- OPTIMIZE
- Enable -O3 -no-prec-div -fp-model fast=2 optimizations.
- -O3
- OPTIMIZE
- Optimize for maximum speed and enable more aggressive optimizations that may not improve performance on some programs.
- -xsapphirerapids
- OPTIMIZE
- May generate instructions for processors that support the specified Intel processor or microarchitecture code name. Optimizes for the specified Intel processor or microarchitecture code name.
- -mprefer-vector-width=512
- OPTIMIZE
- Specifies preferred 512b vector width for auto-vectorization. Defaults to 'none' which allows target specific decisions.
- -qopt-multiple-gather-scatter-by-shuffles
- OPTIMIZE
- Enables or disables the optimization for multiple adjacent gather/scatter type vector memory references.
- -flto
- OPTIMIZE
- Enable LTO (Link Time Optimization) in 'full' mode.
- -ffast-math
- OPTIMIZE
- Allow aggressive, lossy floating-point optimizations.
- -fiopenmp
- mpicc, mpicxx, mpifort, mpiicc, mpiicpc, mpiifort, mpiicx, mpiicpx, mpiifx, icx, icpx, ifx
- OPTIMIZE
- Enable the compiler to generate multi-threaded code based on the OpenMP* directives. Similar behavior was granted by -qopenmp in previous versions.
- -qopt-dynamic-align
- OPTIMIZE
- Enable peeling to optimize alignment for vectorization.
- -fvec-peel-loops
- OPTIMIZE
- Enables peel loop vectorization.
- -qopt-streaming-stores always
- OPTIMIZE
- Specifies whether streaming stores are generated:
  
  always - enables generation of streaming stores under the assumption that the application is memory bound
  
  auto - compiler decides when streaming stores are used (DEFAULT)
  
  never - disables generation of streaming stores
- -nostandard-realloc-lhs
- FOPTIMIZE
- Option standard-realloc-lhs (the default) tells the compiler that when the left-hand side of an assignment is an allocatable object, it should be reallocated to the shape of the right-hand side of the assignment before the assignment occurs. This is the current Fortran Standard definition. This feature may cause extra overhead at run time. This option has the same effect as option assume realloc_lhs.
  
  If you specify nostandard-realloc-lhs, the compiler uses the old Fortran 2003 rules when interpreting assignment statements. The left-hand side is assumed to be allocated with the correct shape to hold the right-hand side. If it is not, incorrect behavior will occur. This option has the same effect as option assume norealloc_lhs.
- -align array32byte
- FOPTIMIZE
- The align toggle changes how data elements are aligned. Variables and arrays are analyzed and memory layout can be altered. Specifying array32byte will look for opportunities to transform and realign arrays to 32-byte boundaries.
- -auto
- FOPTIMIZE
- This option places local variables (scalars and arrays of all types), except those declared as SAVE, on the runtime stack. It is as if the variables were declared with the AUTOMATIC attribute. It does not affect variables that have the SAVE attribute or ALLOCATABLE attribute, or variables that appear in an EQUIVALENCE statement or in a common block. This option may provide a performance gain for your program, but if your program depends on variables having the same value as the last time the routine was invoked, your program may not function properly. If you want to cause variables to be placed in static memory, specify option [Q]save. If you want only scalar variables of certain intrinsic types to be placed on the runtime stack, specify option auto-scalar.
- -fimf-accuracy-bits-sqrt=14
- FOPTIMIZE
- Define the relative error, measured by the number of correct bits, for math library function results
- -fimf-precision=low
- FOPTIMIZE
- -fimf-precision=value[:funclist]
  defines the accuracy (precision) for math library functions
  value - defined as one of the following values
  high - equivalent to max-error = 0.6
  medium - equivalent to max-error = 4 (DEFAULT)
  low - equivalent to accuracy-bits = 11 (single precision); accuracy-bits = 26 (double precision)
  funclist - optional comma separated list of one or more math library functions to which the attribute should be applied
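
The accuracy-bits values quoted above (11 and 26 for -fimf-precision=low, 14 for -fimf-accuracy-bits-sqrt) can be turned into rough error figures. The sketch below assumes that "n correct bits" corresponds to a relative error of about 2^-n, and that the conversion to ulps is 2^(p - 1 - bits), where p is the significand width of the format (24 for single, 53 for double). Both relations are assumptions here and should be checked against the compiler documentation; the function names are hypothetical.

```python
# Significand widths (including the implicit leading bit) of IEEE 754 formats.
MANTISSA_BITS = {"single": 24, "double": 53}


def relative_error_bound(accuracy_bits):
    """Approximate relative error when accuracy_bits bits are correct."""
    return 2.0 ** -accuracy_bits


def ulps_from_bits(accuracy_bits, fmt):
    """Approximate error in ulps, assuming ulps = 2**(p - 1 - bits)."""
    return 2.0 ** (MANTISSA_BITS[fmt] - 1 - accuracy_bits)


for label, bits, fmt in [
    ("precision=low, single", 11, "single"),
    ("precision=low, double", 26, "double"),
    ("accuracy-bits-sqrt=14, single", 14, "single"),
]:
    print(label, relative_error_bound(bits), ulps_from_bits(bits, fmt))
```

Under these assumptions, 11 correct bits in single precision is a relative error near 4.9e-4, far looser than the 4-ulp default listed for medium.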

Benchmarks using both Fortran and C

- -Ofast
- mpicc, mpicxx, mpifort, mpiicc, mpiicpc, mpiifort, mpiicx, mpiicpx, mpiifx, icx, icpx, ifx
- OPTIMIZE
- Enable -O3 -no-prec-div -fp-model fast=2 optimizations.
- -O3
- OPTIMIZE
- Optimize for maximum speed and enable more aggressive optimizations that may not improve performance on some programs.
- -xsapphirerapids
- OPTIMIZE
- May generate instructions for processors that support the specified Intel processor or microarchitecture code name. Optimizes for the specified Intel processor or microarchitecture code name.
- -mprefer-vector-width=512
- OPTIMIZE
- Specifies preferred 512b vector width for auto-vectorization. Defaults to 'none' which allows target specific decisions.
- -qopt-multiple-gather-scatter-by-shuffles
- OPTIMIZE
- Enables or disables the optimization for multiple adjacent gather/scatter type vector memory references.
- -flto
- OPTIMIZE
- Enable LTO (Link Time Optimization) in 'full' mode.
- -ffast-math
- OPTIMIZE
- Allow aggressive, lossy floating-point optimizations.
- -fiopenmp
- mpicc, mpicxx, mpifort, mpiicc, mpiicpc, mpiifort, mpiicx, mpiicpx, mpiifx, icx, icpx, ifx
- OPTIMIZE
- Enable the compiler to generate multi-threaded code based on the OpenMP* directives. Similar behavior was granted by -qopenmp in previous versions.
- -qopt-dynamic-align
- OPTIMIZE
- Enable peeling to optimize alignment for vectorization.
- -fvec-peel-loops
- OPTIMIZE
- Enables peel loop vectorization.
- -qopt-streaming-stores always
- OPTIMIZE
- Specifies whether streaming stores are generated:
  
  always - enables generation of streaming stores under the assumption that the application is memory bound
  
  auto - compiler decides when streaming stores are used (DEFAULT)
  
  never - disables generation of streaming stores
- -Xclang
- COPTIMIZE
- Pass argument to clang -cc1
- -fopenmp-declare-target-scalar-defaultmap-firstprivate
- COPTIMIZE
- Assume that a scalar declare target variable with implicit data-mapping referenced in a 'target' construct has the same value in the host and device environment
- -fimf-precision=low
- COPTIMIZE, FOPTIMIZE
- -fimf-precision=value[:funclist]
  defines the accuracy (precision) for math library functions
  value - defined as one of the following values
  high - equivalent to max-error = 0.6
  medium - equivalent to max-error = 4 (DEFAULT)
  low - equivalent to accuracy-bits = 11 (single precision); accuracy-bits = 26 (double precision)
  funclist - optional comma separated list of one or more math library functions to which the attribute should be applied
- -nostandard-realloc-lhs
- FOPTIMIZE
- Option standard-realloc-lhs (the default) tells the compiler that when the left-hand side of an assignment is an allocatable object, it should be reallocated to the shape of the right-hand side of the assignment before the assignment occurs. This is the current Fortran Standard definition. This feature may cause extra overhead at run time. This option has the same effect as option assume realloc_lhs.
  
  If you specify nostandard-realloc-lhs, the compiler uses the old Fortran 2003 rules when interpreting assignment statements. The left-hand side is assumed to be allocated with the correct shape to hold the right-hand side. If it is not, incorrect behavior will occur. This option has the same effect as option assume norealloc_lhs.
- -align array32byte
- FOPTIMIZE
- The align toggle changes how data elements are aligned. Variables and arrays are analyzed and memory layout can be altered. Specifying array32byte will look for opportunities to transform and realign arrays to 32-byte boundaries.
- -auto
- FOPTIMIZE
- This option places local variables (scalars and arrays of all types), except those declared as SAVE, on the runtime stack. It is as if the variables were declared with the AUTOMATIC attribute. It does not affect variables that have the SAVE attribute or ALLOCATABLE attribute, or variables that appear in an EQUIVALENCE statement or in a common block. This option may provide a performance gain for your program, but if your program depends on variables having the same value as the last time the routine was invoked, your program may not function properly. If you want to cause variables to be placed in static memory, specify option [Q]save. If you want only scalar variables of certain intrinsic types to be placed on the runtime stack, specify option auto-scalar.
- -fimf-accuracy-bits-sqrt=14
- FOPTIMIZE
- Define the relative error, measured by the number of correct bits, for math library function results

Peak Optimization Flags

C benchmarks

403.stencil

- -Ofast
- mpicc, mpicxx, mpifort, mpiicc, mpiicpc, mpiifort, mpiicx, mpiicpx, mpiifx, icx, icpx, ifx
- OPTIMIZE
- Enable -O3 -no-prec-div -fp-model fast=2 optimizations.
- -O3
- OPTIMIZE
- Optimize for maximum speed and enable more aggressive optimizations that may not improve performance on some programs.
- -xsapphirerapids
- OPTIMIZE
- May generate instructions for processors that support the specified Intel processor or microarchitecture code name. Optimizes for the specified Intel processor or microarchitecture code name.
- -mprefer-vector-width=512
- OPTIMIZE
- Specifies preferred 512b vector width for auto-vectorization. Defaults to 'none' which allows target specific decisions.
- -qopt-multiple-gather-scatter-by-shuffles
- OPTIMIZE
- Enables or disables the optimization for multiple adjacent gather/scatter type vector memory references.
- -flto
- OPTIMIZE
- Enable LTO (Link Time Optimization) in 'full' mode.
- -ffast-math
- OPTIMIZE
- Allow aggressive, lossy floating-point optimizations.
- -fiopenmp
- mpicc, mpicxx, mpifort, mpiicc, mpiicpc, mpiifort, mpiicx, mpiicpx, mpiifx, icx, icpx, ifx
- OPTIMIZE
- Enable the compiler to generate multi-threaded code based on the OpenMP* directives. Similar behavior was granted by -qopenmp in previous versions.
- -qopt-dynamic-align
- OPTIMIZE
- Enable peeling to optimize alignment for vectorization.
- -fvec-peel-loops
- OPTIMIZE
- Enables peel loop vectorization.
- -qopt-streaming-stores always
- OPTIMIZE
- Specifies whether streaming stores are generated:
  
  always - enables generation of streaming stores under the assumption that the application is memory bound
  
  auto - compiler decides when streaming stores are used (DEFAULT)
  
  never - disables generation of streaming stores
- -Xclang
- COPTIMIZE
- Pass argument to clang -cc1
- -fopenmp-declare-target-scalar-defaultmap-firstprivate
- COPTIMIZE
- Assume that a scalar declare target variable with implicit data-mapping referenced in a 'target' construct has the same value in the host and device environment
- -fimf-precision=low
- COPTIMIZE
- -fimf-precision=value[:funclist]
  defines the accuracy (precision) for math library functions
  value - defined as one of the following values
  high - equivalent to max-error = 0.6
  medium - equivalent to max-error = 4 (DEFAULT)
  low - equivalent to accuracy-bits = 11 (single precision); accuracy-bits = 26 (double precision)
  funclist - optional comma separated list of one or more math library functions to which the attribute should be applied

Fortran benchmarks

456.spF

- -Ofast
- mpicc, mpicxx, mpifort, mpiicc, mpiicpc, mpiifort, mpiicx, mpiicpx, mpiifx, icx, icpx, ifx
- OPTIMIZE
- Enable -O3 -no-prec-div -fp-model fast=2 optimizations.
- -O3
- OPTIMIZE
- Optimize for maximum speed and enable more aggressive optimizations that may not improve performance on some programs.
- -xsapphirerapids
- OPTIMIZE
- May generate instructions for processors that support the specified Intel processor or microarchitecture code name. Optimizes for the specified Intel processor or microarchitecture code name.
- -mprefer-vector-width=512
- OPTIMIZE
- Specifies preferred 512b vector width for auto-vectorization. Defaults to 'none' which allows target specific decisions.
- -qopt-multiple-gather-scatter-by-shuffles
- OPTIMIZE
- Enables or disables the optimization for multiple adjacent gather/scatter type vector memory references.
- -flto
- OPTIMIZE
- Enable LTO (Link Time Optimization) in 'full' mode.
- -ffast-math
- OPTIMIZE
- Allow aggressive, lossy floating-point optimizations.
- -fiopenmp
- mpicc, mpicxx, mpifort, mpiicc, mpiicpc, mpiifort, mpiicx, mpiicpx, mpiifx, icx, icpx, ifx
- OPTIMIZE
- Enable the compiler to generate multi-threaded code based on the OpenMP* directives. Similar behavior was granted by -qopenmp in previous versions.
- -qopt-dynamic-align
- OPTIMIZE
- Enable peeling to optimize alignment for vectorization.
- -fvec-peel-loops
- OPTIMIZE
- Enables peel loop vectorization.
- -qopt-streaming-stores always
- OPTIMIZE
- Specifies whether streaming stores are generated:
  
  always - enables generation of streaming stores under the assumption that the application is memory bound
  
  auto - compiler decides when streaming stores are used (DEFAULT)
  
  never - disables generation of streaming stores
- -nostandard-realloc-lhs
- FOPTIMIZE
- Option standard-realloc-lhs (the default) tells the compiler that when the left-hand side of an assignment is an allocatable object, it should be reallocated to the shape of the right-hand side of the assignment before the assignment occurs. This is the current Fortran Standard definition. This feature may cause extra overhead at run time. This option has the same effect as option assume realloc_lhs.
  
  If you specify nostandard-realloc-lhs, the compiler uses the old Fortran 2003 rules when interpreting assignment statements. The left-hand side is assumed to be allocated with the correct shape to hold the right-hand side. If it is not, incorrect behavior will occur. This option has the same effect as option assume norealloc_lhs.
- -align array32byte
- FOPTIMIZE
- The align toggle changes how data elements are aligned. Variables and arrays are analyzed and memory layout can be altered. Specifying array32byte will look for opportunities to transform and realign arrays to 32-byte boundaries.
- -auto
- FOPTIMIZE
- This option places local variables (scalars and arrays of all types), except those declared as SAVE, on the runtime stack. It is as if the variables were declared with the AUTOMATIC attribute. It does not affect variables that have the SAVE attribute or ALLOCATABLE attribute, or variables that appear in an EQUIVALENCE statement or in a common block. This option may provide a performance gain for your program, but if your program depends on variables having the same value as the last time the routine was invoked, your program may not function properly. If you want to cause variables to be placed in static memory, specify option [Q]save. If you want only scalar variables of certain intrinsic types to be placed on the runtime stack, specify option auto-scalar.
- -fimf-accuracy-bits-sqrt=14
- FOPTIMIZE
- Define the relative error, measured by the number of correct bits, for math library function results
- -fimf-precision=low
- FOPTIMIZE
- -fimf-precision=value[:funclist]
  defines the accuracy (precision) for math library functions
  value - defined as one of the following values
  high - equivalent to max-error = 0.6
  medium - equivalent to max-error = 4 (DEFAULT)
  low - equivalent to accuracy-bits = 11 (single precision); accuracy-bits = 26 (double precision)
  funclist - optional comma separated list of one or more math library functions to which the attribute should be applied

Benchmarks using both Fortran and C

453.clvrleaf

- -Ofast
- mpicc, mpicxx, mpifort, mpiicc, mpiicpc, mpiifort, mpiicx, mpiicpx, mpiifx, icx, icpx, ifx
- OPTIMIZE
- Enable -O3 -no-prec-div -fp-model fast=2 optimizations.
- -O3
- OPTIMIZE
- Optimize for maximum speed and enable more aggressive optimizations that may not improve performance on some programs.
- -xsapphirerapids
- OPTIMIZE
- May generate instructions for processors that support the specified Intel processor or microarchitecture code name. Optimizes for the specified Intel processor or microarchitecture code name.
- -mprefer-vector-width=512
- OPTIMIZE
- Specifies preferred 512b vector width for auto-vectorization. Defaults to 'none' which allows target specific decisions.
- -qopt-multiple-gather-scatter-by-shuffles
- OPTIMIZE
- Enables or disables the optimization for multiple adjacent gather/scatter type vector memory references.
- -flto
- OPTIMIZE
- Enable LTO (Link Time Optimization) in 'full' mode.
- -ffast-math
- OPTIMIZE
- Allow aggressive, lossy floating-point optimizations.
- -fiopenmp
- mpicc, mpicxx, mpifort, mpiicc, mpiicpc, mpiifort, mpiicx, mpiicpx, mpiifx, icx, icpx, ifx
- OPTIMIZE
- Enable the compiler to generate multi-threaded code based on the OpenMP* directives. Similar behavior was granted by -qopenmp in previous versions.
- -qopt-dynamic-align
- OPTIMIZE
- Enable peeling to optimize alignment for vectorization.
- -fvec-peel-loops
- OPTIMIZE
- Enables peel loop vectorization.
- -qopt-streaming-stores always
- OPTIMIZE
- Specifies whether streaming stores are generated:
  
  always - enables generation of streaming stores under the assumption that the application is memory bound
  
  auto - compiler decides when streaming stores are used (DEFAULT)
  
  never - disables generation of streaming stores
- -Xclang
- COPTIMIZE
- Pass argument to clang -cc1
- -fopenmp-declare-target-scalar-defaultmap-firstprivate
- COPTIMIZE
- Assume that a scalar declare target variable with implicit data-mapping referenced in a 'target' construct has the same value in the host and device environment
- -fimf-precision=low
- COPTIMIZE, FOPTIMIZE
- -fimf-precision=value[:funclist]
  defines the accuracy (precision) for math library functions
  value - defined as one of the following values
  high - equivalent to max-error = 0.6
  medium - equivalent to max-error = 4 (DEFAULT)
  low - equivalent to accuracy-bits = 11 (single precision); accuracy-bits = 26 (double precision)
  funclist - optional comma separated list of one or more math library functions to which the attribute should be applied
- -nostandard-realloc-lhs
- FOPTIMIZE
- Option standard-realloc-lhs (the default) tells the compiler that when the left-hand side of an assignment is an allocatable object, it should be reallocated to the shape of the right-hand side of the assignment before the assignment occurs. This is the current Fortran Standard definition. This feature may cause extra overhead at run time. This option has the same effect as option assume realloc_lhs.
  
  If you specify nostandard-realloc-lhs, the compiler uses the old Fortran 2003 rules when interpreting assignment statements. The left-hand side is assumed to be allocated with the correct shape to hold the right-hand side. If it is not, incorrect behavior will occur. This option has the same effect as option assume norealloc_lhs.
- -align array32byte
- FOPTIMIZE
- The align toggle changes how data elements are aligned. Variables and arrays are analyzed and memory layout can be altered. Specifying array32byte will look for opportunities to transform and realign arrays to 32-byte boundaries.
- -auto
- FOPTIMIZE
- This option places local variables (scalars and arrays of all types), except those declared as SAVE, on the runtime stack. It is as if the variables were declared with the AUTOMATIC attribute. It does not affect variables that have the SAVE attribute or ALLOCATABLE attribute, or variables that appear in an EQUIVALENCE statement or in a common block. This option may provide a performance gain for your program, but if your program depends on variables having the same value as the last time the routine was invoked, your program may not function properly. If you want to cause variables to be placed in static memory, specify option [Q]save. If you want only scalar variables of certain intrinsic types to be placed on the runtime stack, specify option auto-scalar.
- -fimf-accuracy-bits-sqrt=14
- FOPTIMIZE
- Define the relative error, measured by the number of correct bits, for math library function results

459.miniGhost

- basepeak = yes

Shell, Environment, and Other Software Settings

One or more of the following settings may have been applied to the testbed. If so, the "Platform Notes" section of the report will say so, and you can read below to find out what these settings mean.

LD_LIBRARY_PATH=<directories> (linker)
LD_LIBRARY_PATH controls the search order for both the compile-time and run-time linkers. Usually it can be left at its default, but testers may sometimes set it explicitly (as documented in the notes in the submission) to ensure that the correct versions of libraries are picked up.
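
The effect of directory order can be sketched as a left-to-right, first-match search. This is a simplified model only: the real linkers also consult other locations (for example, embedded run paths and system defaults), and the function name, its `is_file` parameter, and the library and directory names are hypothetical.

```python
import os
import posixpath


def first_match(library_name, search_path, is_file=os.path.isfile):
    """Return the first directory in a colon-separated path that holds
    library_name, or None if no directory does.

    is_file is injectable so the search order can be shown without
    touching the real filesystem.
    """
    for directory in search_path.split(":"):
        if directory and is_file(posixpath.join(directory, library_name)):
            return directory
    return None


# Both directories "contain" the library; the earlier one wins.
present = {"/opt/a/lib/libdemo.so", "/opt/b/lib/libdemo.so"}
print(first_match("libdemo.so", "/opt/b/lib:/opt/a/lib", is_file=present.__contains__))
```

Swapping the two directories in the path changes which copy is found, which is why testers sometimes set the variable explicitly.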

STACKSIZE=<n>
Set the size of the stack (temporary storage area) for each slave thread of a multithreaded program.

ulimit -s <n>
Sets the stack size to n kbytes, or "unlimited" to allow the stack size to grow without limit.
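
As an illustration, the limit that `ulimit -s` sets can be inspected from a running process. The sketch below uses Python's standard `resource` module (available on Unix-like systems), which reports the limit in bytes; the helper name is hypothetical, and the conversion assumes 1 kbyte = 1024 bytes.

```python
import resource


def describe_stack_limit(limit_bytes):
    """Format a stack limit as kbytes, or "unlimited" when no limit is set."""
    if limit_bytes == resource.RLIM_INFINITY:
        return "unlimited"
    return str(limit_bytes // 1024)


# getrlimit returns the (soft, hard) limits in bytes.
soft, hard = resource.getrlimit(resource.RLIMIT_STACK)
print("soft:", describe_stack_limit(soft), "hard:", describe_stack_limit(hard))
```

The soft value is the one in effect for the process; the hard value is the ceiling to which the soft limit may be raised.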

Operating System Tuning Parameters

Firmware / BIOS / Microcode Settings

For questions about the meanings of these flags, please contact the tester.
For other inquiries, please contact info@spec.org
Copyright 2023-2024 Standard Performance Evaluation Corporation
Tested with SPEC accel2023 v2.0.17.
Report generated on 2024-03-06 18:09:14 by SPEC accel2023 flags formatter v112.


accel2023 Flag Description
Supermicro GPU SuperServer SYS-741GE-TNRT

Compilers: Intel Fortran/C/C++

Operating systems: Linux

Base Compiler Invocation

Peak Compiler Invocation

Base Portability Flags

Peak Portability Flags

Base Optimization Flags

Peak Optimization Flags
