# Invocation command line: # /mnt/ramdisk/cpu2017-1.1.9-aocc400-znver4-A1/bin/harness/runcpu --configfile amd_rate_aocc400_znver4_A1.cfg --tune all --reportable --iterations 2 --define DL-BIOSinc=Dell-BIOS_EPYC-4.inc --define DL-BIOS-adddcD=1 --define DL-BIOS-VirtD=1 --define DL-VERS=v4.6.1 --output_format html,pdf,txt --nopower --runmode rate --tune base:peak --size test:train:refrate fprate # output_root was not used for this run ############################################################################ ################################################################################ # AMD AOCC 400 SPEC CPU 2017 V1.1.9 Rate Configuration File for 64-bit Linux # # File name : amd_rate_aocc400_znver4_A1.cfg # Creation Date : April 13, 2023 # CPU 2017 Version : 1.1.9 # Supported benchmarks : All Rate benchmarks (intrate, fprate) # Compiler name/version : AOCC 4.0.0 # Operating system version : RHEL 8.6 # Supported OS's : SLE 15 SP4, Ubuntu 22.04, RHEL 9.0, RHEL 8.6 # Hardware : AMD Genoa, Genoa-X, Bergamo, Siena (AMD64) # FP Base Pointer Size : 64-bit # FP Peak Pointer Size : 64-bit # INT Base Pointer Size : 64-bit # INT Peak Pointer Size : 32/64-bit # Auto Parallelization : No # # Note: DO NOT EDIT THIS FILE, the only edits required to properly run these # binaries are made in the ini Python file. Please consult Readme.amd_rate_aocc400_znver4_A1.txt # for a few uncommon exceptions which require edits to this file. # # Description: # # This binary package automates away many of the complexities necessary to set # up and run SPEC CPU 2017 under optimized conditions on AMD Genoa, Genoa-X, Bergamo, Siena-based # server platforms within Linux (AMD64). # # The binary package was built specifically for AMD Genoa, Genoa-X, Bergamo, Siena microprocessors and # is not intended to run on other products. # # Please install the binary package by following the instructions in # "Readme.amd_rate_aocc400_znver4_A1.txt" under the "How To Use the Binaries" section. # # The binary package is designed to work without alteration on two socket AMD # Genoa, Genoa-X, Bergamo, Siena-based servers with 96 cores per socket, SMT enabled and 1.5 TiB of DDR5 # memory distributed evenly among all 24 channels using 64 GiB DIMMs. # # To run the binary package on other Genoa, Genoa-X, Bergamo, Siena configurations, please review # "Readme.amd_rate_aocc400_znver4_A1.txt". In general, Genoa, Genoa-X, Bergamo, Siena CPUs # should be autodetected with no action required by the user. # # In most cases, it should be unnecessary to edit "amd_rate_aocc400_znver4_A1.cfg" or any # other file besides "ini_amd_rate_aocc400_znver4_A1.py" where reporting fields # and run conditions are set. # # The run script automatically sets the optimal number of rate copies and binds # them appropriately. # # The run script and accompanying binary package are designed to work on Ubuntu # 22.04. # # Important! If you write your own run script, please set the stack size to # "unlimited" when executing this binary package. Failure to do so may cause # some benchmarks to overflow the stack. For example, to set stack size within # the bash shell, include the following line somewhere at the top of your run # script before the runcpu invocation: # # ulimit -s unlimited # # Modification of this config file should only be necessary if you intend to # rebuild the binaries. General instructions for rebuilding the binaries are # found in-line below. # ################################################################################ # Modifiable macros: ################################################################################ # "allow_build"" switch: # Change the following line to true if you intend to REBUILD the binaries (AMD # does not support this). Valid values are "true" or "false" (no quotes). %define allow_build false # Only change these macros if you are rebuilding the binary package: %define compiler_name aocc400 %define binary_package_name amd_rate_%{compiler_name}_znver4_A %define binary_package_ext %{binary_package_name} %define binary_package_revision 1 %define build_path /home/work/cpu2017/v119/aocc4/znver4/rate %define flags_file_name %{compiler_name}-flags.xml # Do NOT change build_lib_dir after the build or it will trigger a # rebuild of the xalanc. It should also remain literal: %define build_lib_dir %{binary_package_name}_lib # To enable the platform file, be sure to uncomment the flagsurl02 header line # below in the Header settings. %define platform_file_name INVALID_platform_%{binary_package_name}.xml ################################################################################ # You should never have to change binary_package_full_name: %define binary_package_full_name %{binary_package_name}%{binary_package_revision} ################################################################################ # Include file names ################################################################################ # The include file contains fields that are commonly changed. This file is auto- # generated based upon INI file settings and should not need user modification # for runs. The flags include file contains all of the compiler flags. %define inc_file_name %{binary_package_full_name}.inc %define flags_inc_file_name %{binary_package_full_name}_flags.inc # Binary label extension: # Only modify the binary label extension if you plan to rebuild the binaries. # If you plan to recompile these CPU 2017 binaries, please choose a new extension # name below to avoid confusion with the current binary set on your system # under test, and to avoid confusion for SPEC submission reviewers. You will # also need to set "allow_build" to true above. Finally, you must modify the # Paths section below to point to your library locations if the paths are not # already set up in your build environment. # Note that AMD calls an external script to set up the compiler and library # paths before initiating the build. %define ext %{binary_package_ext} ################################################################################ # Paths and Environment Variables # ** MODIFY AS NEEDED (modification should not be necessary for runs) ** ################################################################################ # Allow environment variables to be set before runs: preenv = 1 # Necessary to avoid out-of-memory exceptions on certain SUTs: preENV_MALLOC_CONF = retain:true # Define the name of the directory that holds AMD library files: %define lib_dir %{binary_package_name}_lib # Set the shared object library path for runs and builds: preENV_LD_LIBRARY_PATH = $[top]/%{lib_dir}/lib:$[top]/%{lib_dir}/lib32:%{ENV_LD_LIBRARY_PATH} # Define 32-bit library build paths: # Do NOT use $[top] with the 32-bit libraries because doing so will cause an # options checksum error triggering a xalanc recompile attempt on SUTs having # different file paths. # Do NOT change build_lib_dir after the build or it will also trigger a # rebuild of the xalanc: AMDALLOC_LIB32_PATH = %{build_path}/%{build_lib_dir}/lib32 %if '%{allow_build}' eq 'false' # The include file is only needed for runs, but not for builds. # include: %{inc_file_name} # ----- Begin inclusion of 'amd_rate_aocc400_znver4_A1.inc' ############################################################################ ################################################################################ ################################################################################ # File name: amd_rate_aocc400_znver4_A1.inc # File generation code date: September 23, 2022 # File generation date/time: August 23, 2023 / 16:21:13 # # This file is automatically generated during a SPEC CPU2017 run. # # To modify inc file generation, please consult the readme file or the run # script. ################################################################################ ################################################################################ ################################################################################ ################################################################################ # The following macros are generated for use in the cfg file. ################################################################################ ################################################################################ %define logical_core_count 32 %define physical_core_count 16 ################################################################################ # The following inc blocks set the rate copy counts and affinity settings. # # intrate benchmarks: 500.perlbench_r,502.gcc_r,505.mcf_r,520.omnetpp_r, # 523.xalancbmk_r,525.x264_r,531.deepsjeng_r,541.leela_r,548.exchange2_r, # 557.xz_r # fpspeed benchmarks: 503.bwaves_r,507.cactuBSSN_r,519.lbm_r,521.wrf_r, # 527.cam4_r,538.imagick_r,544.nab_r,549.fotonik3d_r,554.roms_r # # Selected copy counts from '9184x' section of CPU info ################################################################################ # default copy counts: default: copies = 32 # Bind commands for assigning affinity: bind0 = numactl --localalloc --physcpubind=0 bind1 = numactl --localalloc --physcpubind=1 bind2 = numactl --localalloc --physcpubind=2 bind3 = numactl --localalloc --physcpubind=3 bind4 = numactl --localalloc --physcpubind=4 bind5 = numactl --localalloc --physcpubind=5 bind6 = numactl --localalloc --physcpubind=6 bind7 = numactl --localalloc --physcpubind=7 bind8 = numactl --localalloc --physcpubind=8 bind9 = numactl --localalloc --physcpubind=9 bind10 = numactl --localalloc --physcpubind=10 bind11 = numactl --localalloc --physcpubind=11 bind12 = numactl --localalloc --physcpubind=12 bind13 = numactl --localalloc --physcpubind=13 bind14 = numactl --localalloc --physcpubind=14 bind15 = numactl --localalloc --physcpubind=15 bind16 = numactl --localalloc --physcpubind=16 bind17 = numactl --localalloc --physcpubind=17 bind18 = numactl --localalloc --physcpubind=18 bind19 = numactl --localalloc --physcpubind=19 bind20 = numactl --localalloc --physcpubind=20 bind21 = numactl --localalloc --physcpubind=21 bind22 = numactl --localalloc --physcpubind=22 bind23 = numactl --localalloc --physcpubind=23 bind24 = numactl --localalloc --physcpubind=24 bind25 = numactl --localalloc --physcpubind=25 bind26 = numactl --localalloc --physcpubind=26 bind27 = numactl --localalloc --physcpubind=27 bind28 = numactl --localalloc --physcpubind=28 bind29 = numactl --localalloc --physcpubind=29 bind30 = numactl --localalloc --physcpubind=30 bind31 = numactl --localalloc --physcpubind=31 submit = echo "$command" > run.sh ; $BIND bash run.sh ################################################################################ ################################################################################ # fprate copy counts: fprate: copies = 32 # Bind commands for assigning affinity: bind0 = numactl --localalloc --physcpubind=0 bind1 = numactl --localalloc --physcpubind=1 bind2 = numactl --localalloc --physcpubind=2 bind3 = numactl --localalloc --physcpubind=3 bind4 = numactl --localalloc --physcpubind=4 bind5 = numactl --localalloc --physcpubind=5 bind6 = numactl --localalloc --physcpubind=6 bind7 = numactl --localalloc --physcpubind=7 bind8 = numactl --localalloc --physcpubind=8 bind9 = numactl --localalloc --physcpubind=9 bind10 = numactl --localalloc --physcpubind=10 bind11 = numactl --localalloc --physcpubind=11 bind12 = numactl --localalloc --physcpubind=12 bind13 = numactl --localalloc --physcpubind=13 bind14 = numactl --localalloc --physcpubind=14 bind15 = numactl --localalloc --physcpubind=15 bind16 = numactl --localalloc --physcpubind=16 bind17 = numactl --localalloc --physcpubind=17 bind18 = numactl --localalloc --physcpubind=18 bind19 = numactl --localalloc --physcpubind=19 bind20 = numactl --localalloc --physcpubind=20 bind21 = numactl --localalloc --physcpubind=21 bind22 = numactl --localalloc --physcpubind=22 bind23 = numactl --localalloc --physcpubind=23 bind24 = numactl --localalloc --physcpubind=24 bind25 = numactl --localalloc --physcpubind=25 bind26 = numactl --localalloc --physcpubind=26 bind27 = numactl --localalloc --physcpubind=27 bind28 = numactl --localalloc --physcpubind=28 bind29 = numactl --localalloc --physcpubind=29 bind30 = numactl --localalloc --physcpubind=30 bind31 = numactl --localalloc --physcpubind=31 submit = echo "$command" > run.sh ; $BIND bash run.sh ################################################################################ # Switch back to the default block after the include file: default: ################################################################################ ################################################################################ ################################################################################ ################################################################################ # The remainder of this file defines CPU2017 report parameters. ################################################################################ ################################################################################ ################################################################################ # SPEC CPU 2017 report header ################################################################################ license_num =9999 # (Your SPEC license number) tester =unknown tester test_sponsor =unknown sponsor hw_vendor =unknown vendor #--------- If you install new compilers, edit this section -------------------- sw_compiler =C/C++/Fortran: Version 4.0.0 of AOCC ################################################################################ ################################################################################ # Hardware, firmware and software information ################################################################################ hw_avail =Jun-2023 sw_avail =Jun-2023 hw_cpu_name =AMD EPYC 9184X hw_cpu_nominal_mhz =3550 hw_cpu_max_mhz =4200 hw_ncores =16 hw_nthreadspercore =2 hw_ncpuorder =1 chip hw_other =None # Other perf-relevant hw, or "None" fw_bios =Version 0.0.0 released Jun-2023 sw_base_ptrsize =64-bit hw_pcache =32 KB I + 32 KB D on chip per core hw_scache =1 MB I+D on chip per core hw_tcache000 =768 MB I+D on chip per chip, 96 MB shared / 2 hw_tcache001 = cores hw_ocache =None sw_other =None ################################################################################ # Notes ################################################################################ # Enter notes_000 through notes_100 here. notes_000 =Binaries were compiled on a system with 2x AMD EPYC 9174F CPU + 1.5TiB Memory using RHEL 8.6 notes_005 = notes_010 =NA: The test sponsor attests, as of date of publication, that CVE-2017-5754 (Meltdown) notes_015 =is mitigated in the system as tested and documented. notes_020 =Yes: The test sponsor attests, as of date of publication, that CVE-2017-5753 (Spectre variant 1) notes_025 =is mitigated in the system as tested and documented. notes_030 =Yes: The test sponsor attests, as of date of publication, that CVE-2017-5715 (Spectre variant 2) notes_035 =is mitigated in the system as tested and documented. notes_040 = notes_submit_000 ='numactl' was used to bind copies to the cores. notes_submit_005 =See the configuration file for details. notes_submit_010 = notes_os_000 ='ulimit -s unlimited' was used to set environment stack size limit notes_os_005 ='ulimit -l 2097152' was used to set environment locked pages in memory limit notes_os_010 = notes_os_015 =runcpu command invoked through numactl i.e.: notes_os_020 =numactl --interleave=all runcpu notes_os_025 = notes_os_030 =To limit dirty cache to 8% of memory, 'sysctl -w vm.dirty_ratio=8' run as root. notes_os_035 =To limit swap usage to minimum necessary, 'sysctl -w vm.swappiness=1' run as root. notes_os_040 =To free node-local memory and avoid remote memory usage, notes_os_045 ='sysctl -w vm.zone_reclaim_mode=1' run as root. notes_os_050 =To clear filesystem caches, 'sync; sysctl -w vm.drop_caches=3' run as root. notes_os_055 =To disable address space layout randomization (ASLR) to reduce run-to-run notes_os_060 =variability, 'sysctl -w kernel.randomize_va_space=0' run as root. notes_os_065 = notes_comp_000 =The AMD64 AOCC Compiler Suite is available at notes_comp_005 =http://developer.amd.com/amd-aocc/ notes_comp_010 = # notes_jemalloc_000 =jemalloc: configured and built with GCC v4.8.2 in RHEL 7.4 (No options specified) # notes_jemalloc_005 =jemalloc 5.1.0 is available here: # notes_jemalloc_010 =https://github.com/jemalloc/jemalloc/releases/download/5.1.0/jemalloc-5.1.0.tar.bz2 # notes_jemalloc_015 = # sw_other000 =jemalloc: jemalloc memory allocator library v5.1.0 ################################################################################ # The following note fields describe platorm settings. ################################################################################ # example: (edit and uncomment as necessary) # notes_plat_000 =BIOS settings: # notes_plat_002 = TDP: 400 # notes_plat_004 = Determinism Slider set to Power # notes_plat_006 = PPT: 400 # notes_plat_010 = NPS: 4 # notes_plat_011 = Workload Profile = CPU Intensive # notes_plat_012 = TSME = Disabled # notes_plat_014 = SEV Control = Disabled # notes_plat_015 = Fan Speed: Maximum ################################################################################ # The following are custom fields: ################################################################################ # Use custom_fields to enter lines that are not listed here. For example: # notes_plat_100 = Energy Bias set to Max Performance # new_field = Ambient temperature set to 10C ################################################################################ # The following fields must be set here for only Int benchmarks. ################################################################################ intrate: sw_peak_ptrsize =32/64-bit notes_os_thp_000 =To enable Transparent Hugepages (THP) only on request for base runs, notes_os_thp_001 ='echo madvise > /sys/kernel/mm/transparent_hugepage/enabled' run as root. notes_os_thp_002 =To enable THP for all allocations for peak runs, notes_os_thp_003 ='echo always > /sys/kernel/mm/transparent_hugepage/enabled' and notes_os_thp_004 ='echo always > /sys/kernel/mm/transparent_hugepage/defrag' run as root. notes_os_thp_005 = ################################################################################ # The following fields must be set here for FP benchmarks. ################################################################################ fprate: sw_peak_ptrsize =64-bit notes_os_thp_000 =To enable Transparent Hugepages (THP) for all allocations, notes_os_thp_005 ='echo always > /sys/kernel/mm/transparent_hugepage/enabled' and notes_os_thp_010 ='echo always > /sys/kernel/mm/transparent_hugepage/defrag' run as root. notes_os_thp_015 = ################################################################################ # The following fields must be set here or they will be overwritten by sysinfo. ################################################################################ intrate,fprate: hw_disk =unknown hw_nchips =1 prepared_by =prepared by unknown sw_file =unknown file sw_state =Run level 3 (multi-user) ################################################################################ # End of inc file ################################################################################ # Switch back to the default block after the include file: default: # ---- End inclusion of '/mnt/ramdisk/cpu2017-1.1.9-aocc400-znver4-A1/config/amd_rate_aocc400_znver4_A1.inc' # Switch back to default block after the include file: default: fail_build = 0 # FIX THIS SO THAT CHECKSUMS WILL BE ENFORCED! %elif '%{allow_build}' eq 'true' # If you intend to rebuild, be sure to set the library paths either in the # build script or here: preENV_LIBRARY_PATH = $[top]/%{build_lib_dir}/lib:$[top]/%{build_lib_dir}/lib32:%{ENV_LIBRARY_PATH} % define build_ncpus 64 # controls number of simultaneous compiles fail_build = 0 makeflags = --jobs=%{build_ncpus} --load-average=%{build_ncpus} %else % error The value of "allow_build" is %{allow_build}, but it can only be "true" or "false". This error was generated %endif ################################################################################ # Enable automated data collection per benchmark ################################################################################ # Data collection is not enabled for reportable runs. # teeout is necessary to get data collection stdout into the logs. Best # practices for the individual data collection items would be to have # them store important output in separate files. Filenames could be # constructed from $SPEC (environment), $lognum (result number from runcpu), # and benchmark name/number. #teeout = yes # Run runcpu with '-v 35' (or greater) to log lists of variables which can # be used in substitutions as below. # For CPU2006, change $label to $ext %define data-collection-parameters benchname='$name' benchnum='$num' benchmark='$benchmark' iteration=$iter size='$size' tune='$tune' label='$label' log='$log' lognum='$lognum' from_runcpu='$from_runcpu' %define data-collection-start $[top]/data-collection/data-collection start %{data-collection-parameters} %define data-collection-stop $[top]/data-collection/data-collection stop %{data-collection-parameters} monitor_specrun_wrapper = %{data-collection-start} ; $command ; %{data-collection-stop} ################################################################################ # Header settings ################################################################################ backup_config = 0 # set to 0 if you do not want backup files bench_post_setup = sync # command_add_redirect: If set, the generated ${command} will include # redirection operators (stdout, stderr), which are passed along to the shell # that executes the command. If this variable is not set, specinvoke does the # redirection. command_add_redirect = yes env_vars = yes flagsurl000 = http://www.spec.org/cpu2017/flags/aocc400-flags.xml #flagsurl02 = $[top]/%{platform_file_name} # label: User defined extension string that tags your binaries & directories: label = %{ext} line_width = 1020 log_line_width = 1020 mean_anyway = yes output_format = all reportable = yes size = test,train,ref #teeout = yes #teerunout = yes tune = base,peak ################################################################################ # Include the flags file: ################################################################################ #include: %{flags_inc_file_name} # ----- Begin inclusion of 'amd_rate_aocc400_znver4_A1_flags.inc' ############################################################################ ################################################################################ # AMD AOCC 4.0.0 SPEC CPU2017 V1.1.9 Rate Configuration Flags for AMD64 Linux ################################################################################ # Compilers ################################################################################ default: CC = clang -m64 CXX = clang++ -m64 FC = flang -m64 CLD = clang -m64 CXXLD = clang++ -m64 FLD = flang -m64 CC_VERSION_OPTION = --version CXX_VERSION_OPTION = --version FC_VERSION_OPTION = --version ################################################################################ # Portability Flags ################################################################################ default: # data model applies to all benchmarks EXTRA_PORTABILITY = -DSPEC_LP64 # *** Benchmark-specific portability *** # Anything other than the data model is only allowed where a need is proven. # (ordered by last 2 digits of benchmark number) 500.perlbench_r: #lang='C' PORTABILITY = -DSPEC_LINUX_X64 521.wrf_r: #lang='F,C' CPORTABILITY = -DSPEC_CASE_FLAG FPORTABILITY = -Mbyteswapio 523.xalancbmk_r: #lang='CXX' PORTABILITY = -DSPEC_LINUX 526.blender_r: #lang='CXX,C' CPORTABILITY = -funsigned-char 527.cam4_r: #lang='F,C' PORTABILITY = -DSPEC_CASE_FLAG ################################################################################ # Default libraries and variables ################################################################################ default: # Libraries: EXTRA_LIBS = -lamdalloc -lamdlibm -lm MATHLIBOPT = #clearing this variable or else SPEC will set it to -lm VECMATHLIB = -fveclib=AMDLIBM # Variables: OPT_ROOT = -march=znver4 $(VECMATHLIB) -ffast-math OPT_ROOT_BASE = -O3 $(OPT_ROOT) OPT_ROOT_PEAK = -Ofast $(OPT_ROOT) -flto #Ofast enables -ffast-math ############################################################################### # AOCC 4.0.0 workarounds that do not count as PORTABILITY ################################################################################ # The workarounds in this section would not qualify under the SPEC CPU # PORTABILITY rule. # - In peak, they can be set as needed for individual benchmarks. # - In base, individual settings are not allowed; set for whole suite. # Use EXTRA_CFLAGS, EXTRA_CXXFLAGS, and EXTRA_FFLAGS for them. # # See: # https://www.spec.org/cpu2017/Docs/runrules.html#portability # https://www.spec.org/cpu2017/Docs/runrules.html#BaseFlags ####################### # Default workarounds # ####################### default: # Allow unused compile/link arguments without triggering warnings during build: EXTRA_CFLAGS = -Wno-unused-command-line-argument EXTRA_CXXFLAGS = -Wno-unused-command-line-argument EXTRA_FFLAGS = -Wno-unused-command-line-argument LDOPTIONS = -Wno-unused-command-line-argument #################### # Base workarounds # #################### # # *** NONE *** # ############################## # Integer workarounds - base # ############################## # intrate=base: # The following is necessary for 502/602 gcc: EXTRA_LDFLAGS = -z muldefs ######################### # FP workarounds - base # ######################### # # *** NONE *** # #################### # Peak workarounds # #################### # # *** NONE *** # ############################## # Integer workarounds - peak # ############################## 502.gcc_r=peak: #lang='C' EXTRA_CFLAGS = -Wno-unused-command-line-argument \ -fgnu89-inline EXTRA_LDFLAGS = -z muldefs ##################################### # Floating Point workarounds - peak # ##################################### # # *** NONE *** # ################################################################################ # Tuning Flags ################################################################################ ##################### # Base tuning flags # ##################### default=base: COPTIMIZE = $(OPT_ROOT_BASE) \ -flto \ -fstruct-layout=7 \ -mllvm -unroll-threshold=50 \ -mllvm -inline-threshold=1000 \ -fremap-arrays \ -fstrip-mining \ -mllvm -reduce-array-computations=3 \ -zopt CXXOPTIMIZE = $(OPT_ROOT_BASE) \ -flto \ -mllvm -unroll-threshold=100\ -finline-aggressive \ -mllvm -loop-unswitch-threshold=200000 \ -mllvm -reduce-array-computations=3 \ -zopt FOPTIMIZE = $(OPT_ROOT_BASE) \ -flto \ -Kieee \ -Mrecursive \ -funroll-loops \ -mllvm -lsr-in-nested-loop \ -mllvm -reduce-array-computations=3 \ -fepilog-vectorization-of-inductions \ -zopt LDCXXFLAGS = -Wl,-mllvm -Wl,-x86-use-vzeroupper=false LDCFLAGS = -Wl,-mllvm -Wl,-ldist-scalar-expand \ -fenable-aggressive-gather LDFLAGS = -flto \ -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 \ -Wl,-mllvm -Wl,-reduce-array-computations=3 LDFFLAGS = -Wl,-mllvm -Wl,-enable-X86-prefetching # Libraries: EXTRA_LIBS = -lamdlibm -lm -lamdalloc -lflang EXTRA_FLIBS = # Don't put the AMD and mvec math libraries in MATH_LIBS because it will trigger a reporting issue # because GCC won't use them. Forcefeed all benchmarks the math libraries in EXTRA_LIBS and clear # out MATH_LIBS. MATH_LIBS = ######################## # intrate tuning flags # ######################## intrate: FOPTIMIZE = $(OPT_ROOT_BASE) \ -flto \ -fepilog-vectorization-of-inductions \ -mllvm -optimize-strided-mem-cost \ -floop-transform \ -mllvm -unroll-aggressive \ -mllvm -unroll-threshold=500 EXTRA_CXXOPTIMIZE = -fvirtual-function-elimination \ -fvisibility=hidden # LDCXXFLAGS is left empty as intrate CPP bmks have to use VZEROUPPER # instruction which is the default. LDCXXFLAGS = LDFFLAGS = -Wl,-mllvm -Wl,-inline-recursion=4 \ -Wl,-mllvm -Wl,-lsr-in-nested-loop \ -Wl,-mllvm -Wl,-enable-iv-split # Libraries: EXTRA_LIBS = -lamdlibm -lm -lflang EXTRA_CLIBS = -lamdalloc EXTRA_CXXLIBS = -lamdalloc-ext EXTRA_FLIBS = -lamdalloc ##################### # Peak tuning flags # ##################### default=peak: COPTIMIZE = $(OPT_ROOT_PEAK) \ -fstruct-layout=7 \ -mllvm -unroll-threshold=50 -fremap-arrays \ -fstrip-mining \ -mllvm -inline-threshold=1000 \ -mllvm -reduce-array-computations=3 \ -zopt CXXOPTIMIZE = $(OPT_ROOT_PEAK) \ -finline-aggressive \ -mllvm -unroll-threshold=100 \ -mllvm -reduce-array-computations=3 \ -zopt FOPTIMIZE = $(OPT_ROOT_PEAK) \ -Mrecursive \ -mllvm -reduce-array-computations=3 \ -fepilog-vectorization-of-inductions \ -zopt LDFFLAGS = -Wl,-mllvm -Wl,-enable-X86-prefetching LDFLAGS = -flto \ -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 \ -Wl,-mllvm -Wl,-reduce-array-computations=3 LDCXXFLAGS = -Wl,-mllvm -Wl,-x86-use-vzeroupper=false feedback = 0 PASS1_CFLAGS = -fprofile-instr-generate PASS2_CFLAGS = -fprofile-instr-use PASS1_FFLAGS = -fprofile-generate PASS2_FFLAGS = -fprofile-use PASS1_CXXFLAGS = -fprofile-instr-generate PASS2_CXXFLAGS = -fprofile-instr-use PASS1_LDFLAGS = -fprofile-instr-generate PASS2_LDFLAGS = -fprofile-instr-use fdo_run1 = $command ; llvm-profdata merge --output=default.profdata *.profraw # Libraries: EXTRA_LIBS = -lamdlibm -lm -lamdalloc EXTRA_FLIBS = -lflang # Benchmark specific peak tuning flags: 500.perlbench_r=peak: #lang='C' COPTIMIZE = $(OPT_ROOT_PEAK) \ -fstruct-layout=7 \ -mllvm -unroll-threshold=50 \ -fremap-arrays \ -mllvm -inline-threshold=1000 \ -mllvm -reduce-array-computations=3 \ -faggressive-loop-transform \ -fvector-transform \ -fscalar-transform feedback = 1 502.gcc_r=peak: #lang='C' EXTRA_PORTABILITY = -D_FILE_OFFSET_BITS=64 CC = clang -m32 CLD = clang -m32 -L/usr/lib32 EXTRA_LIBS = -L$[AMDALLOC_LIB32_PATH] -lamdalloc MATHLIBOPT = -lm LDFLAGS = -flto 507.cactuBSSN_r=peak: CXXOPTIMIZE = $(OPT_ROOT_PEAK) \ -mllvm -unroll-threshold=100 \ -mllvm -loop-unswitch-threshold=200000 \ -finline-aggressive \ -mllvm -reduce-array-computations=3 \ -faggressive-loop-transform \ -fvector-transform \ -fscalar-transform EXTRA_LIBS += $(EXTRA_FLIBS) #adding flang libs to cxx linker 510.parest_r=peak: LDFLAGS = -flto \ -Wl,-mllvm -Wl,-suppress-fmas 511.povray_r=peak: #lang='CXX,C' COPTIMIZE = $(OPT_ROOT_BASE) \ -flto \ -fstruct-layout=7 \ -mllvm -unroll-threshold=50 \ -mllvm -inline-threshold=1000 \ -fremap-arrays \ -mllvm -reduce-array-computations=3 \ -zopt CXXOPTIMIZE = $(OPT_ROOT_BASE) \ -flto \ -mllvm -unroll-threshold=100\ -finline-aggressive \ -mllvm -loop-unswitch-threshold=200000 \ -mllvm -reduce-array-computations=3 \ -zopt LDCXXFLAGS = -Wl,-mllvm -Wl,-x86-use-vzeroupper=false LDCFLAGS = -Wl,-mllvm -Wl,-ldist-scalar-expand \ -fenable-aggressive-gather EXTRA_LIBS = -lamdlibm -lm -lamdalloc 520.omnetpp_r=peak: #lang='CXX` EXTRA_LIBS = -lamdlibm -lm -lamdalloc-ext 523.xalancbmk_r=peak: #lang='CXX` CXX = clang++ -m32 CXXLD = clang++ -m32 -L/usr/lib32 EXTRA_CXXOPTIMIZE = -mllvm -do-block-reorder=aggressive \ -fvirtual-function-elimination \ -fvisibility=hidden LDCXXFLAGS = -Wl,-mllvm -Wl,-do-block-reorder=aggressive \ -fno-loop-reroll EXTRA_LIBS = -L$[AMDALLOC_LIB32_PATH] -lamdalloc-ext ENV_MALLOC_CONF = thp:never 525.x264_r=peak: #lang='C' basepeak = yes 527.cam4_r=peak: COPTIMIZE = $(OPT_ROOT_BASE) \ -flto \ -fstruct-layout=7 \ -mllvm -unroll-threshold=50 \ -mllvm -inline-threshold=1000 \ -fremap-arrays \ -mllvm -reduce-array-computations=3 \ -zopt FOPTIMIZE = $(OPT_ROOT_BASE) \ -Kieee \ -Mrecursive \ -funroll-loops \ -mllvm -lsr-in-nested-loop \ -mllvm -reduce-array-computations=3 \ -fepilog-vectorization-of-inductions \ -zopt LDCFLAGS = -Wl,-mllvm -Wl,-ldist-scalar-expand \ -fenable-aggressive-gather LDFFLAGS = -Wl,-mllvm -Wl,-enable-X86-prefetching 531.deepsjeng_r=peak: #lang='CXX' CXXOPTIMIZE = $(OPT_ROOT_BASE) \ -flto \ -mllvm -unroll-threshold=100\ -finline-aggressive \ -mllvm -loop-unswitch-threshold=200000 \ -mllvm -reduce-array-computations=3 \ -zopt EXTRA_CXXOPTIMIZE = -fvirtual-function-elimination \ -fvisibility=hidden LDCXXFLAGS = EXTRA_LIBS = -lamdlibm -lm EXTRA_CXXLIBS = -lamdalloc-ext 544.nab_r=peak: LDFLAGS = -flto \ -Wl,-mllvm -Wl,-ldist-scalar-expand \ -fenable-aggressive-gather 549.fotonik3d_r=peak: FOPTIMIZE = $(OPT_ROOT_PEAK) \ -Kieee \ -Mrecursive \ -mllvm -reduce-array-computations=3 \ -fepilog-vectorization-of-inductions \ -fvector-transform \ -fscalar-transform # ---- End inclusion of '/mnt/ramdisk/cpu2017-1.1.9-aocc400-znver4-A1/config/amd_rate_aocc400_znver4_A1_flags.inc' # Dell #include: Dell.inc # ----- Begin inclusion of 'Dell.inc' ############################################################################ #---------------------------- # Dell #--------------------------- # #------------------------------------------- # **** DO NOT EDIT BELOW HERE!!! #------------------------------------------- # Defined only if called from Automation %ifdef %{DL-BIOSinc} fprate,fpspeed,intrate,intspeed: power_management000 = BIOS and OS set to prefer performance power_management001 = at the cost of additional power usage. # #include: Dell-flags.inc # ----- Begin inclusion of 'Dell-flags.inc' ############################################################################ #------------------------------------------------------- # Dell platform flags (Auto) #------------------------------------------------------- default: flagsurl001=http://www.spec.org/cpu2017/flags/Dell-Platform-Flags-PowerEdge-AMD-EPYC-v1.1.xml # ---- End inclusion of '/mnt/ramdisk/cpu2017-1.1.9-aocc400-znver4-A1/config/Dell-flags.inc' #include: %{DL-BIOSinc} # ----- Begin inclusion of 'Dell-BIOS_EPYC-4.inc' ############################################################################ #------------------------------------------------------- # Dell # # AMD EPYC 4 (Genoa) #------------------------------------------------------- # # BIOS Settings # - It is rare that these settings would need to be changed. # fprate,fpspeed,intrate,intspeed: notes_plat_form_000 = notes_plat_form_005 = BIOS settings: notes_plat_form_010 = DRAM Refresh Delay : Performance notes_plat_form_015 = DIMM Self Healing on notes_plat_form_020 = Uncorrectable Memory Error : Disabled %ifdef %{DL-BIOS-LogProcD} notes_plat_form_200 = Logical Processor : Disabled %endif notes_plat_form_025 = Virtualization Technology : Disabled fprate: notes_plat_form_030 = L1 Stride Prefetcher: : Disabled fprate,intrate,intspeed: notes_plat_form_035 = NUMA Nodes per Socket : 4 fprate,fpspeed,intrate,intspeed: notes_plat_form_040 = L3 Cache as NUMA Domain : Enabled notes_plat_form_045 = notes_plat_form_050 = System Profile : Custom fpspeed,intspeed: notes_plat_form_310 = C-States : Disabled fprate,fpspeed,intrate,intspeed: notes_plat_form_055 = Memory Patrol Scrub : Disabled notes_plat_form_060 = PCI ASPM L1 Link notes_plat_form_065 = Power Management : Disabled notes_plat_form_070 = Determinism Slider : Power Determinism fpspeed,intspeed: notes_plat_form_335 = Algorithm Performance notes_plat_form_340 = Boost Disable (ApbDis) : Enabled # ---- End inclusion of '/mnt/ramdisk/cpu2017-1.1.9-aocc400-znver4-A1/config/Dell-BIOS_EPYC-4.inc' #include: Dell-LIC.inc # ----- Begin inclusion of 'Dell-LIC.inc' ############################################################################ #------------------------------------------------------- # Dell (Dell Inc.) # # SPEC Licensing Info #------------------------------------------------------- fprate,fpspeed,intrate,intspeed: hw_vendor = Dell Inc. tester = Dell Inc. test_sponsor = Dell Inc. prepared_by = Dell v4.6.1 # Old number 55 license_num = 6573 # ---- End inclusion of '/mnt/ramdisk/cpu2017-1.1.9-aocc400-znver4-A1/config/Dell-LIC.inc' #include: Dell-cleanup.inc # ----- Begin inclusion of 'Dell-cleanup.inc' ############################################################################ #------------------------------------------------------- # Dell EMC (Dell Inc.) # # AMD #------------------------------------------------------- # cleanup - sysinfo/AMD scripts fprate,fpspeed,intspeed,intrate: #sw_os000 = %undef% #sw_os001 = %undef% sw_os002 = %undef% #hw_memory000 = %undef% hw_memory001 = %undef% hw_memory002 = %undef% #hw_model000 = %undef% #hw_model001 = %undef% # ---- End inclusion of '/mnt/ramdisk/cpu2017-1.1.9-aocc400-znver4-A1/config/Dell-cleanup.inc' # #include: Dell-Autogen.inc # ----- Begin inclusion of 'Dell-Autogen.inc' ############################################################################ fprate,fpspeed,intrate,intspeed: hw_model = PowerEdge R7615 (AMD EPYC 9184X 16-Core Processor) hw_cpu_name = AMD EPYC 9184X hw_cpu_max_mhz = 4200 hw_ncpuorder = 1 chip hw_ncores = 16 hw_nthreadspercore = 2 fw_bios = Version 1.4.5 released May-2023 sw_state = Run level 3 (multi-user) sw_file = tmpfs hw_disk = 40 GB on tmpfs notes_tmpfs_000 = notes_tmpfs_005 = Benchmark run from a 40 GB ramdisk created with the cmd: "mount -t tmpfs -o size=40G tmpfs /mnt/ramdisk" sw_os000 = Ubuntu 22.04.2 LTS sw_os001 = 5.15.0-67-generic hw_cpu_nominal_mhz = 3550 hw_memory000 = 768 GB (12 x 64 GB 2Rx4 PC5-4800B-R) sw_avail = Mar-2023 hw_avail = Jul-2023 # ---- End inclusion of '/mnt/ramdisk/cpu2017-1.1.9-aocc400-znver4-A1/config/Dell-Autogen.inc' %endif # ---- End inclusion of '/mnt/ramdisk/cpu2017-1.1.9-aocc400-znver4-A1/config/Dell.inc' # The following settings were obtained by running the sysinfo_program # 'specperl $[top]/bin/sysinfo' (sysinfo:SHA:2eb381fc1a58eb8122e4a1b875c1e38b3489dac84088192aa0ec6d157b084d06) default: notes_plat_sysinfo_000 = notes_plat_sysinfo_005 = Sysinfo program /mnt/ramdisk/cpu2017-1.1.9-aocc400-znver4-A1/bin/sysinfo notes_plat_sysinfo_010 = Rev: r6732 of 2022-11-07 fe91c89b7ed5c36ae2c92cc097bec197 notes_plat_sysinfo_015 = running on amd-sut Wed Aug 23 18:52:05 2023 notes_plat_sysinfo_020 = notes_plat_sysinfo_025 = SUT (System Under Test) info as seen by some common utilities. notes_plat_sysinfo_030 = notes_plat_sysinfo_035 = ------------------------------------------------------------ notes_plat_sysinfo_040 = Table of contents notes_plat_sysinfo_045 = ------------------------------------------------------------ notes_plat_sysinfo_050 = 1. uname -a notes_plat_sysinfo_055 = 2. w notes_plat_sysinfo_060 = 3. Username notes_plat_sysinfo_065 = 4. ulimit -a notes_plat_sysinfo_070 = 5. sysinfo process ancestry notes_plat_sysinfo_075 = 6. /proc/cpuinfo notes_plat_sysinfo_080 = 7. lscpu notes_plat_sysinfo_085 = 8. numactl --hardware notes_plat_sysinfo_090 = 9. /proc/meminfo notes_plat_sysinfo_095 = 10. who -r notes_plat_sysinfo_100 = 11. Systemd service manager version: systemd 249 (249.11-0ubuntu3.6) notes_plat_sysinfo_105 = 12. Failed units, from systemctl list-units --state=failed notes_plat_sysinfo_110 = 13. Services, from systemctl list-unit-files notes_plat_sysinfo_115 = 14. Linux kernel boot-time arguments, from /proc/cmdline notes_plat_sysinfo_120 = 15. cpupower frequency-info notes_plat_sysinfo_125 = 16. tuned-adm active notes_plat_sysinfo_130 = 17. sysctl notes_plat_sysinfo_135 = 18. /sys/kernel/mm/transparent_hugepage notes_plat_sysinfo_140 = 19. /sys/kernel/mm/transparent_hugepage/khugepaged notes_plat_sysinfo_145 = 20. OS release notes_plat_sysinfo_150 = 21. Disk information notes_plat_sysinfo_155 = 22. /sys/devices/virtual/dmi/id notes_plat_sysinfo_160 = 23. dmidecode notes_plat_sysinfo_165 = 24. BIOS notes_plat_sysinfo_170 = ------------------------------------------------------------ notes_plat_sysinfo_175 = notes_plat_sysinfo_180 = ------------------------------------------------------------ notes_plat_sysinfo_185 = 1. uname -a notes_plat_sysinfo_190 = Linux amd-sut 5.15.0-67-generic #74-Ubuntu SMP Wed Feb 22 14:14:39 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux notes_plat_sysinfo_195 = notes_plat_sysinfo_200 = ------------------------------------------------------------ notes_plat_sysinfo_205 = 2. w notes_plat_sysinfo_210 = 18:52:05 up 2:34, 1 user, load average: 23.49, 29.94, 31.04 notes_plat_sysinfo_215 = USER TTY FROM LOGIN@ IDLE JCPU PCPU WHAT notes_plat_sysinfo_220 = root tty1 - 16:20 2:31m 2.48s 0.39s /bin/bash ./amd_rate_aocc400_znver4_A1.sh notes_plat_sysinfo_225 = notes_plat_sysinfo_230 = ------------------------------------------------------------ notes_plat_sysinfo_235 = 3. Username notes_plat_sysinfo_240 = From environment variable $USER: root notes_plat_sysinfo_245 = notes_plat_sysinfo_250 = ------------------------------------------------------------ notes_plat_sysinfo_255 = 4. ulimit -a notes_plat_sysinfo_260 = time(seconds) unlimited notes_plat_sysinfo_265 = file(blocks) unlimited notes_plat_sysinfo_270 = data(kbytes) unlimited notes_plat_sysinfo_275 = stack(kbytes) unlimited notes_plat_sysinfo_280 = coredump(blocks) 0 notes_plat_sysinfo_285 = memory(kbytes) unlimited notes_plat_sysinfo_290 = locked memory(kbytes) 2097152 notes_plat_sysinfo_295 = process 3093969 notes_plat_sysinfo_300 = nofiles 1024 notes_plat_sysinfo_305 = vmemory(kbytes) unlimited notes_plat_sysinfo_310 = locks unlimited notes_plat_sysinfo_315 = rtprio 0 notes_plat_sysinfo_320 = notes_plat_sysinfo_325 = ------------------------------------------------------------ notes_plat_sysinfo_330 = 5. sysinfo process ancestry notes_plat_sysinfo_335 = /sbin/init notes_plat_sysinfo_340 = /bin/login -p -- notes_plat_sysinfo_345 = -bash notes_plat_sysinfo_350 = /bin/bash ./DELL_rate.sh notes_plat_sysinfo_355 = /bin/bash ./dell-run-main.sh rate notes_plat_sysinfo_360 = /bin/bash ./dell-run-main.sh rate notes_plat_sysinfo_365 = /bin/bash ./dell-run-speccpu.sh rate --define DL-BIOSinc=Dell-BIOS_EPYC-4.inc --define DL-BIOS-adddcD=1 notes_plat_sysinfo_370 = --define DL-BIOS-VirtD=1 --define DL-VERS=v4.6.1 --output_format html,pdf,txt notes_plat_sysinfo_375 = python3 ./run_amd_rate_aocc400_znver4_A1.py notes_plat_sysinfo_380 = /bin/bash ./amd_rate_aocc400_znver4_A1.sh notes_plat_sysinfo_385 = runcpu --config amd_rate_aocc400_znver4_A1.cfg --tune all --reportable --iterations 2 --define notes_plat_sysinfo_390 = DL-BIOSinc=Dell-BIOS_EPYC-4.inc --define DL-BIOS-adddcD=1 --define DL-BIOS-VirtD=1 --define DL-VERS=v4.6.1 notes_plat_sysinfo_395 = --output_format html,pdf,txt fprate notes_plat_sysinfo_400 = runcpu --configfile amd_rate_aocc400_znver4_A1.cfg --tune all --reportable --iterations 2 --define notes_plat_sysinfo_405 = DL-BIOSinc=Dell-BIOS_EPYC-4.inc --define DL-BIOS-adddcD=1 --define DL-BIOS-VirtD=1 --define DL-VERS=v4.6.1 notes_plat_sysinfo_410 = --output_format html,pdf,txt --nopower --runmode rate --tune base:peak --size test:train:refrate fprate notes_plat_sysinfo_415 = --nopreenv --note-preenv --logfile $SPEC/tmp/CPU2017.002/templogs/preenv.fprate.002.0.log --lognum 002.0 notes_plat_sysinfo_420 = --from_runcpu 2 notes_plat_sysinfo_425 = specperl $SPEC/bin/sysinfo notes_plat_sysinfo_430 = $SPEC = /mnt/ramdisk/cpu2017-1.1.9-aocc400-znver4-A1 notes_plat_sysinfo_435 = notes_plat_sysinfo_440 = ------------------------------------------------------------ notes_plat_sysinfo_445 = 6. /proc/cpuinfo notes_plat_sysinfo_450 = model name : AMD EPYC 9184X 16-Core Processor notes_plat_sysinfo_455 = vendor_id : AuthenticAMD notes_plat_sysinfo_460 = cpu family : 25 notes_plat_sysinfo_465 = model : 17 notes_plat_sysinfo_470 = stepping : 1 notes_plat_sysinfo_475 = microcode : 0xa101135 notes_plat_sysinfo_480 = bugs : sysret_ss_attrs spectre_v1 spectre_v2 spec_store_bypass notes_plat_sysinfo_485 = TLB size : 3584 4K pages notes_plat_sysinfo_490 = cpu cores : 16 notes_plat_sysinfo_495 = siblings : 32 notes_plat_sysinfo_500 = 1 physical ids (chips) notes_plat_sysinfo_505 = 32 processors (hardware threads) notes_plat_sysinfo_510 = physical id 0: core ids 0-1,8-9,16-17,24-25,32-33,40-41,48-49,56-57 notes_plat_sysinfo_515 = physical id 0: apicids 0-3,16-19,32-35,48-51,64-67,80-83,96-99,112-115 notes_plat_sysinfo_520 = Caution: /proc/cpuinfo data regarding chips, cores, and threads is not necessarily reliable, especially for notes_plat_sysinfo_525 = virtualized systems. Use the above data carefully. notes_plat_sysinfo_530 = notes_plat_sysinfo_535 = ------------------------------------------------------------ notes_plat_sysinfo_540 = 7. lscpu notes_plat_sysinfo_545 = notes_plat_sysinfo_550 = From lscpu from util-linux 2.37.2: notes_plat_sysinfo_555 = Architecture: x86_64 notes_plat_sysinfo_560 = CPU op-mode(s): 32-bit, 64-bit notes_plat_sysinfo_565 = Address sizes: 52 bits physical, 57 bits virtual notes_plat_sysinfo_570 = Byte Order: Little Endian notes_plat_sysinfo_575 = CPU(s): 32 notes_plat_sysinfo_580 = On-line CPU(s) list: 0-31 notes_plat_sysinfo_585 = Vendor ID: AuthenticAMD notes_plat_sysinfo_590 = Model name: AMD EPYC 9184X 16-Core Processor notes_plat_sysinfo_595 = CPU family: 25 notes_plat_sysinfo_600 = Model: 17 notes_plat_sysinfo_605 = Thread(s) per core: 2 notes_plat_sysinfo_610 = Core(s) per socket: 16 notes_plat_sysinfo_615 = Socket(s): 1 notes_plat_sysinfo_620 = Stepping: 1 notes_plat_sysinfo_625 = Frequency boost: enabled notes_plat_sysinfo_630 = CPU max MHz: 4208.6909 notes_plat_sysinfo_635 = CPU min MHz: 1500.0000 notes_plat_sysinfo_640 = BogoMIPS: 7102.14 notes_plat_sysinfo_645 = Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 notes_plat_sysinfo_650 = clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm notes_plat_sysinfo_655 = constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf rapl notes_plat_sysinfo_660 = pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe notes_plat_sysinfo_665 = popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy notes_plat_sysinfo_670 = abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt tce topoext notes_plat_sysinfo_675 = perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3 cdp_l3 notes_plat_sysinfo_680 = invpcid_single hw_pstate ssbd mba ibrs ibpb stibp vmmcall fsgsbase bmi1 notes_plat_sysinfo_685 = avx2 smep bmi2 erms invpcid cqm rdt_a avx512f avx512dq rdseed adx smap notes_plat_sysinfo_690 = avx512ifma clflushopt clwb avx512cd sha_ni avx512bw avx512vl xsaveopt notes_plat_sysinfo_695 = xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local notes_plat_sysinfo_700 = avx512_bf16 clzero irperf xsaveerptr rdpru wbnoinvd amd_ppin cppc arat npt notes_plat_sysinfo_705 = lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists notes_plat_sysinfo_710 = pausefilter pfthreshold avic v_vmsave_vmload vgif v_spec_ctrl avx512vbmi notes_plat_sysinfo_715 = umip pku ospke avx512_vbmi2 gfni vaes vpclmulqdq avx512_vnni avx512_bitalg notes_plat_sysinfo_720 = avx512_vpopcntdq la57 rdpid overflow_recov succor smca fsrm flush_l1d notes_plat_sysinfo_725 = Virtualization: AMD-V notes_plat_sysinfo_730 = L1d cache: 512 KiB (16 instances) notes_plat_sysinfo_735 = L1i cache: 512 KiB (16 instances) notes_plat_sysinfo_740 = L2 cache: 16 MiB (16 instances) notes_plat_sysinfo_745 = L3 cache: 768 MiB (8 instances) notes_plat_sysinfo_750 = NUMA node(s): 4 notes_plat_sysinfo_755 = NUMA node0 CPU(s): 0-3,16-19 notes_plat_sysinfo_760 = NUMA node1 CPU(s): 4-7,20-23 notes_plat_sysinfo_765 = NUMA node2 CPU(s): 8-11,24-27 notes_plat_sysinfo_770 = NUMA node3 CPU(s): 12-15,28-31 notes_plat_sysinfo_775 = Vulnerability Itlb multihit: Not affected notes_plat_sysinfo_780 = Vulnerability L1tf: Not affected notes_plat_sysinfo_785 = Vulnerability Mds: Not affected notes_plat_sysinfo_790 = Vulnerability Meltdown: Not affected notes_plat_sysinfo_795 = Vulnerability Mmio stale data: Not affected notes_plat_sysinfo_800 = Vulnerability Retbleed: Not affected notes_plat_sysinfo_805 = Vulnerability Spec store bypass: Mitigation; Speculative Store Bypass disabled via prctl and seccomp notes_plat_sysinfo_810 = Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization notes_plat_sysinfo_815 = Vulnerability Spectre v2: Mitigation; Retpolines, IBPB conditional, IBRS_FW, STIBP always-on, RSB notes_plat_sysinfo_820 = filling, PBRSB-eIBRS Not affected notes_plat_sysinfo_825 = Vulnerability Srbds: Not affected notes_plat_sysinfo_830 = Vulnerability Tsx async abort: Not affected notes_plat_sysinfo_835 = notes_plat_sysinfo_840 = From lscpu --cache: notes_plat_sysinfo_845 = NAME ONE-SIZE ALL-SIZE WAYS TYPE LEVEL SETS PHY-LINE COHERENCY-SIZE notes_plat_sysinfo_850 = L1d 32K 512K 8 Data 1 64 1 64 notes_plat_sysinfo_855 = L1i 32K 512K 8 Instruction 1 64 1 64 notes_plat_sysinfo_860 = L2 1M 16M 8 Unified 2 2048 1 64 notes_plat_sysinfo_865 = L3 96M 768M 16 Unified 3 98304 1 64 notes_plat_sysinfo_870 = notes_plat_sysinfo_875 = ------------------------------------------------------------ notes_plat_sysinfo_880 = 8. numactl --hardware notes_plat_sysinfo_885 = NOTE: a numactl 'node' might or might not correspond to a physical chip. notes_plat_sysinfo_890 = available: 4 nodes (0-3) notes_plat_sysinfo_895 = node 0 cpus: 0-3,16-19 notes_plat_sysinfo_900 = node 0 size: 193079 MB notes_plat_sysinfo_905 = node 0 free: 192461 MB notes_plat_sysinfo_910 = node 1 cpus: 4-7,20-23 notes_plat_sysinfo_915 = node 1 size: 193497 MB notes_plat_sysinfo_920 = node 1 free: 192951 MB notes_plat_sysinfo_925 = node 2 cpus: 8-11,24-27 notes_plat_sysinfo_930 = node 2 size: 193532 MB notes_plat_sysinfo_935 = node 2 free: 192990 MB notes_plat_sysinfo_940 = node 3 cpus: 12-15,28-31 notes_plat_sysinfo_945 = node 3 size: 193496 MB notes_plat_sysinfo_950 = node 3 free: 189376 MB notes_plat_sysinfo_955 = node distances: notes_plat_sysinfo_960 = node 0 1 2 3 notes_plat_sysinfo_965 = 0: 10 12 12 12 notes_plat_sysinfo_970 = 1: 12 10 12 12 notes_plat_sysinfo_975 = 2: 12 12 10 12 notes_plat_sysinfo_980 = 3: 12 12 12 10 notes_plat_sysinfo_985 = notes_plat_sysinfo_990 = ------------------------------------------------------------ notes_plat_sysinfo_995 = 9. /proc/meminfo notes_plat_sysinfo_1000= MemTotal: 792172864 kB notes_plat_sysinfo_1005= notes_plat_sysinfo_1010= ------------------------------------------------------------ notes_plat_sysinfo_1015= 10. who -r notes_plat_sysinfo_1020= run-level 3 Aug 23 16:20 notes_plat_sysinfo_1025= notes_plat_sysinfo_1030= ------------------------------------------------------------ notes_plat_sysinfo_1035= 11. Systemd service manager version: systemd 249 (249.11-0ubuntu3.6) notes_plat_sysinfo_1040= Default Target Status notes_plat_sysinfo_1045= multi-user degraded notes_plat_sysinfo_1050= notes_plat_sysinfo_1055= ------------------------------------------------------------ notes_plat_sysinfo_1060= 12. Failed units, from systemctl list-units --state=failed notes_plat_sysinfo_1065= UNIT LOAD ACTIVE SUB DESCRIPTION notes_plat_sysinfo_1070= * systemd-networkd-wait-online.service loaded failed failed Wait for Network to be Configured notes_plat_sysinfo_1075= notes_plat_sysinfo_1080= ------------------------------------------------------------ notes_plat_sysinfo_1085= 13. Services, from systemctl list-unit-files notes_plat_sysinfo_1090= STATE UNIT FILES notes_plat_sysinfo_1095= enabled blk-availability console-setup cron dmesg e2scrub_reap finalrd getty@ gpu-manager notes_plat_sysinfo_1100= grub-common grub-initrd-fallback irqbalance keyboard-setup lm-sensors networkd-dispatcher notes_plat_sysinfo_1105= open-iscsi open-vm-tools pollinate rsyslog secureboot-db setvtrgb ssh systemd-networkd notes_plat_sysinfo_1110= systemd-pstore systemd-resolved systemd-timesyncd thermald tuned ua-reboot-cmds notes_plat_sysinfo_1115= ubuntu-advantage udisks2 vgauth wpa_supplicant notes_plat_sysinfo_1120= enabled-runtime netplan-ovs-cleanup systemd-fsck-root systemd-networkd-wait-online systemd-remount-fs notes_plat_sysinfo_1125= disabled ModemManager apparmor console-getty debug-shell iscsid lvm2-monitor lxd-agent multipathd notes_plat_sysinfo_1130= nftables rsync serial-getty@ systemd-boot-check-no-failures systemd-network-generator notes_plat_sysinfo_1135= systemd-sysext systemd-time-wait-sync ufw upower wpa_supplicant-nl80211@ notes_plat_sysinfo_1140= wpa_supplicant-wired@ wpa_supplicant@ notes_plat_sysinfo_1145= generated apport notes_plat_sysinfo_1150= indirect uuidd notes_plat_sysinfo_1155= masked NetworkManager NetworkManager-dispatcher NetworkManager-wait-online cryptdisks notes_plat_sysinfo_1160= cryptdisks-early hwclock lvm2 multipath-tools-boot rc rcS screen-cleanup sudo x11-common notes_plat_sysinfo_1165= notes_plat_sysinfo_1170= ------------------------------------------------------------ notes_plat_sysinfo_1175= 14. Linux kernel boot-time arguments, from /proc/cmdline notes_plat_sysinfo_1180= BOOT_IMAGE=/boot/vmlinuz-5.15.0-67-generic notes_plat_sysinfo_1185= root=UUID=593ab29a-c8fe-4d75-821a-b60d5c945311 notes_plat_sysinfo_1190= ro notes_plat_sysinfo_1195= notes_plat_sysinfo_1200= ------------------------------------------------------------ notes_plat_sysinfo_1205= 15. cpupower frequency-info notes_plat_sysinfo_1210= analyzing CPU 0: notes_plat_sysinfo_1215= current policy: frequency should be within 1.50 GHz and 3.55 GHz. notes_plat_sysinfo_1220= The governor "performance" may decide which speed to use notes_plat_sysinfo_1225= within this range. notes_plat_sysinfo_1230= boost state support: notes_plat_sysinfo_1235= Supported: yes notes_plat_sysinfo_1240= Active: yes notes_plat_sysinfo_1245= Boost States: 0 notes_plat_sysinfo_1250= Total States: 3 notes_plat_sysinfo_1255= Pstate-P0: 3550MHz notes_plat_sysinfo_1260= notes_plat_sysinfo_1265= ------------------------------------------------------------ notes_plat_sysinfo_1270= 16. tuned-adm active notes_plat_sysinfo_1275= Current active profile: latency-performance notes_plat_sysinfo_1280= notes_plat_sysinfo_1285= ------------------------------------------------------------ notes_plat_sysinfo_1290= 17. sysctl notes_plat_sysinfo_1295= kernel.numa_balancing 1 notes_plat_sysinfo_1300= kernel.randomize_va_space 0 notes_plat_sysinfo_1305= vm.compaction_proactiveness 20 notes_plat_sysinfo_1310= vm.dirty_background_bytes 0 notes_plat_sysinfo_1315= vm.dirty_background_ratio 3 notes_plat_sysinfo_1320= vm.dirty_bytes 0 notes_plat_sysinfo_1325= vm.dirty_expire_centisecs 3000 notes_plat_sysinfo_1330= vm.dirty_ratio 8 notes_plat_sysinfo_1335= vm.dirty_writeback_centisecs 500 notes_plat_sysinfo_1340= vm.dirtytime_expire_seconds 43200 notes_plat_sysinfo_1345= vm.extfrag_threshold 500 notes_plat_sysinfo_1350= vm.min_unmapped_ratio 1 notes_plat_sysinfo_1355= vm.nr_hugepages 0 notes_plat_sysinfo_1360= vm.nr_hugepages_mempolicy 0 notes_plat_sysinfo_1365= vm.nr_overcommit_hugepages 0 notes_plat_sysinfo_1370= vm.swappiness 1 notes_plat_sysinfo_1375= vm.watermark_boost_factor 15000 notes_plat_sysinfo_1380= vm.watermark_scale_factor 10 notes_plat_sysinfo_1385= vm.zone_reclaim_mode 1 notes_plat_sysinfo_1390= notes_plat_sysinfo_1395= ------------------------------------------------------------ notes_plat_sysinfo_1400= 18. /sys/kernel/mm/transparent_hugepage notes_plat_sysinfo_1405= defrag [always] defer defer+madvise madvise never notes_plat_sysinfo_1410= enabled [always] madvise never notes_plat_sysinfo_1415= hpage_pmd_size 2097152 notes_plat_sysinfo_1420= shmem_enabled always within_size advise [never] deny force notes_plat_sysinfo_1425= notes_plat_sysinfo_1430= ------------------------------------------------------------ notes_plat_sysinfo_1435= 19. /sys/kernel/mm/transparent_hugepage/khugepaged notes_plat_sysinfo_1440= alloc_sleep_millisecs 60000 notes_plat_sysinfo_1445= defrag 1 notes_plat_sysinfo_1450= max_ptes_none 511 notes_plat_sysinfo_1455= max_ptes_shared 256 notes_plat_sysinfo_1460= max_ptes_swap 64 notes_plat_sysinfo_1465= pages_to_scan 4096 notes_plat_sysinfo_1470= scan_sleep_millisecs 10000 notes_plat_sysinfo_1475= notes_plat_sysinfo_1480= ------------------------------------------------------------ notes_plat_sysinfo_1485= 20. OS release notes_plat_sysinfo_1490= From /etc/*-release /etc/*-version notes_plat_sysinfo_1495= os-release Ubuntu 22.04.2 LTS notes_plat_sysinfo_1500= notes_plat_sysinfo_1505= ------------------------------------------------------------ notes_plat_sysinfo_1510= 21. Disk information notes_plat_sysinfo_1515= SPEC is set to: /mnt/ramdisk/cpu2017-1.1.9-aocc400-znver4-A1 notes_plat_sysinfo_1520= Filesystem Type Size Used Avail Use% Mounted on notes_plat_sysinfo_1525= tmpfs tmpfs 40G 3.5G 37G 9% /mnt/ramdisk notes_plat_sysinfo_1530= notes_plat_sysinfo_1535= ------------------------------------------------------------ notes_plat_sysinfo_1540= 22. /sys/devices/virtual/dmi/id notes_plat_sysinfo_1545= Vendor: Dell Inc. notes_plat_sysinfo_1550= Product: PowerEdge R7615 notes_plat_sysinfo_1555= Product Family: PowerEdge notes_plat_sysinfo_1560= Serial: RDB5009 notes_plat_sysinfo_1565= notes_plat_sysinfo_1570= ------------------------------------------------------------ notes_plat_sysinfo_1575= 23. dmidecode notes_plat_sysinfo_1580= Additional information from dmidecode 3.3 follows. WARNING: Use caution when you interpret this section. notes_plat_sysinfo_1585= The 'dmidecode' program reads system data which is "intended to allow hardware to be accurately notes_plat_sysinfo_1590= determined", but the intent may not be met, as there are frequent changes to hardware, firmware, and the notes_plat_sysinfo_1595= "DMTF SMBIOS" standard. notes_plat_sysinfo_1600= Memory: notes_plat_sysinfo_1605= 12x 80AD000080AD HMCG94MEBRA109N 64 GB 2 rank 4800 notes_plat_sysinfo_1610= notes_plat_sysinfo_1615= notes_plat_sysinfo_1620= ------------------------------------------------------------ notes_plat_sysinfo_1625= 24. BIOS notes_plat_sysinfo_1630= (This section combines info from /sys/devices and dmidecode.) notes_plat_sysinfo_1635= BIOS Vendor: Dell Inc. notes_plat_sysinfo_1640= BIOS Version: 1.4.5 notes_plat_sysinfo_1645= BIOS Date: 05/29/2023 notes_plat_sysinfo_1650= BIOS Revision: 1.4 hw_cpu_name = AMD EPYC 9184X hw_disk = 40 GB add more disk info here hw_nchips = 1 hw_ncores = 16 hw_nthreadspercore = 2 prepared_by = root (is never output, only tags rawfile) sw_file = tmpfs sw_os001 = Ubuntu 22.04.2 LTS sw_state = Run level 3 (add definition here) # End of settings added by sysinfo_program 538.imagick_r: # The following setting was inserted automatically as a result of # post-run basepeak application. basepeak = 1 527.cam4_r: # The following setting was inserted automatically as a result of # post-run basepeak application. basepeak = 1 519.lbm_r: # The following setting was inserted automatically as a result of # post-run basepeak application. basepeak = 1 511.povray_r: # The following setting was inserted automatically as a result of # post-run basepeak application. basepeak = 1 508.namd_r: # The following setting was inserted automatically as a result of # post-run basepeak application. basepeak = 1 507.cactuBSSN_r: # The following setting was inserted automatically as a result of # post-run basepeak application. basepeak = 1