OS Images |
os_Image_1(1)
|
Hardware Description |
hw_1
|
Number of Systems |
1
|
SW Environment |
non-virtual
|
Tuning |
BIOS Settings: - SMT Control set to Auto
- IOMMU set to Enabled
- NUMA nodes per socket set to NPS2
- Determinism Control set to Manual
- Determinism Slider set to Power
- cTDP Control set to Manual
- cTDP set to 240
- Package Power Limit Control set to Manual
- Package Power Limit set to 240
- Enforce POR set to Accept
- Overclock set to Enabled
- Memory Clock Speed set to 1467MHz
- L1 Stream HW Prefetcher set to Disable
- L2 Stream HW Prefetcher set to Disable
|
Notes |
notes
|
|
JVM Instances |
jvm_Ctr_1(1), jvm_Backend_1(8), jvm_TxInjector_1(8)
|
OS Image Description |
os_1
|
Tuning |
- cpupower -c all frequency-set -g performance
- tuned-adm profile throughput-performance
- echo 950000 > /proc/sys/kernel/sched_rt_runtime_us
- echo 100000000 > /proc/sys/kernel/sched_latency_ns
- echo 80000 > /proc/sys/kernel/sched_migration_cost_ns
- echo 600000 > /proc/sys/kernel/sched_min_granularity_ns
- echo 100000 > /proc/sys/kernel/sched_wakeup_granularity_ns
- echo 10000 > /proc/sys/vm/dirty_expire_centisecs
- echo 1500 > /proc/sys/vm/dirty_writeback_centisecs
- echo 40 > /proc/sys/vm/dirty_ratio
- echo 10 > /proc/sys/vm/dirty_background_ratio
- echo 10 > /proc/sys/vm/swappiness
- echo 0 > /proc/sys/kernel/numa_balancing
- echo always > /sys/kernel/mm/transparent_hugepage/enabled
- echo always > /sys/kernel/mm/transparent_hugepage/defrag
- Add cgroup_disable=memory,cpu,cpuacct,blkio,hugetlb,pids,cpuset,perf_event,freezer,devices,net_cls,net_prio to GRUB_CMDLINE_LINUX_DEFAULT
- ulimit -n 1024000
- UserTasksMax=970000
- DefaultTasksMax=970000
|
Notes |
None
|
Parts of Benchmark |
Controller
|
JVM Instance Description |
jvm_1
|
Command Line |
-Xms2g -Xmx2g -Xmn1536m -XX:+UseParallelOldGC -XX:ParallelGCThreads=2 -XX:CICompilerCount=4
|
Tuning |
None
|
Notes |
Used numactl to interleave memory on all CPUs
|
Parts of Benchmark |
Backend
|
JVM Instance Description |
jvm_1
|
Command Line |
-Xms118g -Xmx118g -Xmn116g -server -XX:MetaspaceSize=256m -XX:AllocatePrefetchInstr=2 -XX:LargePageSizeInBytes=2m -XX:-UsePerfData -XX:-UseAdaptiveSizePolicy -XX:+AlwaysPreTouch -XX:-UseBiasedLocking -XX:+UseLargePages -XX:+UseParallelOldGC -XX:SurvivorRatio=50 -XX:TargetSurvivorRatio=95 -XX:ParallelGCThreads=32 -XX:MaxTenuringThreshold=15 -XX:InitialCodeCacheSize=25m -XX:InlineSmallCode=10k -XX:MaxGCPauseMillis=200 -XX:+UseCompressedOops -XX:ObjectAlignmentInBytes=32 -XX:+BindGCTaskThreadsToCPUs -XX:+UseTransparentHugePages
|
Tuning |
None
|
Notes |
Used numactl to affinitize two Backend JVM (2 Groups) to one NUMA node - Group1: numactl --cpunodebind=0 --localalloc
- Group2: numactl --cpunodebind=0 --localalloc
- Group3: numactl --cpunodebind=1 --localalloc
- Group4: numactl --cpunodebind=1 --localalloc
- Group5: numactl --cpunodebind=2 --localalloc
- Group6: numactl --cpunodebind=2 --localalloc
- Group7: numactl --cpunodebind=3 --localalloc
- Group8: numactl --cpunodebind=3 --localalloc
|
Parts of Benchmark |
TxInjector
|
JVM Instance Description |
jvm_1
|
Command Line |
-Xms2g -Xmx2g -Xmn1536m -XX:+UseParallelOldGC -XX:ParallelGCThreads=2 -XX:CICompilerCount=4
|
Tuning |
None
|
Notes |
Used numactl to affinitize two Transaction Injector JVM (2 Groups) to one NUMA node - Group1: numactl --cpunodebind=0 --localalloc
- Group2: numactl --cpunodebind=0 --localalloc
- Group3: numactl --cpunodebind=1 --localalloc
- Group4: numactl --cpunodebind=1 --localalloc
- Group5: numactl --cpunodebind=2 --localalloc
- Group6: numactl --cpunodebind=2 --localalloc
- Group7: numactl --cpunodebind=3 --localalloc
- Group8: numactl --cpunodebind=3 --localalloc
|
|