Hewlett Packard Enterprise ProLiant DL385 Gen11 723999 SPECjbb2015-MultiJVM max-jOPS
705682 SPECjbb2015-MultiJVM critical-jOPS
Tested by: Hewlett Packard Enterprise Test Sponsor: Hewlett Packard Enterprise Test location: Houston, TX Test date: August 27, 2024
SPEC license #: 3 Hardware Availability: Jan-2025 Software Availability: Jul-2024 Publication: Thu Oct 10 13:03:35 EDT 2024
Benchmark Results Summary
 
Overall Throughput RT curve
Overall SUT (System Under Test) Description
VendorHewlett Packard Enterprise
Vendor URLhttp://www.hpe.com/
System SourceSingle Supplier
System DesignationServer Rack
Total Systems1
All SUT Systems IdenticalYES
Total Nodes1
All Nodes IdenticalYES
Nodes Per System1
Total Chips2
Total Cores192
Total Threads384
Total Memory Amount (GB)1536
Total OS Images1
SW EnvironmentNon-virtual
 
Hardware hw_1
NameProLiant DL385 Gen11
VendorHewlett Packard Enterprise
Vendor URLhttp://hpe.com/
AvailableJan-2025
ModelProLiant DL385 Gen11
Form Factor2U Rack
CPU NameAMD EPYC 9655
CPU Characteristics96 Core, 2.6 GHz, 384 MB L3 Cache(Max. Boost Clock Up to 4.5 GHz)
Number of Systems1
Nodes Per System1
Chips Per System2
Cores Per System192
Cores Per Chip96
Threads Per System384
Threads Per Core2
VersionA55 v2.10 08/22/2024
CPU Frequency (MHz)2600
Primary Cache32 KB I + 48 KB D on chip per core
Secondary Cache1024 KB I+D on chip per core
Tertiary Cache384 MB (I+D) on chip per chip
Other CacheNone
Disk1 x 480 GB NVMe SSD
File Systembtrfs
Memory Amount (GB)1536
# and size of DIMM(s)24 x 64 GB
Memory Details64GB 2Rx4 DDR5 6400 MHz, running at 6000 MHz
# and type of Network Interface Cards (NICs)HPE Ethernet 10Gb 2-port
Power Supply Quantity and Rating (W)2 x 1600
Other HardwareNone
Cabinet/Housing/EnclosureNone
Shared DescriptionNone
Shared CommentNone
Notes
  • NA: The test sponsor attests, as of date of publication, that CVE-2017-5754 (Meltdown) is mitigated in the system as tested and documented
  • Yes: The test sponsor attests, as of date of publication, that CVE-2017-5753 (Spectre variant 1) is mitigated in the system as tested and documented
  • Yes: The test sponsor attests, as of date of publication, that CVE-2017-5715 (Spectre variant 2) is mitigated in the system as tested and documented
  • Cabled the 4th xGMI (xGMI3)
Other Hardware network_1
NameNone
VendorNone
Vendor URLNone
VersionNone
AvailableNone
BitnessNone
NotesNone
Operating System os_1
NameSUSE Linux Enterprise Server 15 SP6
VendorSUSE
Vendor URLhttp://suse.com/
Version6.4.0-150600.21-default
AvailableMay-2024
Bitness64
NotesNone
Java Virtual Machine jvm_1
NameOracle Java SE 22.0.2
VendorOracle
Vendor URLhttp://www.oracle.com/
VersionJava HotSpot 64-bit Server VM, version 22.0.2
AvailableJul-2024
Bitness64
NotesNone
Other Software other_1
NameNone
VendorNone
Vendor URLNone
VersionNone
AvailableNone
BitnessNone
NotesNone
Hardware
OS Images os_Image_1(1)
Hardware Description hw_1
Number of Systems 1
SW Environment Non-virtual
Tuning
  • Workload Profile=High Performance Compute(HPC)
  • Thermal Configuration=Enhanced Cooling
  • Determinism Control=Manual
  • Performance Determinism=Power Deterministic
  • Memory Patrol Scrubbing=Disabled
  • Numa Memory Domains Per Socket(NPS)=Two Memory Domains Per Socket
  • L1 Stream HW Prefetcher=Enabled
  • L2 Stream HW Prefetcher=Enabled
  • Minimum Processor Idle Power Core C-State=No C-states
  • xGMI Link Bandwidth=32Gbps
  • Package Power Limit=400
Notes None
OS Image os_Image_1
JVM Instances jvm_Ctr_1(1), jvm_Backend_1(12), jvm_TxInjector_1(12)
OS Image Description os_1
Tuning

  • tuned-adm profile throughput-performance
  • ulimit -n 1024000

  • echo 950000 > /proc/sys/kernel/sched_rt_runtime_us
  • echo 150000000 > /proc/sys/kernel/sched_latency_ns
  • echo 80000 > /proc/sys/kernel/sched_migration_cost_ns
  • echo 350000 > /proc/sys/kernel/sched_min_granularity_ns
  • echo 300000 > /proc/sys/kernel/sched_wakeup_granularity_ns
  • echo 32 > /proc/sys/kernel/sched_nr_migrate
  • echo 10000 > /proc/sys/vm/dirty_expire_centisecs
  • echo 1500 > /proc/sys/vm/dirty_writeback_centisecs
  • echo 40 > /proc/sys/vm/dirty_ratio
  • echo 10 > /proc/sys/vm/dirty_background_ratio
  • echo 10 > /proc/sys/vm/swappiness
  • echo 0 > /proc/sys/kernel/numa_balancing
  • echo 0 > /proc/sys/vm/numa_stat
  • echo always > /sys/kernel/mm/transparent_hugepage/enabled
  • echo always > /sys/kernel/mm/transparent_hugepage/defrag

Notes None
JVM Instance jvm_Ctr_1
Parts of Benchmark Controller
JVM Instance Description jvm_1
Command Line

-Xms2g -Xmx2g -Xmn1536m -XX:+UseParallelGC -XX:ParallelGCThreads=2 -XX:CICompilerCount=4

Tuning

Used numactl to interleave memory on all CPUs

  • numactl --interleave=all
Notes None
JVM Instance jvm_Backend_1
Parts of Benchmark Backend
JVM Instance Description jvm_1
Command Line

-Xms120g -Xmx120g -Xmn118g -server -XX:MetaspaceSize=256m -XX:AllocatePrefetchInstr=2 -XX:LargePageSizeInBytes=2m -XX:-UsePerfData -XX:-UseAdaptiveSizePolicy -XX:+AlwaysPreTouch -XX:+UseLargePages -XX:+UseParallelGC -XX:SurvivorRatio=100 -XX:TargetSurvivorRatio=90 -XX:ParallelGCThreads=32 -XX:MaxTenuringThreshold=15 -XX:InitialCodeCacheSize=25m -XX:InlineSmallCode=10k -XX:MaxGCPauseMillis=200 -XX:+UseCompressedOops -XX:ObjectAlignmentInBytes=32 -XX:+UseTransparentHugePages -XX:TLABAllocationWeight=55 -XX:ThreadStackSize=512 -XX:CompileThresholdScaling=120

Tuning

Used numactl to affinitize three Backend JVMs to a NUMA node

  • Group1: numactl --cpunodebind=0 --localalloc
  • Group2: numactl --cpunodebind=0 --localalloc
  • Group3: numactl --cpunodebind=0 --localalloc
  • Group4: numactl --cpunodebind=1 --localalloc
  • Group5: numactl --cpunodebind=1 --localalloc
  • Group6: numactl --cpunodebind=1 --localalloc
  • Group7: numactl --cpunodebind=2 --localalloc
  • Group8: numactl --cpunodebind=2 --localalloc
  • Group9: numactl --cpunodebind=2 --localalloc
  • Group10: numactl --cpunodebind=3 --localalloc
  • Group11: numactl --cpunodebind=3 --localalloc
  • Group12: numactl --cpunodebind=3 --localalloc
Notes None
JVM Instance jvm_TxInjector_1
Parts of Benchmark TxInjector
JVM Instance Description jvm_1
Command Line

-Xms2g -Xmx2g -Xmn1536m -XX:+UseParallelGC -XX:ParallelGCThreads=2 -XX:CICompilerCount=4

Tuning

Used numactl to affinitize three Transaction Injector JVMs to a NUMA node

  • Group1: numactl --cpunodebind=0 --localalloc
  • Group2: numactl --cpunodebind=0 --localalloc
  • Group3: numactl --cpunodebind=0 --localalloc
  • Group4: numactl --cpunodebind=1 --localalloc
  • Group5: numactl --cpunodebind=1 --localalloc
  • Group6: numactl --cpunodebind=1 --localalloc
  • Group7: numactl --cpunodebind=2 --localalloc
  • Group8: numactl --cpunodebind=2 --localalloc
  • Group9: numactl --cpunodebind=2 --localalloc
  • Group10: numactl --cpunodebind=3 --localalloc
  • Group11: numactl --cpunodebind=3 --localalloc
  • Group12: numactl --cpunodebind=3 --localalloc
Notes None
max-jOPS = jOPS passed before the First Failure
Pass/Fail Pass Pass Fail Fail Fail
jOPS 715481 723999 732516 741034 749551
critical-jOPS = Geomean ( jOPS @ 10000; 25000; 50000; 75000; 100000; SLAs )
Response time percentile is 99-th
SLA (us) 10000 25000 50000 75000 100000 Geomean
jOPS 660116 702704 719740 723999 723999 705682
  Percentile
  10-th 50-th 90-th 95-th 99-th 100-th
500us 136282 / 144800 25553 / 34071 8518 / 17035 - / 8518 - / 8518 - / 8518
1000us 689928 / 698446 468470 / 476987 59623 / 68141 51106 / 59623 25553 / 34071 - / 8518
5000us 723999 / - 723999 / - 672893 / 681410 647340 / 655857 553646 / 562164 76659 / 34071
10000us 723999 / - 723999 / - 723999 / - 723999 / - 655857 / 664375 76659 / 34071
25000us 723999 / - 723999 / - 723999 / - 723999 / - 698446 / 706963 76659 / 34071
50000us 723999 / - 723999 / - 723999 / - 723999 / - 715481 / 723999 323670 / 34071
75000us 723999 / - 723999 / - 723999 / - 723999 / - 723999 / - 494023 / 136282
100000us 723999 / - 723999 / - 723999 / - 723999 / - 723999 / - 596234 / 519575
200000us 723999 / - 723999 / - 723999 / - 723999 / - 723999 / - 715481 / 689928
500000us 723999 / - 723999 / - 723999 / - 723999 / - 723999 / - 723999 / -
1000000us 723999 / - 723999 / - 723999 / - 723999 / - 723999 / - 723999 / -
Probes jOPS / Total jOPS
Request Mix Accuracy
Note
(Actual % in the Mix - Expected % in the Mix) must be within:
'Main Tx' limit of +/-5.0% for the requests whose expected % in the mix is >= 10.0%
'Minor Tx' limit of +/-1.0% for the requests whose expected % in the mix is < 10.0%
There were no non-critical failures in Response Time curve building
Delay between status pings
IR/PR Accuracy
This section lists properties only set by user
Property Name Default Controller Group1.Backend.beJVM Group1.TxInjector.txiJVM1 Group10.Backend.beJVM Group10.TxInjector.txiJVM1 Group11.Backend.beJVM Group11.TxInjector.txiJVM1 Group12.Backend.beJVM Group12.TxInjector.txiJVM1 Group2.Backend.beJVM Group2.TxInjector.txiJVM1 Group3.Backend.beJVM Group3.TxInjector.txiJVM1 Group4.Backend.beJVM Group4.TxInjector.txiJVM1 Group5.Backend.beJVM Group5.TxInjector.txiJVM1 Group6.Backend.beJVM Group6.TxInjector.txiJVM1 Group7.Backend.beJVM Group7.TxInjector.txiJVM1 Group8.Backend.beJVM Group8.TxInjector.txiJVM1 Group9.Backend.beJVM Group9.TxInjector.txiJVM1
specjbb.comm.connect.client.pool.size 256 256 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120
specjbb.comm.connect.selector.runner.count 0 8 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4
specjbb.comm.connect.worker.pool.max 256 256 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120
specjbb.comm.connect.worker.pool.min 1 1 24 24 24 24 24 24 24 24 24 24 24 24 24 24 24 24 24 24 24 24 24 24 24 24
specjbb.controller.handshake.period 5000 20000
specjbb.controller.handshake.timeout 600000 900000
specjbb.forkjoin.workers 384 {Tier1=130, Tier2=1, Tier3=8}
specjbb.group.count 1 12
specjbb.heartbeat.period 10000 2000
specjbb.heartbeat.threshold 100000 9000000
specjbb.mapreducer.pool.size 384 16
specjbb.txi.pergroup.count 1 1
View table in csv format
 
Level: COMPLIANCE
Check Agent Result
Check properties on compliance All PASSED
 
Level: CORRECTNESS
Check Agent Result
Compare SM and HQ Inventory All PASSED
High-bound (max attempted) is 851763 IR
High-bound (settled) is 789500 IR