Greeting,
FYI, we noticed a -2.7% regression of will-it-scale.per_thread_ops due to commit:
commit: 5a07168d8d89b00fe1760120714378175b3ef992 ("futex: Ensure that futex address
is aligned in handle_futex_death()")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
in testcase: will-it-scale
on test machine: 88 threads Intel(R) Xeon(R) CPU E5-2699 v4 @ 2.20GHz with 64G memory
with following parameters:
nr_task: 100%
mode: thread
test: futex3
cpufreq_governor: performance
ucode: 0xb00002e
test-description: Will It Scale takes a testcase and runs it from 1 through to n parallel
copies to see if the testcase will scale. It builds both a process and threads based test
in order to see any differences between the two.
test-url:
https://github.com/antonblanchard/will-it-scale
In addition to that, the commit also has significant impact on the following tests:
+------------------+----------------------------------------------------------------------+
| testcase: change | will-it-scale: will-it-scale.per_process_ops -1.6% regression
|
| test machine | 88 threads Intel(R) Xeon(R) CPU E5-2699 v4 @ 2.20GHz with 64G memory
|
| test parameters | cpufreq_governor=performance
|
| | mode=process
|
| | nr_task=100%
|
| | test=futex3
|
| | ucode=0xb00002e
|
+------------------+----------------------------------------------------------------------+
Details are as below:
-------------------------------------------------------------------------------------------------->
To reproduce:
git clone
https://github.com/intel/lkp-tests.git
cd lkp-tests
bin/lkp install job.yaml # job file is attached in this email
bin/lkp run job.yaml
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase/ucode:
gcc-7/performance/x86_64-rhel-7.6/thread/100%/debian-x86_64-2018-04-03.cgz/lkp-bdw-ep3b/futex3/will-it-scale/0xb00002e
commit:
82efcab3b9 ("workqueue: Only unregister a registered lockdep key")
5a07168d8d ("futex: Ensure that futex address is aligned in
handle_futex_death()")
82efcab3b9f3ef59 5a07168d8d89b00fe1760120714
---------------- ---------------------------
%stddev %change %stddev
\ | \
2979618 -2.7% 2898944 will-it-scale.per_thread_ops
16627 +2.4% 17021 will-it-scale.time.system_time
9856 ± 3% -4.0% 9460 will-it-scale.time.user_time
2.622e+08 -2.7% 2.551e+08 will-it-scale.workload
23771 ± 85% +95.5% 46469 ± 41% numa-meminfo.node0.Shmem
5942 ± 85% +95.6% 11624 ± 41% numa-vmstat.node0.nr_shmem
1216 +2.3% 1244 proc-vmstat.nr_page_table_pages
8525 ± 4% -7.7% 7870 ± 5% slabinfo.kmalloc-512.active_objs
8614 ± 4% -7.8% 7940 ± 6% slabinfo.kmalloc-512.num_objs
61.50 +2.4% 63.00 vmstat.cpu.sy
36.50 ± 2% -4.1% 35.00 vmstat.cpu.us
27376 ± 3% -4.8% 26064 ± 3% softirqs.CPU42.RCU
25222 ± 20% -14.4% 21590 ± 4% softirqs.CPU82.RCU
22490 ± 5% -3.9% 21617 ± 4% softirqs.CPU86.RCU
250320 ± 32% +69.7% 424902 ± 8% numa-numastat.node0.local_node
265927 ± 33% +63.3% 434388 ± 9% numa-numastat.node0.numa_hit
353210 ± 23% -49.9% 177033 ± 20% numa-numastat.node1.local_node
366101 ± 24% -46.4% 196058 ± 21% numa-numastat.node1.numa_hit
33.44 ± 6% -14.4% 28.63 ± 16% sched_debug.cfs_rq:/.load_avg.stddev
155.87 ± 24% -43.6% 87.89 ± 81%
sched_debug.cfs_rq:/.removed.runnable_sum.avg
1083 ± 13% -39.3% 658.24 ± 73%
sched_debug.cfs_rq:/.removed.runnable_sum.stddev
1.48 ± 19% -42.9% 0.84 ± 76% sched_debug.cfs_rq:/.removed.util_avg.avg
10.53 ± 8% -38.0% 6.53 ± 71%
sched_debug.cfs_rq:/.removed.util_avg.stddev
2.29 ± 7% -4.9% 2.18 ± 8%
sched_debug.cfs_rq:/.runnable_load_avg.stddev
401.30 ± 59% +98.8% 797.72 ± 4% sched_debug.cfs_rq:/.util_est_enqueued.avg
1.763e+10 -2.7% 1.715e+10 perf-stat.i.branch-instructions
2.651e+08 -2.7% 2.579e+08 perf-stat.i.branch-misses
1.97 +2.8% 2.02 perf-stat.i.cpi
3.073e+10 -2.7% 2.99e+10 perf-stat.i.dTLB-loads
2.31e+10 -2.7% 2.247e+10 perf-stat.i.dTLB-stores
1.24e+11 -2.7% 1.207e+11 perf-stat.i.instructions
0.51 -2.7% 0.49 perf-stat.i.ipc
1.97 +2.8% 2.02 perf-stat.overall.cpi
0.51 -2.7% 0.49 perf-stat.overall.ipc
1.757e+10 -2.7% 1.709e+10 perf-stat.ps.branch-instructions
2.643e+08 -2.7% 2.57e+08 perf-stat.ps.branch-misses
3.063e+10 -2.7% 2.98e+10 perf-stat.ps.dTLB-loads
2.302e+10 -2.7% 2.24e+10 perf-stat.ps.dTLB-stores
1.236e+11 -2.7% 1.202e+11 perf-stat.ps.instructions
3.732e+13 -2.6% 3.635e+13 perf-stat.total.instructions
38.08 -1.0 37.05
perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.syscall
27.53 -0.4 27.12
perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.syscall
3.28 -0.1 3.16 perf-profile.calltrace.cycles-pp.testcase
2.53 +0.1 2.66
perf-profile.calltrace.cycles-pp.get_futex_key_refs.get_futex_key.futex_wake.do_futex.__x64_sys_futex
6.02 +0.3 6.28
perf-profile.calltrace.cycles-pp.get_futex_key.futex_wake.do_futex.__x64_sys_futex.do_syscall_64
3.05 ± 2% +0.7 3.71
perf-profile.calltrace.cycles-pp.hash_futex.futex_wake.do_futex.__x64_sys_futex.do_syscall_64
13.11 +0.9 14.04
perf-profile.calltrace.cycles-pp.futex_wake.do_futex.__x64_sys_futex.do_syscall_64.entry_SYSCALL_64_after_hwframe
16.74 +1.0 17.75
perf-profile.calltrace.cycles-pp.do_futex.__x64_sys_futex.do_syscall_64.entry_SYSCALL_64_after_hwframe.syscall
29.05 +1.6 30.63
perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.syscall
30.35 +1.6 31.95
perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.syscall
22.91 +1.6 24.54
perf-profile.calltrace.cycles-pp.__x64_sys_futex.do_syscall_64.entry_SYSCALL_64_after_hwframe.syscall
33.87 -0.9 32.98
perf-profile.children.cycles-pp.entry_SYSCALL_64
31.87 -0.5 31.32
perf-profile.children.cycles-pp.syscall_return_via_sysret
2.36 -0.1 2.27 perf-profile.children.cycles-pp.testcase
98.54 +0.0 98.58 perf-profile.children.cycles-pp.syscall
2.54 +0.1 2.67
perf-profile.children.cycles-pp.get_futex_key_refs
6.22 +0.3 6.51
perf-profile.children.cycles-pp.get_futex_key
3.07 ± 2% +0.7 3.76 perf-profile.children.cycles-pp.hash_futex
13.47 +1.0 14.47 perf-profile.children.cycles-pp.futex_wake
16.91 +1.1 18.00 perf-profile.children.cycles-pp.do_futex
31.04 +1.5 32.57
perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
28.59 +1.6 30.21
perf-profile.children.cycles-pp.do_syscall_64
22.58 +1.7 24.25
perf-profile.children.cycles-pp.__x64_sys_futex
29.64 -0.8 28.88
perf-profile.self.cycles-pp.entry_SYSCALL_64
31.84 -0.5 31.30
perf-profile.self.cycles-pp.syscall_return_via_sysret
5.62 -0.2 5.41 perf-profile.self.cycles-pp.syscall
3.05 -0.1 2.92
perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
1.41 -0.1 1.35 perf-profile.self.cycles-pp.testcase
4.02 +0.1 4.09 perf-profile.self.cycles-pp.futex_wake
3.42 +0.1 3.51 perf-profile.self.cycles-pp.do_futex
2.50 +0.1 2.61
perf-profile.self.cycles-pp.get_futex_key_refs
3.51 ± 3% +0.2 3.72 ± 2% perf-profile.self.cycles-pp.get_futex_key
5.23 +0.6 5.82 perf-profile.self.cycles-pp.__x64_sys_futex
3.04 ± 2% +0.6 3.66 perf-profile.self.cycles-pp.hash_futex
1367 ± 95% -68.3% 433.67 ± 81%
interrupts.36:PCI-MSI.3145733-edge.eth0-TxRx-4
503.50 ± 39% -45.6% 274.00 ± 55%
interrupts.39:PCI-MSI.3145736-edge.eth0-TxRx-7
4918 ± 34% +60.4% 7889
interrupts.CPU14.NMI:Non-maskable_interrupts
4918 ± 34% +60.4% 7889
interrupts.CPU14.PMI:Performance_monitoring_interrupts
1367 ± 95% -68.3% 433.67 ± 81%
interrupts.CPU15.36:PCI-MSI.3145733-edge.eth0-TxRx-4
200.75 ± 39% +197.7% 597.67 ± 13%
interrupts.CPU15.RES:Rescheduling_interrupts
204.50 ± 80% +251.1% 718.00 ± 52%
interrupts.CPU16.RES:Rescheduling_interrupts
503.50 ± 39% -45.6% 274.00 ± 55%
interrupts.CPU18.39:PCI-MSI.3145736-edge.eth0-TxRx-7
4912 -22.9% 3788 ± 32%
interrupts.CPU19.CAL:Function_call_interrupts
1753 ± 3% -10.6% 1567 ± 11% interrupts.CPU21.TLB:TLB_shootdowns
4909 -21.2% 3869 ± 31%
interrupts.CPU22.CAL:Function_call_interrupts
473.25 ± 46% -83.3% 79.00 ± 37%
interrupts.CPU24.RES:Rescheduling_interrupts
698.75 ± 55% -63.9% 252.00 ±106%
interrupts.CPU26.RES:Rescheduling_interrupts
382.50 ± 53% -64.9% 134.33 ± 47%
interrupts.CPU28.RES:Rescheduling_interrupts
342.00 ± 48% +146.4% 842.67 ± 26% interrupts.CPU3.RES:Rescheduling_interrupts
572.25 ± 97% -77.6% 128.00 ± 49%
interrupts.CPU33.RES:Rescheduling_interrupts
374.75 ± 62% -79.9% 75.33 ± 50%
interrupts.CPU34.RES:Rescheduling_interrupts
1306 ± 80% -91.8% 107.67 ± 25%
interrupts.CPU35.RES:Rescheduling_interrupts
1377 ± 57% -61.6% 529.33 ±128%
interrupts.CPU38.RES:Rescheduling_interrupts
427.00 ± 67% -73.2% 114.33 ± 51%
interrupts.CPU39.RES:Rescheduling_interrupts
343.25 ±117% +794.5% 3070 ± 36% interrupts.CPU4.RES:Rescheduling_interrupts
435.25 ± 76% -86.4% 59.00 ± 47%
interrupts.CPU40.RES:Rescheduling_interrupts
15.50 ± 33% +5132.3% 811.00 ±135%
interrupts.CPU46.RES:Rescheduling_interrupts
22.50 ± 58% +6574.1% 1501 ±102%
interrupts.CPU53.RES:Rescheduling_interrupts
27.00 ± 52% +685.2% 212.00 ±102%
interrupts.CPU54.RES:Rescheduling_interrupts
15.00 ± 48% +2224.4% 348.67 ±123%
interrupts.CPU62.RES:Rescheduling_interrupts
7918 -33.4% 5276 ± 35%
interrupts.CPU70.NMI:Non-maskable_interrupts
7918 -33.4% 5276 ± 35%
interrupts.CPU70.PMI:Performance_monitoring_interrupts
82.50 ± 95% -68.1% 26.33 ± 39%
interrupts.CPU70.RES:Rescheduling_interrupts
7903 -33.5% 5258 ± 35%
interrupts.CPU71.NMI:Non-maskable_interrupts
7903 -33.5% 5258 ± 35%
interrupts.CPU71.PMI:Performance_monitoring_interrupts
287.00 ±123% -91.4% 24.67 ± 5%
interrupts.CPU71.RES:Rescheduling_interrupts
6921 ± 24% -24.1% 5256 ± 34%
interrupts.CPU74.NMI:Non-maskable_interrupts
6921 ± 24% -24.1% 5256 ± 34%
interrupts.CPU74.PMI:Performance_monitoring_interrupts
6898 ± 24% -24.1% 5238 ± 35%
interrupts.CPU75.NMI:Non-maskable_interrupts
6898 ± 24% -24.1% 5238 ± 35%
interrupts.CPU75.PMI:Performance_monitoring_interrupts
61.50 ± 72% -86.4% 8.33 ± 40%
interrupts.CPU75.RES:Rescheduling_interrupts
6919 ± 24% -24.1% 5252 ± 35%
interrupts.CPU78.NMI:Non-maskable_interrupts
6919 ± 24% -24.1% 5252 ± 35%
interrupts.CPU78.PMI:Performance_monitoring_interrupts
593.25 ± 67% +188.3% 1710 ± 38% interrupts.CPU8.RES:Rescheduling_interrupts
63.00 ± 77% -63.5% 23.00 ± 57%
interrupts.CPU82.RES:Rescheduling_interrupts
280.25 ±136% -93.3% 18.67 ± 28%
interrupts.CPU83.RES:Rescheduling_interrupts
57.00 ± 60% -83.0% 9.67 ± 48%
interrupts.CPU84.RES:Rescheduling_interrupts
will-it-scale.per_thread_ops
3.5e+06 +-+---------------------------------------------------------------+
| |
3e+06 O-++..O O O O.O..O O O.+..O..O O +..O..+..+ +..+..+.+..|
| : : : : : : : : : |
2.5e+06 +-+ : :: : : : : : : : |
| : : : : : : : : : : |
2e+06 +-+ : : : : : : : : : : |
| : : : : : : : : : : |
1.5e+06 +-+ : : : : : : : : : : |
| : : : : : : : : : : |
1e+06 +-+ : : : : : : : : : : |
| :: : : : : : : :: |
500000 +-+ :: :: :: : :: |
| : : : : : |
0 +-+O----------O---------------O----------O------------------------+
will-it-scale.workload
3e+08 +-+---------------------------------------------------------------+
| |
2.5e+08 O-++..O O O O.O..O O O.+..O..O O +..O..+..+ +..+..+.+..|
| : : : : : : : : : |
| : :: : : : : : : : |
2e+08 +-+ : : : : : : : : : : |
| : : : : : : : : : : |
1.5e+08 +-+ : : : : : : : : : : |
| : : : : : : : : : : |
1e+08 +-+ : : : : : : : : : : |
| : : : : : : : : : : |
| :: : : : : : : :: |
5e+07 +-+ :: :: :: : :: |
| : : : : : |
0 +-+O----------O---------------O----------O------------------------+
[*] bisect-good sample
[O] bisect-bad sample
***************************************************************************************************
lkp-bdw-ep3b: 88 threads Intel(R) Xeon(R) CPU E5-2699 v4 @ 2.20GHz with 64G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase/ucode:
gcc-7/performance/x86_64-rhel-7.6/process/100%/debian-x86_64-2018-04-03.cgz/lkp-bdw-ep3b/futex3/will-it-scale/0xb00002e
commit:
82efcab3b9 ("workqueue: Only unregister a registered lockdep key")
5a07168d8d ("futex: Ensure that futex address is aligned in
handle_futex_death()")
82efcab3b9f3ef59 5a07168d8d89b00fe1760120714
---------------- ---------------------------
fail:runs %reproduction fail:runs
| | |
1:4 -25% :4 dmesg.WARNING:at#for_ip_interrupt_entry/0x
1:4 -25% :4
kmsg.DHCP/BOOTP:Reply_not_for_us_on_eth#,op[#]xid[#]
%stddev %change %stddev
\ | \
2967197 -1.6% 2921017 will-it-scale.per_process_ops
2.611e+08 -1.6% 2.57e+08 will-it-scale.workload
6581105 ± 7% -10.8% 5868555 meminfo.DirectMap2M
0.00 ± 36% +0.0 0.00 ± 64% mpstat.cpu.all.soft%
61.00 +1.6% 62.00 vmstat.cpu.sy
36.00 -2.8% 35.00 vmstat.cpu.us
1355725 ± 87% -74.5% 345713 ± 10% cpuidle.C1.time
88925 ±129% -94.6% 4801 ± 25% cpuidle.C1.usage
86713 ±104% -88.5% 9950 ± 9% cpuidle.C1E.usage
86486 ±132% -95.3% 4102 ± 26% turbostat.C1
82307 ±107% -88.9% 9119 ± 12% turbostat.C1E
0.36 ± 19% +34.9% 0.49 ± 21% turbostat.CPU%c1
26465 ± 22% +68.6% 44634 ± 19% numa-vmstat.node0.nr_active_anon
26423 ± 22% +49.4% 39476 ± 16% numa-vmstat.node0.nr_anon_pages
26465 ± 22% +68.6% 44634 ± 19% numa-vmstat.node0.nr_zone_active_anon
51873 ± 13% -35.6% 33430 ± 25% numa-vmstat.node1.nr_active_anon
51873 ± 13% -35.6% 33430 ± 25% numa-vmstat.node1.nr_zone_active_anon
758.00 ± 41% +33.3% 1010 ± 29%
interrupts.32:PCI-MSI.3145729-edge.eth0-TxRx-0
758.00 ± 41% +33.3% 1010 ± 29%
interrupts.CPU11.32:PCI-MSI.3145729-edge.eth0-TxRx-0
1436 ± 13% -52.7% 678.75 ± 66%
interrupts.CPU22.RES:Rescheduling_interrupts
16.67 ±107% +810.5% 151.75 ± 86% interrupts.CPU7.RES:Rescheduling_interrupts
28.33 ± 21% +764.7% 245.00 ± 87%
interrupts.CPU80.RES:Rescheduling_interrupts
23.67 ±108% +300.4% 94.75 ± 93% interrupts.CPU9.RES:Rescheduling_interrupts
1936 ± 4% -19.1% 1567 ± 6% slabinfo.UNIX.active_objs
1936 ± 4% -19.1% 1567 ± 6% slabinfo.UNIX.num_objs
3665 ± 2% -15.6% 3094 ± 5% slabinfo.sock_inode_cache.active_objs
3665 ± 2% -15.6% 3094 ± 5% slabinfo.sock_inode_cache.num_objs
1397 ± 7% -13.0% 1215 ± 8% slabinfo.task_group.active_objs
1397 ± 7% -13.0% 1215 ± 8% slabinfo.task_group.num_objs
105945 ± 22% +68.5% 178555 ± 19% numa-meminfo.node0.Active
105854 ± 22% +68.6% 178487 ± 19% numa-meminfo.node0.Active(anon)
60862 ± 41% +87.3% 114016 ± 14% numa-meminfo.node0.AnonHugePages
105703 ± 22% +49.3% 157859 ± 16% numa-meminfo.node0.AnonPages
207473 ± 13% -35.5% 133847 ± 25% numa-meminfo.node1.Active
207428 ± 13% -35.5% 133779 ± 25% numa-meminfo.node1.Active(anon)
112468 ± 23% -47.0% 59623 ± 27% numa-meminfo.node1.AnonHugePages
22164 +12.9% 25013 ± 7% softirqs.CPU10.RCU
23983 ± 9% +8.1% 25924 ± 6% softirqs.CPU19.RCU
98574 ± 5% +7.1% 105614 ± 7% softirqs.CPU21.TIMER
25231 ± 7% +7.7% 27186 ± 6% softirqs.CPU30.RCU
28403 ± 5% +29.2% 36687 ± 10% softirqs.CPU36.RCU
27015 ± 6% +6.3% 28706 ± 4% softirqs.CPU42.RCU
22681 ± 4% +18.4% 26865 ± 19% softirqs.CPU5.RCU
21389 ± 8% +14.8% 24549 ± 5% softirqs.CPU54.RCU
98111 ± 5% +3.8% 101866 ± 4% softirqs.CPU57.TIMER
23135 ± 3% +8.6% 25116 ± 4% softirqs.CPU6.RCU
97209 ± 5% +39.9% 136028 ± 25% softirqs.CPU65.TIMER
24333 ± 3% +15.7% 28162 ± 14% softirqs.CPU7.RCU
21320 ± 7% +7.0% 22804 ± 8% softirqs.CPU78.RCU
23119 ± 3% +18.7% 27451 ± 17% softirqs.CPU9.RCU
1.741e+10 -1.3% 1.719e+10 perf-stat.i.branch-instructions
2.62e+08 -1.3% 2.586e+08 perf-stat.i.branch-misses
3.036e+10 -1.3% 2.997e+10 perf-stat.i.dTLB-loads
2.282e+10 -1.3% 2.253e+10 perf-stat.i.dTLB-stores
3.765e+08 ± 4% -9.3% 3.415e+08 perf-stat.i.iTLB-load-misses
1.225e+11 -1.3% 1.209e+11 perf-stat.i.instructions
333.48 ± 4% +6.9% 356.47 perf-stat.i.instructions-per-iTLB-miss
0.50 -1.4% 0.50 perf-stat.i.ipc
1.98 +1.6% 2.01 perf-stat.overall.cpi
326.21 ± 5% +8.6% 354.20
perf-stat.overall.instructions-per-iTLB-miss
0.51 -1.5% 0.50 perf-stat.overall.ipc
1.735e+10 -1.3% 1.713e+10 perf-stat.ps.branch-instructions
2.611e+08 -1.3% 2.577e+08 perf-stat.ps.branch-misses
3.025e+10 -1.3% 2.987e+10 perf-stat.ps.dTLB-loads
2.274e+10 -1.3% 2.245e+10 perf-stat.ps.dTLB-stores
3.753e+08 ± 4% -9.3% 3.404e+08 perf-stat.ps.iTLB-load-misses
1.221e+11 -1.3% 1.205e+11 perf-stat.ps.instructions
3.727e+13 -1.5% 3.67e+13 perf-stat.total.instructions
38.07 -0.4 37.66
perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.syscall
28.03 -0.3 27.68
perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.syscall
3.35 -0.1 3.22
perf-profile.calltrace.cycles-pp.hash_futex.futex_wake.do_futex.__x64_sys_futex.do_syscall_64
3.24 -0.1 3.14 perf-profile.calltrace.cycles-pp.testcase
97.84 +0.1 97.92 perf-profile.calltrace.cycles-pp.syscall
16.28 +0.5 16.82
perf-profile.calltrace.cycles-pp.do_futex.__x64_sys_futex.do_syscall_64.entry_SYSCALL_64_after_hwframe.syscall
28.49 +0.9 29.34
perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.syscall
29.73 +0.9 30.62
perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.syscall
22.21 +0.9 23.12
perf-profile.calltrace.cycles-pp.__x64_sys_futex.do_syscall_64.entry_SYSCALL_64_after_hwframe.syscall
32.37 -0.4 31.94
perf-profile.children.cycles-pp.syscall_return_via_sysret
33.97 -0.3 33.66
perf-profile.children.cycles-pp.entry_SYSCALL_64
3.37 -0.1 3.24 perf-profile.children.cycles-pp.hash_futex
2.33 -0.1 2.25 perf-profile.children.cycles-pp.testcase
98.56 +0.1 98.61 perf-profile.children.cycles-pp.syscall
16.39 +0.6 16.98 perf-profile.children.cycles-pp.do_futex
30.45 +0.9 31.31
perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
27.99 +0.9 28.88
perf-profile.children.cycles-pp.do_syscall_64
21.88 +1.0 22.85
perf-profile.children.cycles-pp.__x64_sys_futex
32.33 -0.4 31.90
perf-profile.self.cycles-pp.syscall_return_via_sysret
29.86 -0.2 29.63
perf-profile.self.cycles-pp.entry_SYSCALL_64
5.46 -0.1 5.33 perf-profile.self.cycles-pp.syscall
3.28 -0.1 3.18 perf-profile.self.cycles-pp.hash_futex
3.11 -0.1 3.03
perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
2.57 -0.1 2.52
perf-profile.self.cycles-pp.get_futex_key_refs
1.39 -0.1 1.33 perf-profile.self.cycles-pp.testcase
4.00 +0.1 4.10 perf-profile.self.cycles-pp.futex_wake
5.06 +0.4 5.47 perf-profile.self.cycles-pp.__x64_sys_futex
2.62 +0.6 3.19 perf-profile.self.cycles-pp.do_futex
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
Thanks,
Rong Chen