Greeting,
There is no primary kpi change in this test, below is the data collected through multiple
monitors running background just for your information.
commit: d7100358d066cd7d64301a2da161390e9f4aa63f ("sched,rcu: Make cond_resched()
provide RCU quiescent state")
https://git.kernel.org/pub/scm/linux/kernel/git/paulmck/linux-rcu.git dev.2016.11.29e
in testcase: will-it-scale
on test machine: 48 threads 2 sockets Intel(R) Xeon(R) CPU E5-2697 v2 @ 2.70GHz with 64G
memory
with following parameters:
test: unlink2
cpufreq_governor: performance
test-description: Will It Scale takes a testcase and runs it from 1 through to n parallel
copies to see if the testcase will scale. It builds both a process and threads based test
in order to see any differences between the two.
test-url:
https://github.com/antonblanchard/will-it-scale
Details are as below:
-------------------------------------------------------------------------------------------------->
To reproduce:
git clone
git://git.kernel.org/pub/scm/linux/kernel/git/wfg/lkp-tests.git
cd lkp-tests
bin/lkp install job.yaml # job file is attached in this email
bin/lkp run job.yaml
testcase/path_params/tbox_group/run: will-it-scale/unlink2-performance/ivb42
0fabf6d573c7d95f d7100358d066cd7d64301a2da1
---------------- --------------------------
%stddev change %stddev
\ | \
23017 ± 4% 227% 75267
will-it-scale.time.involuntary_context_switches
8.71 -7% 8.07 turbostat.RAMWatt
2464 ± 4% 88% 4634 vmstat.system.cs
5858 ± 99% -6e+03 27 ± 29%
latency_stats.max.call_rwsem_down_write_failed_killable.vm_mmap_pgoff.SyS_mmap_pgoff.SyS_mmap.entry_SYSCALL_64_fastpath
6140 ± 99% -6e+03 294 ±138%
latency_stats.max.call_rwsem_down_read_failed.__do_page_fault.do_page_fault.page_fault
9362 ± 57% -8e+03 1024 ± 61%
latency_stats.max.pipe_read.__vfs_read.vfs_read.SyS_read.entry_SYSCALL_64_fastpath
11959 ± 3% -1e+04 24 ± 45%
latency_stats.max.call_rwsem_down_write_failed_killable.do_mprotect_pkey.SyS_mprotect.entry_SYSCALL_64_fastpath
11657 ± 98% -1e+04 105 ± 22%
latency_stats.sum.call_rwsem_down_write_failed_killable.vm_mmap_pgoff.SyS_mmap_pgoff.SyS_mmap.entry_SYSCALL_64_fastpath
16364 ± 39% -2e+04 121 ± 23%
latency_stats.sum.call_rwsem_down_write_failed_killable.do_mprotect_pkey.SyS_mprotect.entry_SYSCALL_64_fastpath
31270 ±100% -3e+04 671 ± 85%
latency_stats.sum.call_rwsem_down_read_failed.__do_page_fault.do_page_fault.page_fault
662656 ± 81% -5e+05 210805 ± 39%
latency_stats.sum.wait_on_page_bit.__migration_entry_wait.migration_entry_wait.do_swap_page.handle_mm_fault.__do_page_fault.do_page_fault.page_fault
761782 ± 4% 89% 1441090 perf-stat.context-switches
31.37 27% 39.72 perf-stat.node-store-miss-rate%
15968 24% 19846 ± 15% perf-stat.cpu-migrations
2.879e+09 ± 4% 18% 3.383e+09 ± 6% perf-stat.node-store-misses
2.162e+12 9% 2.354e+12 ± 4% perf-stat.dTLB-loads
1.176e+08 ± 3% 9% 1.278e+08 perf-stat.iTLB-load-misses
0.34 5% 0.35 perf-stat.ipc
1.773e+12 4% 1.85e+12 perf-stat.branch-instructions
8.478e+12 4% 8.82e+12 perf-stat.instructions
50.04 4% 51.87 perf-stat.iTLB-load-miss-rate%
25.82 -6% 24.18 ± 3% perf-stat.cache-miss-rate%
1.274e+10 ± 3% -7% 1.186e+10 ± 3% perf-stat.cache-misses
6.305e+09 ± 5% -19% 5.13e+09 ± 5% perf-stat.node-stores
perf-stat.context-switches
2e+06 ++----------------------------------------------------------------+
| |
1.8e+06 O+ O O O O |
| O O |
1.6e+06 ++ O O O |
| O O O O O O O |
1.4e+06 ++ O O O O |
| |
1.2e+06 ++ |
| |
1e+06 ++ .*. |
| .*.*.*.*.*..*.* *. .*. .*. .*.* *. * |
800000 *+ .* *.* * * + + *. .. + .*.*. .*. .*.|
| * * * * * * *
600000 ++----------------------------------------------------------------+
will-it-scale.time.involuntary_context_switches
100000 ++-----------------------------------------------------------------+
| O O |
90000 O+ |
80000 ++O O |
| O O O O O O O O O |
70000 ++ O O O O O O |
| O |
60000 ++ |
| |
50000 ++ |
40000 ++ .*.. * |
| .*. .*.*.*.* + + .*. |
30000 *+*.*.*.*.*..*.*.* * * *.*.* * |
| + .*.*.. .*.*. .*
20000 ++----------------------------------------------*-*------*-*-----*-+
vmstat.system.cs
6000 ++-------------------------------------------------------------------+
O O O O O |
5500 ++O O |
5000 ++ O O O |
| O O O O O O O |
4500 ++ O O O O |
| |
4000 ++ |
| |
3500 ++ |
3000 ++ .*.* * |
| .*..*.*.*.*.*.*. + + + .*. .*.* *. * |
2500 *+ .* *.* *. * + + *.. + + .*.*. .*. .*.|
| * * * * *. * *
2000 ++-------------------------------------------------------------------+
[*] bisect-good sample
[O] bisect-bad sample
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
Thanks,
Xiaolong