Greeting,
There is no primary kpi change in this test, below is the data collected through multiple
monitors running background just for your information.
commit: b613bc3d8b6ebb6543b1b59b7e67b3e5aa398f63 ("sched,rcu: Make cond_resched()
provide RCU quiescent state")
https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git master
in testcase: will-it-scale
on test machine: 72 threads Intel(R) Xeon(R) CPU E5-2699 v3 @ 2.30GHz with 128G memory
with following parameters:
nr_task: 100%
mode: thread
test: open1
cpufreq_governor: performance
test-description: Will It Scale takes a testcase and runs it from 1 through to n parallel
copies to see if the testcase will scale. It builds both a process and threads based test
in order to see any differences between the two.
test-url:
https://github.com/antonblanchard/will-it-scale
Details are as below:
-------------------------------------------------------------------------------------------------->
To reproduce:
git clone
https://github.com/intel/lkp-tests.git
cd lkp-tests
bin/lkp install job.yaml # job file is attached in this email
bin/lkp run job.yaml
testcase/path_params/tbox_group/run:
will-it-scale/100%-thread-open1-performance/lkp-hsw-ep4
v4.14-rc1 b613bc3d8b6ebb6543b1b59b7e
---------------- --------------------------
182722 89% 345505
will-it-scale.time.involuntary_context_switches
49.20 31% 64.54 will-it-scale.time.user_time
7185 -6% 6734
will-it-scale.time.percent_of_cpu_this_job_got
21614 -6% 20230 will-it-scale.time.system_time
1849 3% 1906 boot-time.idle
78140 76093 vmstat.system.in
24312 327% 103892 ± 13% perf-stat.instructions-per-iTLB-miss
0.00 59% 0.01 ± 40% perf-stat.dTLB-load-miss-rate%
1.251e+08 58% 1.981e+08 ± 41% perf-stat.dTLB-load-misses
11191 34% 14956 perf-stat.cpu-migrations
3.95e+11 10% 4.359e+11 ± 3% perf-stat.dTLB-stores
57.54 8% 62.04 perf-stat.node-store-miss-rate%
0.08 7% 0.09 ± 7% perf-stat.branch-miss-rate%
84.74 4% 88.56 perf-stat.node-load-miss-rate%
4.30 4.37 perf-stat.cpi
0.23 0.23 perf-stat.ipc
98.64 -11% 87.78 ± 10% perf-stat.iTLB-load-miss-rate%
1.202e+09 -15% 1.018e+09 perf-stat.node-stores
4.749e+08 ± 5% -30% 3.321e+08 ± 9% perf-stat.node-loads
5.769e+08 -76% 1.369e+08 ± 16% perf-stat.iTLB-load-misses
perf-stat.cpu-migrations
15500 +-O---------------O-------------------------------------------------+
15000 +-+ O O O O O O O O O O |
O O O O O O O O O O O O O O O O
14500 +-+ O O O O O O |
14000 +-+ O |
| |
13500 +-+ |
13000 +-+ |
12500 +-+ |
| |
12000 +-+ |
11500 +-+ |
|. .+.+. .+. .+. .+. .+.+.+. .+.+ |
11000 +-+ + + + + + |
10500 +-+-----------------------------------------------------------------+
will-it-scale.time.user_time
70 +-+--------------------------------------------------------------------+
| |
| O O |
65 O-+ O O O O O O O O O O |
| O O O O O O O O
| O O O O O O O O |
60 +-+ O O O |
| O O O |
55 +-+ |
| |
| |
50 +-+ .+.+. .+.+. |
|.+.+. .+.+.. +.+ +.+ + |
| + + |
45 +-+--------------------------------------------------------------------+
will-it-scale.time.system_time
21800 +-+-----------------------------------------------------------------+
21600 +-+.+.+.+.+.+.+.+.+.+.+.+.+.+.+.+ |
| |
21400 +-+ |
21200 +-+ |
| |
21000 +-+ |
20800 +-+ |
20600 +-+ |
| |
20400 +-+ O O O O O |
20200 O-O O O O O O O O O O O O O O O O O O O O O O O O O O O
| O |
20000 +-+ O |
19800 +-+-----------------------------------------------------------------+
will-it-scale.time.percent_of_cpu_this_job_got
7200 +-+------------------------------------------------------------------+
| |
7100 +-+ |
| |
| |
7000 +-+ |
| |
6900 +-+ |
| |
6800 +-+ |
| O O O O O O O O |
O O O O O O O O O O O O O O O O O O O O O O O O O
6700 +-+ O |
| O |
6600 +-+------------------------------------------------------------------+
will-it-scale.time.involuntary_context_switches
380000 +-+----------------------------------------------------------------+
360000 +-+ O |
O O O O O O O O O O O O O O O O O O O O O O O O O |
340000 +-+ O O O O O O O O O
320000 +-+ |
300000 +-+ |
280000 +-+ |
| |
260000 +-+ |
240000 +-+ |
220000 +-+ |
200000 +-+ |
|.+.+.+.+.+.+.+.+. .+.+. .+.+. |
180000 +-+ + +.+ + |
160000 +-+----------------------------------------------------------------+
[*] bisect-good sample
[O] bisect-bad sample
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
Thanks,
Xiaolong