Greeting,
FYI, we noticed a 113.6% improvement of unixbench.score due to commit:
commit: 6a0b19c0f39a7a7b7fb77d3867a733136ff059a3 ("sched/fair: Consider spare
capacity in find_idlest_group()")
https://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git master
in testcase: unixbench
on test machine: 8 threads Intel(R) Core(TM) i7 CPU 870 @ 2.93GHz with 6G memory
with following parameters:
runtime: 300s
nr_task: 1
test: execl
test-description: UnixBench is the original BYTE UNIX benchmark suite aims to test
performance of Unix-like system.
test-url:
https://github.com/kdlucas/byte-unixbench
Details are as below:
-------------------------------------------------------------------------------------------------->
To reproduce:
git clone
git://git.kernel.org/pub/scm/linux/kernel/git/wfg/lkp-tests.git
cd lkp-tests
bin/lkp install job.yaml # job file is attached in this email
bin/lkp run job.yaml
testcase/path_params/tbox_group/run: unixbench/300s-1-execl/nhm-white
104cb16d9eb684f0 6a0b19c0f39a7a7b7fb77d3867
---------------- --------------------------
535 ± 3% 114% 1144 unixbench.score
232 3% 241 unixbench.time.system_time
49.07 18% 58.03 unixbench.time.user_time
31410903 ± 3% 121% 69542083 unixbench.time.minor_page_faults
1229 ± 4% -63% 450 unixbench.time.voluntary_context_switches
54152 -50% 27130 ± 7% interrupts.CAL:Function_call_interrupts
28729 ± 8% -28% 20702 ± 11% vmstat.system.cs
19.38 ± 3% -100% 0.00 turbostat.%Busy
367 ± 5% -100% 0 turbostat.Avg_MHz
0.62 -4% 0.59 perf-stat.ipc
0.04 -13% 0.03 perf-stat.iTLB-load-miss-rate%
4.41 -6% 4.16 perf-stat.branch-miss-rate%
1.22 ± 42% 621% 8.77 ± 11% perf-stat.cache-miss-rate%
775940 -99% 10741 perf-stat.cpu-migrations
1.462e+10 ± 9% 51% 2.203e+10 ± 10% perf-stat.cache-references
9990821 ± 6% -28% 7195338 ± 7% perf-stat.context-switches
2.049e+08 73% 3.545e+08 perf-stat.iTLB-load-misses
1.88e+08 ± 35% 981% 2.033e+09 ± 12% perf-stat.node-loads
2.43e+11 99% 4.835e+11 perf-stat.dTLB-loads
30440661 ± 3% 120% 66969488 perf-stat.page-faults
30440661 ± 3% 120% 66969487 perf-stat.minor-faults
1.194e+11 98% 2.358e+11 perf-stat.branch-instructions
5.794e+11 100% 1.159e+12 perf-stat.iTLB-loads
5.409e+11 ± 7% 83% 9.917e+11 ± 9% perf-stat.instructions
1.017e+11 90% 1.932e+11 perf-stat.dTLB-stores
5.265e+09 86% 9.803e+09 perf-stat.branch-misses
3.928e+08 ± 4% 83% 7.172e+08 ± 5% perf-stat.dTLB-store-misses
8.751e+11 ± 8% 91% 1.674e+12 ± 8% perf-stat.cpu-cycles
1.739e+09 ± 10% 79% 3.108e+09 ± 7% perf-stat.dTLB-load-misses
1597 ± 33% 9e+06 9009744 ±173%
latency_stats.sum.rpc_wait_bit_killable.__rpc_execute.rpc_execute.rpc_run_task.nfs4_call_sync_sequence.[nfsv4]._nfs4_proc_getattr.[nfsv4].nfs4_proc_getattr.[nfsv4].__nfs_revalidate_inode.nfs_do_access.nfs_permission.__inode_permission.inode_permission
9182 ± 95% -7e+03 2216 ± 92%
latency_stats.sum.pipe_wait.wait_for_partner.fifo_open.do_dentry_open.vfs_open.path_openat.do_filp_open.do_sys_open.SyS_open.entry_SYSCALL_64_fastpath
19466 ± 3% -2e+04 0
latency_stats.sum.stop_one_cpu.sched_exec.do_execveat_common.SyS_execve.do_syscall_64.return_from_SYSCALL_64
perf-stat.cpu-cycles
2.2e+12 ++----------------------------------------------------------------+
| O |
2e+12 ++ |
| O O O O |
1.8e+12 ++ O |
| O O O |
1.6e+12 ++ O O O O O O O
| O O O O O O |
1.4e+12 O+ O O |
| |
1.2e+12 ++ |
| |
1e+12 ++ .*.. * |
*.. .*.*..*.. .*.*..*..*..* *..*..*. .*.. .*.. + |
8e+11 ++-*----------*--------------------------*-----*--*----*--*--*----+
perf-stat.instructions
1.2e+12 ++----------------------------------------------------------------+
| O O |
1.1e+12 ++ O O O O |
| O |
1e+12 ++ O O O |
| O O O O O O O |
9e+11 O+ O O O O
| O O O |
8e+11 ++ |
| |
7e+11 ++ |
| |
6e+11 ++ .*.. * |
*.. .*.*..*.. .*.*..*..*..* *..*..*. .*.. .*.. .*.. + |
5e+11 ++-*----------*--------------------------*-----*--*----*-----*----+
perf-stat.cache-misses
4e+09 ++----------------------------------------------------------------+
| O |
3.5e+09 ++ |
3e+09 ++ |
| |
2.5e+09 ++ |
| O O O O O |
2e+09 ++ O O O O O
| O O O O |
1.5e+09 ++ O O O O O |
1e+09 O+ O O O O |
| |
5e+08 ++ |
*..*..*.*..*..*..*.*..*..*..*.*..*..*..*.*..*..*.. .*.. .*..*.* |
0 ++------------------------------------------------*----*----------+
perf-stat.branch-instructions
2.6e+11 ++-------------------------------O--------------------------------+
| O |
2.4e+11 O+ O O O O O O O O O O O O O O
2.2e+11 ++ O O O O O O |
| O O |
2e+11 ++ |
| |
1.8e+11 ++ |
| |
1.6e+11 ++ |
1.4e+11 ++ |
| |
1.2e+11 *+.*..*.*.. .*.. .*..*..*..*.*.. .*..*.*..*..*.. .*.. .*..*.* |
| *. * *. * *. |
1e+11 ++----------------------------------------------------------------+
perf-stat.branch-misses
1.2e+10 ++----------------------------------------------------------------+
| |
1.1e+10 ++ O |
| |
1e+10 ++ O O O
O O O O O O O O O O O O O O O O O O O |
9e+09 ++ O O |
| |
8e+09 ++ |
| |
7e+09 ++ |
| |
6e+09 ++ |
*..*..*.*.. .*.. .*..*..*..*.*.. .*..*.*..*..*.. .*.. .* |
5e+09 ++---------*-----*---------------*----------------*----*--*--*----+
perf-stat.dTLB-loads
5.5e+11 ++----------------------------------------------------------------+
| O |
5e+11 ++ O O O O
O O O O O O O O O O O O O O O O O O O O |
4.5e+11 ++ |
| |
4e+11 ++ |
| |
3.5e+11 ++ |
| |
3e+11 ++ |
| |
2.5e+11 *+.*..*.*.. .*.. .*..*..*..*.*.. .*..*.*..*..*.. .*.. .*..*.* |
| *. * *. * *. |
2e+11 ++----------------------------------------------------------------+
perf-stat.dTLB-stores
2.2e+11 ++----------------------------------------------------------------+
| O O |
2e+11 ++ O O O O O
O O O O O O O O O O O O O O O O O O |
1.8e+11 ++ |
| |
1.6e+11 ++ |
| |
1.4e+11 ++ |
| |
1.2e+11 ++ |
| .*.. .*.. .*.. |
1e+11 *+.*..* *..*..*.*..*..*..* *. *.*..*..*..*.*..*..*..*.* |
| |
8e+10 ++----------------------------------------------------------------+
perf-stat.dTLB-store-misses
8e+08 ++----------------------------------------------------------------+
7.5e+08 ++ O O O O |
| O O |
7e+08 ++ O O O O O O O O
6.5e+08 O+ O O O O O O O O |
| O O |
6e+08 ++ |
5.5e+08 ++ |
5e+08 ++ |
| |
4.5e+08 ++ |
4e+08 ++ .*.. .*.. .*.. .*.. .*..*.*..*..*.. *.. .*.. .* |
*..*..* *. *.*. *..* *. + *. * |
3.5e+08 ++ * |
3e+08 ++----------------------------------------------------------------+
perf-stat.iTLB-loads
1.3e+12 ++----------------------------------------------------------------+
| O |
1.2e+12 ++ O O O O O O
1.1e+12 O+ O O O O O O O O O O O O O O O |
| O O |
1e+12 ++ |
| |
9e+11 ++ |
| |
8e+11 ++ |
7e+11 ++ |
| |
6e+11 ++.*..*.*.. .*.. .*.. .*.. .*.. .*.. .*.. .*.. |
*. *..*..* *. * *. * *. *.*..*. *.* |
5e+11 ++----------------------------------------------------------------+
perf-stat.iTLB-load-misses
4e+08 ++----------------------------------------------------------------+
3.8e+08 ++ O O |
| O O O O O O |
3.6e+08 O+ O O O O O O O O O O O O O O O
3.4e+08 ++ O |
3.2e+08 ++ |
3e+08 ++ |
| |
2.8e+08 ++ |
2.6e+08 ++ |
2.4e+08 ++ |
2.2e+08 ++ |
*..*..*.*.. .*.. .*.. .*.. .*.. .*.. .*.. .*..*.* |
2e+08 ++ *. * *. * *. *.*..*..*..* *. |
1.8e+08 ++----------------------------------------------------------------+
perf-stat.node-loads
4e+09 ++-------------------------------O--------------------------------+
| |
3.5e+09 ++ |
3e+09 ++ |
| |
2.5e+09 ++ O |
| O O
2e+09 ++ O O O O O |
O O O O O O O O O O O O O |
1.5e+09 ++ O O O |
1e+09 ++ |
| |
5e+08 ++ |
*..*..*.*..*..*..*.*..*..*..*.*..*..*..*.*..*..*.. .*.. .*..*.* |
0 ++------------------------------------------------*----*----------+
perf-stat.page-faults
7e+07 ++----------------------------------------------------------------+
O O O O O O O O O O O O O O O O O O O O O O O O O
6.5e+07 ++ |
6e+07 ++ |
| |
5.5e+07 ++ |
5e+07 ++ |
| |
4.5e+07 ++ |
4e+07 ++ |
| |
3.5e+07 ++ |
3e+07 *+.*..*.*.. .*.. .*..*..*..*.*..*..*..*.*..*..*.. .*.. .*.. .* |
| *. * * *. * |
2.5e+07 ++----------------------------------------------------------------+
perf-stat.cpu-migrations
900000 ++-----------------------------------------------------------------+
| .*.. .*.. .*.. |
800000 *+.*..* *..*..*..*.*..*..*. * *..*..*.*.. .*..*..*.*..* |
700000 ++ *. |
| |
600000 ++ |
500000 ++ |
| |
400000 ++ |
300000 ++ |
| |
200000 ++ |
100000 ++ |
| |
0 O+-O--O-O--O--O--O--O-O--O--O--O--O-O--O--O--O-O--O--O--O--O-O--O--O
perf-stat.minor-faults
7e+07 ++----------------------------------------------------------------+
O O O O O O O O O O O O O O O O O O O O O O O O O
6.5e+07 ++ |
6e+07 ++ |
| |
5.5e+07 ++ |
5e+07 ++ |
| |
4.5e+07 ++ |
4e+07 ++ |
| |
3.5e+07 ++ |
3e+07 *+.*..*.*.. .*.. .*..*..*..*.*..*..*..*.*..*..*.. .*.. .*.. .* |
| *. * * *. * |
2.5e+07 ++----------------------------------------------------------------+
perf-stat.cache-miss-rate_
16 ++---------------------------------------------------------------------+
| |
14 ++ O |
12 ++ |
| |
10 ++ O O O
| O O O O O |
8 ++ O O O O O O O O O |
| O |
6 O+ O O O O O |
4 ++ |
| |
2 ++ .*..*.. .*..*.. .*..*. .* |
*..*..*..*. *..*. *. *..*..*..*..*.. .*..*..*..*. |
0 ++---------------------------------------------------*-----------------+
unixbench.score
1200 ++-------------------------------------------------------------------+
O O O O O O O O O O O O O O O O O O O O O O O O O
1100 ++ |
1000 ++ |
| |
900 ++ |
| |
800 ++ |
| |
700 ++ |
600 ++ |
| .*..*..*.. .*.. .*.. .*..*..*.. .*..*..*..*..*.. .*.. .* |
500 *+ * *. *. * *..*..* *. |
| |
400 ++-------------------------------------------------------------------+
interrupts.CAL:Function_call_interrupts
60000 ++------------------------------------------------------------------+
| .*.. .*.. *.. |
55000 *+.*..*..*. .*. .*.*. .. *.. .*.. .*..*..*.*..* |
50000 ++ *. *..*. * * *..*. |
| |
45000 ++ |
| |
40000 ++ |
| |
35000 ++ O |
30000 ++ |
| O O O O
25000 ++ O O O O O O O O O O O O O O O O O |
O O O |
20000 ++------------------------------------------------------------------+
unixbench.time.user_time
59 ++---------------------------------------------------------------------+
58 ++ O O O O O O |
O O O O O O O O O O O O O
57 ++ O O O O O O |
56 ++ |
55 ++ |
54 ++ |
| |
53 ++ |
52 ++ |
51 ++ |
50 ++ .*..*.. .* *.. *.. |
*..*. *. + + *.. .. *.. .*..*..*..*..*.. .*.. .*.. .* |
49 ++ + + * * *. *. *. |
48 ++----------------*----------------------------------------------------+
unixbench.time.system_time
244 ++--------------------------------------------------------------------+
| O |
242 ++ O O O O O O O O O O O O O
| O O O O O O |
240 O+ O O O O |
| |
238 ++ |
| |
236 ++ |
| *. |
234 ++ .*.. .. *.. *.. |
| *..*..*. .*. .*..*..*..*..*..* *.. .. * |
232 *+. .*.. .. *. *..*. * |
| *. * |
230 ++--------------------------------------------------------------------+
unixbench.time.minor_page_faults
7.5e+07 ++----------------------------------------------------------------+
7e+07 ++ O O O
O O O O O O O O O O O O O O O O O O O O O O |
6.5e+07 ++ |
6e+07 ++ |
| |
5.5e+07 ++ |
5e+07 ++ |
4.5e+07 ++ |
| |
4e+07 ++ |
3.5e+07 ++ |
*..*..*.*..*..*..*.*..*..*..*.*..*..*..*.*..*..*.. .*..*..*.. .* |
3e+07 ++ * * |
2.5e+07 ++----------------------------------------------------------------+
unixbench.time.voluntary_context_switches
1300 ++-------------------------------------------------------------*-----+
*.. .*.. .*.. .*.. .*..*.*..*..*..*..*..*..*.. .*. * |
1200 ++ *..*. *.*..*. *. *. * |
1100 ++ |
| |
1000 ++ |
900 ++ |
| |
800 ++ |
700 ++ |
| |
600 ++ |
500 ++ |
O O O O O O O O O O O O O O O O O O O O O O O O O
400 ++-------------------------------------------------------------------+
unixbench.time.involuntary_context_switches
900000 ++------------*-----*----*----------*-----*----*-------------------+
*..*..*.*..*. *. *. *..*..* *. * .*..*..*.*..* |
800000 ++ *. |
700000 ++ |
| |
600000 ++ |
500000 ++ |
| |
400000 ++ |
300000 ++ |
| |
200000 ++ |
100000 ++ O |
| O O O O O O O O O O |
0 O+-O--O----O-----O--O-O--O--O----------O-----O----O--O-------------O
[*] bisect-good sample
[O] bisect-bad sample
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
Thanks,
Xiaolong