Greetings,
This test shows no change in the primary KPI. The data below was collected by multiple
monitors running in the background and is provided for your information only.
commit: 74cbb3898012216597c4a7200d356473b5cbde97 ("locking/mutex: Enable optimistic
spinning of woken task in wait queue")
git://anongit.freedesktop.org/drm-intel topic/core-for-CI
in testcase: fsmark
on test machine: 48 threads Intel(R) Xeon(R) CPU E5-2697 v2 @ 2.70GHz with 64G memory
with following parameters:
iterations: 1x
nr_threads: 64t
disk: 1BRD_48G
fs: f2fs
filesize: 4M
test_size: 40G
sync_method: fsyncBeforeClose
cpufreq_governor: performance
test-description: fsmark is a file system benchmark for testing synchronous write
workloads, such as mail server workloads.
test-url:
https://sourceforge.net/projects/fsmark/
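The result path printed before the comparison table looks like the parameter values above joined with hyphens, in the order listed, placed between the testcase name and the test box group. The following is a minimal sketch of that apparent naming convention, inferred only from this report; the variable names are hypothetical and this is not the lkp-tests implementation.

```python
# Parameter values copied from the report, in the order they are listed.
params = {
    "iterations": "1x",
    "nr_threads": "64t",
    "disk": "1BRD_48G",
    "fs": "f2fs",
    "filesize": "4M",
    "test_size": "40G",
    "sync_method": "fsyncBeforeClose",
    "cpufreq_governor": "performance",
}

# Assumption: path_params is the hyphen-joined list of parameter values.
path_params = "-".join(params.values())

# Assumption: the full path is testcase/path_params/tbox_group.
# "fsmark" and "ivb44" are taken from the report itself.
result_path = "/".join(["fsmark", path_params, "ivb44"])

print(result_path)
```

Reading the path this way makes it easier to tell at a glance which parameter combination a given result belongs to.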
Details are below:
-------------------------------------------------------------------------------------------------->
To reproduce:
git clone git://git.kernel.org/pub/scm/linux/kernel/git/wfg/lkp-tests.git
cd lkp-tests
bin/lkp install job.yaml # the job file is attached to this email
bin/lkp run job.yaml
testcase/path_params/tbox_group/run:
fsmark/1x-64t-1BRD_48G-f2fs-4M-40G-fsyncBeforeClose-performance/ivb44
e05c8f4658ee72c0 74cbb3898012216597c4a7200d
---------------- --------------------------
%stddev change %stddev
\ | \
7419 ± 8% 120% 16296 ± 4% fsmark.time.involuntary_context_switches
2063200 -77% 474914 fsmark.time.voluntary_context_switches
386575 25% 484879 ± 8% fsmark.app_overhead
271 -6% 256 fsmark.files_per_sec
934 ± 3% 66% 1546 fsmark.time.system_time
2450 ± 3% 56% 3834 fsmark.time.percent_of_cpu_this_job_got
35466 24% 44060 fsmark.time.minor_page_faults
62685 ± 7% 47% 92056 ± 12% interrupts.CAL:Function_call_interrupts
1031607 -5% 977505 vmstat.io.bo
103565 -77% 23649 vmstat.system.cs
1540 ± 3% 53% 2362 turbostat.Avg_MHz
51.49 ± 3% 53% 78.99 turbostat.%Busy
135 11% 150 turbostat.CorWatt
168 9% 182 turbostat.PkgWatt
7.87 -6% 7.39 turbostat.RAMWatt
348 ± 5% 7e+03 7782 ± 32%
latency_stats.max.call_rwsem_down_read_failed.f2fs_submit_merged_bio_cond.[f2fs].f2fs_write_cache_pages.[f2fs].f2fs_write_data_pages.[f2fs].do_writepages.__filemap_fdatawrite_range.filemap_write_and_wait_range.f2fs_do_sync_file.[f2fs].f2fs_sync_file.[f2fs].vfs_fsync_range.do_fsync.SyS_fsync
1240194 ± 25% 1e+07 15121836 ± 17%
latency_stats.sum.call_rwsem_down_read_failed.get_node_info.[f2fs].write_data_page.[f2fs].do_write_data_page.[f2fs].f2fs_write_data_page.[f2fs].f2fs_write_cache_pages.[f2fs].f2fs_write_data_pages.[f2fs].do_writepages.__filemap_fdatawrite_range.filemap_write_and_wait_range.f2fs_do_sync_file.[f2fs].f2fs_sync_file.[f2fs]
29432 ± 25% 1e+05 134088 ± 17%
latency_stats.sum.call_rwsem_down_write_failed.set_node_addr.[f2fs].new_node_page.[f2fs].get_dnode_of_data.[f2fs].f2fs_map_blocks.[f2fs].f2fs_preallocate_blocks.[f2fs].f2fs_file_write_iter.[f2fs].__vfs_write.vfs_write.SyS_write.entry_SYSCALL_64_fastpath
4665 ± 47% 8e+04 86855 ± 23%
latency_stats.sum.call_rwsem_down_write_failed.set_node_addr.[f2fs].f2fs_write_node_page.[f2fs].fsync_node_pages.[f2fs].f2fs_do_sync_file.[f2fs].f2fs_sync_file.[f2fs].vfs_fsync_range.do_fsync.SyS_fsync.entry_SYSCALL_64_fastpath
11898 ± 9% 5e+04 61422 ± 17%
latency_stats.sum.call_rwsem_down_write_failed.set_node_addr.[f2fs].new_node_page.[f2fs].new_inode_page.[f2fs].init_inode_metadata.[f2fs].f2fs_add_regular_entry.[f2fs].__f2fs_do_add_link.[f2fs].__f2fs_add_link.[f2fs].f2fs_create.[f2fs].path_openat.do_filp_open.do_sys_open
1258 ± 68% 5e+04 49298 ± 26%
latency_stats.sum.call_rwsem_down_write_failed.get_node_info.[f2fs].new_node_page.[f2fs].new_inode_page.[f2fs].init_inode_metadata.[f2fs].f2fs_add_regular_entry.[f2fs].__f2fs_do_add_link.[f2fs].__f2fs_add_link.[f2fs].f2fs_create.[f2fs].path_openat.do_filp_open.do_sys_open
473 ± 87% 1e+04 12720 ± 23%
latency_stats.sum.call_rwsem_down_read_failed.get_node_info.[f2fs].f2fs_write_node_page.[f2fs].fsync_node_pages.[f2fs].f2fs_do_sync_file.[f2fs].f2fs_sync_file.[f2fs].vfs_fsync_range.do_fsync.SyS_fsync.entry_SYSCALL_64_fastpath
116 ± 69% 9e+03 9609 ± 25%
latency_stats.sum.call_rwsem_down_read_failed.is_checkpointed_node.[f2fs].f2fs_do_sync_file.[f2fs].f2fs_sync_file.[f2fs].vfs_fsync_range.do_fsync.SyS_fsync.entry_SYSCALL_64_fastpath
298 ± 47% 7e+03 7496 ± 31%
latency_stats.sum.call_rwsem_down_read_failed.need_inode_block_update.[f2fs].f2fs_do_sync_file.[f2fs].f2fs_sync_file.[f2fs].vfs_fsync_range.do_fsync.SyS_fsync.entry_SYSCALL_64_fastpath
9165016 ± 76% -5e+06 3994495 ± 26%
latency_stats.sum.f2fs_sync_fs.[f2fs].f2fs_do_sync_file.[f2fs].f2fs_sync_file.[f2fs].vfs_fsync_range.do_fsync.SyS_fsync.entry_SYSCALL_64_fastpath
47073 ± 6% 208% 144866 ± 4% perf-stat.cpu-migrations
51080 ± 5% 110% 107239 ± 3% perf-stat.instructions-per-iTLB-miss
2.994e+12 ± 4% 57% 4.697e+12 perf-stat.cpu-cycles
3.036e+11 ± 3% 44% 4.362e+11 perf-stat.branch-instructions
3.139e+11 ± 6% 43% 4.498e+11 perf-stat.dTLB-loads
1.145e+12 ± 3% 40% 1.602e+12 perf-stat.instructions
127842 12% 143318 perf-stat.minor-faults
127867 12% 143338 perf-stat.page-faults
1.304e+09 1.281e+09 perf-stat.node-stores
6.531e+08 6.338e+08 perf-stat.node-load-misses
4.765e+09 -3% 4.609e+09 perf-stat.cache-references
49.05 -3% 47.40 perf-stat.cache-miss-rate%
9.865e+08 -4% 9.51e+08 perf-stat.node-loads
39.26 -6% 36.71 perf-stat.iTLB-load-miss-rate%
2.337e+09 -7% 2.184e+09 perf-stat.cache-misses
0.38 -11% 0.34 perf-stat.ipc
34728336 -26% 25779325 perf-stat.iTLB-loads
0.61 ± 3% -31% 0.42 perf-stat.dTLB-load-miss-rate%
0.15 ± 5% -32% 0.10 perf-stat.branch-miss-rate%
22445130 -33% 14955093 ± 3% perf-stat.iTLB-load-misses
4209598 -76% 1007929 perf-stat.context-switches
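The change column in the table above is consistent with the relative difference between the two commit means, rounded to a whole percent. The sketch below recomputes it for a few rows. The helper name `percent_change` is hypothetical, and because the printed means are themselves rounded, a recomputed value may occasionally differ from the report by one point.

```python
def percent_change(base: float, new: float) -> int:
    """Relative change from base to new, rounded to a whole percent."""
    return round((new - base) / base * 100)


# (base commit mean, tested commit mean), copied from the table above.
rows = {
    "fsmark.time.involuntary_context_switches": (7419, 16296),
    "fsmark.time.voluntary_context_switches": (2063200, 474914),
    "fsmark.files_per_sec": (271, 256),
    "vmstat.system.cs": (103565, 23649),
}

for name, (base, new) in rows.items():
    print(f"{percent_change(base, new):+5d}%  {name}")
```

Rows whose change is given in scientific notation (the latency_stats entries) report an absolute difference rather than a percentage, so this helper does not apply to them.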
perf-stat.context-switches
[ASCII plot omitted: y-axis from 500000 to 4.5e+06; bisect-good samples lie mostly between 4e+06 and 4.5e+06 with one dip toward 3e+06, bisect-bad samples lie near 1e+06]
perf-stat.branch-miss-rate%
[ASCII plot omitted: y-axis from 0.09 to 0.18; bisect-good samples lie roughly between 0.14 and 0.18, bisect-bad samples roughly between 0.09 and 0.11]
fsmark.time.voluntary_context_switches
[ASCII plot omitted: y-axis from 400000 to 2.2e+06; bisect-good samples lie mostly between 2e+06 and 2.2e+06 with one dip toward 1.4e+06, bisect-bad samples lie roughly between 400000 and 600000]
[*] bisect-good sample
[O] bisect-bad sample
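Several perf-stat rows in the comparison table are ratios of other rows, which gives a quick sanity check on the data. The sketch below recomputes three of them from the printed means. The formulas are assumptions inferred from the numbers in this report (in particular, the iTLB miss rate matches misses / (loads + misses)), not taken from the lkp-tests source, and small differences from the printed values are expected because the inputs are rounded averages.

```python
def ipc(instructions: float, cycles: float) -> float:
    """Instructions retired per CPU cycle."""
    return instructions / cycles


def cache_miss_rate_pct(misses: float, references: float) -> float:
    """Cache misses as a percentage of cache references."""
    return misses / references * 100


def itlb_miss_rate_pct(misses: float, loads: float) -> float:
    """Assumed formula: misses as a percentage of loads plus misses."""
    return misses / (loads + misses) * 100


# Means copied from the table: (base commit, tested commit).
instructions = (1.145e12, 1.602e12)
cycles = (2.994e12, 4.697e12)
cache_misses = (2.337e09, 2.184e09)
cache_refs = (4.765e09, 4.609e09)
itlb_misses = (22445130, 14955093)
itlb_loads = (34728336, 25779325)

for i, label in enumerate(("base", "tested")):
    print(
        f"{label:>6}: ipc={ipc(instructions[i], cycles[i]):.2f} "
        f"cache-miss-rate%={cache_miss_rate_pct(cache_misses[i], cache_refs[i]):.2f} "
        f"iTLB-load-miss-rate%={itlb_miss_rate_pct(itlb_misses[i], itlb_loads[i]):.2f}"
    )
```

The drop in ipc alongside the rise in cpu-cycles fits a workload that spends more cycles spinning on the tested commit, though the report itself does not draw that conclusion.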
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
Thanks,
Ying Huang