Greeting,
FYI, we noticed the following commit (built with gcc-9):
commit: 9cbaaa9a2298e5bc367379c47a561570c47a01eb ("block: cleanup the bd_holder
locking")
git://git.infradead.org/users/hch/block.git part_tbl_mutex
in testcase: blktests
version: blktests-x86_64-f51ee53-1_20220128
with following parameters:
disk: 1SSD
test: block-group-00
ucode: 0xe2
on test machine: 4 threads Intel(R) Core(TM) i5-6500 CPU @ 3.20GHz with 32G memory
caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):
If you fix the issue, kindly add following tag
Reported-by: kernel test robot <oliver.sang(a)intel.com>
[ 57.337346][ T1724] WARNING: CPU: 2 PID: 1724 at block/disk-events.c:502
disk_release_events (block/disk-events.c:502)
[ 57.346614][ T1724] Modules linked in: sr_mod cdrom scsi_debug loop dm_multipath dm_mod
btrfs blake2b_generic xor raid6_pq zstd_compress libcrc32c sd_mod t10_pi sg intel_rapl_msr
ipmi_devintf ipmi_msghandler intel_rapl_common x86_pkg_temp_thermal intel_powerclamp
coretemp i915 intel_gtt kvm_intel ttm ahci drm_kms_helper libahci syscopyarea kvm
irqbypass sysfillrect wmi_bmof mei_wdt crct10dif_pclmul sysimgblt crc32_pclmul
crc32c_intel fb_sys_fops ghash_clmulni_intel rapl libata intel_cstate mei_me drm
intel_uncore mei intel_pch_thermal wmi video acpi_pad intel_pmc_core ip_tables [last
unloaded: scsi_debug]
[ 57.399995][ T1724] CPU: 2 PID: 1724 Comm: check Not tainted 5.16.0-11202-g9cbaaa9a2298
#1
[ 57.408239][ T1724] Hardware name: Dell Inc. OptiPlex 7040/0Y7WYT, BIOS 1.8.1
12/05/2017
[ 57.416287][ T1724] RIP: 0010:disk_release_events (block/disk-events.c:502)
[ 57.421873][ T1724] Code: ed 74 29 48 b8 00 00 00 00 00 fc ff df 48 8d 7d 40 48 89 fa 48
c1 ea 03 0f b6 04 02 84 c0 74 04 3c 03 7e 12 83 7d 40 01 74 02 <0f> 0b 48 89 ef 5b
5d e9 99 a6 83 ff e8 94 54 84 ff eb e7 e8 cd 54
All code
========
0: ed in (%dx),%eax
1: 74 29 je 0x2c
3: 48 b8 00 00 00 00 00 movabs $0xdffffc0000000000,%rax
a: fc ff df
d: 48 8d 7d 40 lea 0x40(%rbp),%rdi
11: 48 89 fa mov %rdi,%rdx
14: 48 c1 ea 03 shr $0x3,%rdx
18: 0f b6 04 02 movzbl (%rdx,%rax,1),%eax
1c: 84 c0 test %al,%al
1e: 74 04 je 0x24
20: 3c 03 cmp $0x3,%al
22: 7e 12 jle 0x36
24: 83 7d 40 01 cmpl $0x1,0x40(%rbp)
28: 74 02 je 0x2c
2a:* 0f 0b ud2 <-- trapping instruction
2c: 48 89 ef mov %rbp,%rdi
2f: 5b pop %rbx
30: 5d pop %rbp
31: e9 99 a6 83 ff jmpq 0xffffffffff83a6cf
36: e8 94 54 84 ff callq 0xffffffffff8454cf
3b: eb e7 jmp 0x24
3d: e8 .byte 0xe8
3e: cd 54 int $0x54
Code starting with the faulting instruction
===========================================
0: 0f 0b ud2
2: 48 89 ef mov %rbp,%rdi
5: 5b pop %rbx
6: 5d pop %rbp
7: e9 99 a6 83 ff jmpq 0xffffffffff83a6a5
c: e8 94 54 84 ff callq 0xffffffffff8454a5
11: eb e7 jmp 0xfffffffffffffffa
13: e8 .byte 0xe8
14: cd 54 int $0x54
[ 57.441235][ T1724] RSP: 0018:ffffc90000fcfa50 EFLAGS: 00010206
[ 57.447142][ T1724] RAX: 0000000000000000 RBX: ffff8881a6046800 RCX: ffffffff8120d4bb
[ 57.454929][ T1724] RDX: 1ffff11100d59148 RSI: 0000000000000008 RDI: ffff888806ac8a40
[ 57.462755][ T1724] RBP: ffff888806ac8a00 R08: 0000000000000000 R09: ffffed1049cf8889
[ 57.470543][ T1724] R10: ffff88824e7c4447 R11: ffffed1049cf8888 R12: ffff8881a6046850
[ 57.478344][ T1724] R13: ffff8881031e03e8 R14: 0000000000000000 R15: ffffffff84e79040
[ 57.486132][ T1724] FS: 00007fda67af3740(0000) GS:ffff888783500000(0000)
knlGS:0000000000000000
[ 57.494884][ T1724] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 57.501306][ T1724] CR2: 0000563dcd95bfb4 CR3: 0000000813606005 CR4: 00000000003706e0
[ 57.509105][ T1724] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 57.516907][ T1724] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 57.524738][ T1724] Call Trace:
[ 57.527896][ T1724] <TASK>
[ 57.530692][ T1724] disk_release (block/genhd.c:1113 (discriminator 3))
[ 57.534975][ T1724] device_release (drivers/base/core.c:2234)
[ 57.539443][ T1724] kobject_release (lib/kobject.c:709 lib/kobject.c:736)
[ 57.544070][ T1724] sr_kref_release (drivers/scsi/sr.c:1040) sr_mod
[ 57.549474][ T1724] sr_remove (include/linux/kref.h:66 drivers/scsi/sr.c:1053) sr_mod
[ 57.554272][ T1724] device_release_driver_internal (drivers/base/dd.c:1206
drivers/base/dd.c:1237)
[ 57.560178][ T1724] bus_remove_device (drivers/base/bus.c:530)
[ 57.564962][ T1724] device_del (drivers/base/core.c:3593)
[ 57.569156][ T1724] ? __device_link_del (drivers/base/core.c:3548)
[ 57.574109][ T1724] ? kobject_put (arch/x86/include/asm/atomic.h:190
include/linux/atomic/atomic-instrumented.h:177 include/linux/refcount.h:272
include/linux/refcount.h:315 include/linux/refcount.h:333 include/linux/kref.h:64
lib/kobject.c:753)
[ 57.578405][ T1724] ? sysfs_kf_bin_read (fs/sysfs/file.c:129)
[ 57.583361][ T1724] __scsi_remove_device (drivers/scsi/scsi_sysfs.c:1450)
[ 57.588405][ T1724] scsi_remove_device (drivers/scsi/scsi_sysfs.c:1490)
[ 57.593103][ T1724] sdev_store_delete (drivers/scsi/scsi_sysfs.c:787)
[ 57.597717][ T1724] kernfs_fop_write_iter (fs/kernfs/file.c:300)
[ 57.602845][ T1724] new_sync_write (fs/read_write.c:504 (discriminator 1))
[ 57.607360][ T1724] ? _raw_write_lock_irq (kernel/locking/spinlock.c:153)
[ 57.612500][ T1724] ? new_sync_read (fs/read_write.c:493)
[ 57.617117][ T1724] vfs_write (fs/read_write.c:590)
[ 57.621219][ T1724] ksys_write (fs/read_write.c:643)
[ 57.625318][ T1724] ? __ia32_sys_read (fs/read_write.c:633)
[ 57.629937][ T1724] ? fput_many (fs/file_table.c:336)
[ 57.634125][ T1724] do_syscall_64 (arch/x86/entry/common.c:50 arch/x86/entry/common.c:80)
[ 57.638394][ T1724] entry_SYSCALL_64_after_hwframe (arch/x86/entry/entry_64.S:113)
[ 57.644130][ T1724] RIP: 0033:0x7fda67be0504
[ 57.648396][ T1724] Code: 00 f7 d8 64 89 02 48 c7 c0 ff ff ff ff eb b3 0f 1f 80 00 00 00
00 48 8d 05 f9 61 0d 00 8b 00 85 c0 75 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff
77 54 c3 0f 1f 00 41 54 49 89 d4 55 48 89 f5 53
All code
========
0: 00 f7 add %dh,%bh
2: d8 64 89 02 fsubs 0x2(%rcx,%rcx,4)
6: 48 c7 c0 ff ff ff ff mov $0xffffffffffffffff,%rax
d: eb b3 jmp 0xffffffffffffffc2
f: 0f 1f 80 00 00 00 00 nopl 0x0(%rax)
16: 48 8d 05 f9 61 0d 00 lea 0xd61f9(%rip),%rax # 0xd6216
1d: 8b 00 mov (%rax),%eax
1f: 85 c0 test %eax,%eax
21: 75 13 jne 0x36
23: b8 01 00 00 00 mov $0x1,%eax
28: 0f 05 syscall
2a:* 48 3d 00 f0 ff ff cmp $0xfffffffffffff000,%rax <-- trapping instruction
30: 77 54 ja 0x86
32: c3 retq
33: 0f 1f 00 nopl (%rax)
36: 41 54 push %r12
38: 49 89 d4 mov %rdx,%r12
3b: 55 push %rbp
3c: 48 89 f5 mov %rsi,%rbp
3f: 53 push %rbx
Code starting with the faulting instruction
===========================================
0: 48 3d 00 f0 ff ff cmp $0xfffffffffffff000,%rax
6: 77 54 ja 0x5c
8: c3 retq
9: 0f 1f 00 nopl (%rax)
c: 41 54 push %r12
e: 49 89 d4 mov %rdx,%r12
11: 55 push %rbp
12: 48 89 f5 mov %rsi,%rbp
15: 53 push %rbx
To reproduce:
git clone
https://github.com/intel/lkp-tests.git
cd lkp-tests
sudo bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
sudo bin/lkp run generated-yaml-file
# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.
---
0DAY/LKP+ Test Infrastructure Open Source Technology Center
https://lists.01.org/hyperkitty/list/lkp@lists.01.org Intel Corporation
Thanks,
Oliver Sang