Greeting,
FYI, we noticed the following commit (built with gcc-9):
commit: b4878255cd61bc62bf239c02d497af3593ee1028 ("mm/page_alloc: Add remote draining
support to per-cpu lists")
https://git.kernel.org/cgit/linux/kernel/git/nsaenz/linux-rpi.git pcplists-rfc-v1
in testcase: kernel-selftests
version: kernel-selftests-x86_64-c8c9111a-1_20210929
with following parameters:
group: vm
ucode: 0xe2
test-description: The kernel contains a set of "self tests" under the
tools/testing/selftests/ directory. These are intended to be small unit tests to exercise
individual code paths in the kernel.
test-url:
https://www.kernel.org/doc/Documentation/kselftest.txt
on test machine: 4 threads Intel(R) Core(TM) i5-6500 CPU @ 3.20GHz with 32G memory
caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):
If you fix the issue, kindly add following tag
Reported-by: kernel test robot <oliver.sang(a)intel.com>
[ 77.386350][ T383] # ------------------------------------
[ 77.386353][ T383]
[ 77.396938][ T383] # running vmalloc stability smoke test
[ 77.396941][ T383]
[ 77.407259][ T383] # ------------------------------------
[ 77.407262][ T383]
[ 77.418257][ T383] # Run smoke test. Note, this test provides basic coverage.
[ 77.418261][ T383]
[ 77.430982][ T383] # Please check ./test_vmalloc.sh output how it can be used
[ 77.430985][ T383]
[ 77.443607][ T383] # for deep performance analysis as well as stress testing.
[ 77.443611][ T383]
[ 89.243110][ T9412] vmap allocation for size 8192 failed: use vmalloc=<size> to
increase size
[ 89.253083][ T9412] vmalloc_test/0: vmalloc error: size 4096, vm_struct allocation
failed, mode:0xdc0(GFP_KERNEL|__GFP_ZERO), nodemask=(null),cpuset=/,mems_allowed=0
[ 89.270153][ T9412] CPU: 0 PID: 9412 Comm: vmalloc_test/0 Tainted: G I
5.15.0-rc4-00018-gb4878255cd61 #1
[ 89.282017][ T9412] Hardware name: Dell Inc. OptiPlex 7040/0Y7WYT, BIOS 1.1.1
10/07/2015
[ 89.291314][ T9412] Call Trace:
[ 89.295591][ T9412] dump_stack_lvl+0x45/0x59
[ 89.301109][ T9412] warn_alloc.cold+0x7b/0xdf
[ 89.306697][ T9412] ? __get_vm_area_node+0x150/0x180
[ 89.313920][ T9412] __vmalloc_node_range+0xc4/0x100
[ 89.320123][ T9412] __vmalloc_node+0x44/0x80
[ 89.325637][ T9412] ? test_func+0x110/0x2c0 [test_vmalloc]
[ 89.332510][ T9412] align_shift_alloc_test+0x36/0x80 [test_vmalloc]
[ 89.339949][ T9412] test_func+0x110/0x2c0 [test_vmalloc]
[ 89.346586][ T9412] ? fix_align_alloc_test+0x80/0x80 [test_vmalloc]
[ 89.354078][ T9412] kthread+0x14f/0x180
[ 89.359216][ T9412] ? set_kthread_struct+0x40/0x40
[ 89.365207][ T9412] ret_from_fork+0x22/0x30
[ 89.370663][ T9412] Mem-Info:
[ 89.375000][ T9412] active_anon:102 inactive_anon:54277 isolated_anon:0
[ 89.375000][ T9412] active_file:0 inactive_file:32 isolated_file:0
[ 89.375000][ T9412] unevictable:749441 dirty:0 writeback:0
[ 89.375000][ T9412] slab_reclaimable:32933 slab_unreclaimable:11391
[ 89.375000][ T9412] mapped:7221 shmem:2404 pagetables:645 bounce:0
[ 89.375000][ T9412] kernel_misc_reclaimable:0
[ 89.375000][ T9412] free:7303325 free_pcp:14047 free_cma:0
[ 89.457761][ T9412] Node 0
[ 89.457763][ T9412] BUG: using smp_processor_id() in preemptible [00000000] code:
vmalloc_test/0/9412
[ 89.472027][ T9412] caller is show_free_areas.cold+0x785/0x977
[ 89.478974][ T9412] CPU: 1 PID: 9412 Comm: vmalloc_test/0 Tainted: G I
5.15.0-rc4-00018-gb4878255cd61 #1
[ 89.490879][ T9412] Hardware name: Dell Inc. OptiPlex 7040/0Y7WYT, BIOS 1.1.1
10/07/2015
[ 89.500081][ T9412] Call Trace:
[ 89.504316][ T9412] dump_stack_lvl+0x45/0x59
[ 89.509759][ T9412] check_preemption_disabled+0xd9/0x100
[ 89.516300][ T9412] show_free_areas.cold+0x785/0x977
[ 89.522449][ T9412] ? vprintk_emit+0x100/0x300
[ 89.528071][ T9412] ? _printk+0x58/0x6f
[ 89.533068][ T9412] show_mem+0x29/0xd9
[ 89.537963][ T9412] warn_alloc.cold+0xb7/0xdf
[ 89.543474][ T9412] ? __get_vm_area_node+0x150/0x180
[ 89.550627][ T9412] __vmalloc_node_range+0xc4/0x100
[ 89.556634][ T9412] __vmalloc_node+0x44/0x80
[ 89.562013][ T9412] ? test_func+0x110/0x2c0 [test_vmalloc]
[ 89.568609][ T9412] align_shift_alloc_test+0x36/0x80 [test_vmalloc]
[ 89.575993][ T9412] test_func+0x110/0x2c0 [test_vmalloc]
[ 89.582406][ T9412] ? fix_align_alloc_test+0x80/0x80 [test_vmalloc]
[ 89.589773][ T9412] kthread+0x14f/0x180
[ 89.594727][ T9412] ? set_kthread_struct+0x40/0x40
[ 89.600641][ T9412] ret_from_fork+0x22/0x30
[ 89.605957][ T9412] DMA free:15360kB min:32kB low:44kB high:56kB
reserved_highatomic:0KB active_anon:0kB inactive_anon:0kB active_file:0kB
inactive_file:0kB unevictable:0kB writepending:0kB present:15980kB managed:15360kB
mlocked:0kB bounce:0kB free_pcp:0kB local_pcp:0kB free_cma:0kB
[ 89.633503][ T9412] lowmem_reserve[]: 0 2071 31126 31126 31126
[ 89.640520][ T9412] Node 0
[ 89.640522][ T9412] BUG: using smp_processor_id() in preemptible [00000000] code:
vmalloc_test/0/9412
[ 89.654811][ T9412] caller is show_free_areas.cold+0x785/0x977
[ 89.661782][ T9412] CPU: 1 PID: 9412 Comm: vmalloc_test/0 Tainted: G I
5.15.0-rc4-00018-gb4878255cd61 #1
[ 89.673949][ T9412] Hardware name: Dell Inc. OptiPlex 7040/0Y7WYT, BIOS 1.1.1
10/07/2015
[ 89.683360][ T9412] Call Trace:
[ 89.687700][ T9412] dump_stack_lvl+0x45/0x59
[ 89.693309][ T9412] check_preemption_disabled+0xd9/0x100
[ 89.700034][ T9412] show_free_areas.cold+0x785/0x977
[ 89.706314][ T9412] ? vprintk_emit+0x100/0x300
[ 89.712126][ T9412] ? _printk+0x58/0x6f
[ 89.717381][ T9412] show_mem+0x29/0xd9
[ 89.722366][ T9412] warn_alloc.cold+0xb7/0xdf
[ 89.727949][ T9412] ? __get_vm_area_node+0x150/0x180
[ 89.735254][ T9412] __vmalloc_node_range+0xc4/0x100
[ 89.741378][ T9412] __vmalloc_node+0x44/0x80
[ 89.746902][ T9412] ? test_func+0x110/0x2c0 [test_vmalloc]
[ 89.753531][ T9412] align_shift_alloc_test+0x36/0x80 [test_vmalloc]
[ 89.761040][ T9412] test_func+0x110/0x2c0 [test_vmalloc]
[ 89.767493][ T9412] ? fix_align_alloc_test+0x80/0x80 [test_vmalloc]
[ 89.774990][ T9412] kthread+0x14f/0x180
[ 89.780109][ T9412] ? set_kthread_struct+0x40/0x40
[ 89.786153][ T9412] ret_from_fork+0x22/0x30
[ 89.791599][ T9412] DMA32 free:2120628kB min:4492kB low:6612kB high:8732kB
reserved_highatomic:0KB active_anon:0kB inactive_anon:0kB active_file:0kB
inactive_file:0kB unevictable:0kB writepending:0kB present:2186416kB managed:2120880kB
mlocked:0kB bounce:0kB free_pcp:248kB local_pcp:0kB free_cma:0kB
[ 89.820802][ T9412] lowmem_reserve[]: 0 0 29055 29055 29055
Please be noted that we reported this due to we didn't observe similar to below
on parent:
[ 89.640522][ T9412] BUG: using smp_processor_id() in preemptible [00000000] code:
vmalloc_test/0/9412
96e77aa0817824bb b4878255cd61bc62bf239c02d49
---------------- ---------------------------
fail:runs %reproduction fail:runs
| | |
:6 83% 5:6
dmesg.BUG:using_smp_processor_id()in_preemptible <-----
6:6 -100% :6
dmesg.Kernel_panic-not_syncing:Fatal_exception
6:6 -17% 5:6 dmesg.Mem-Info
6:6 -100% :6 dmesg.RIP:show_free_areas.cold
6:6 -100% :6 dmesg.canonical_address#:#[##]
however, it seems the parent is even worse than fbc, that it will directly
crash as below (also attached dmesg-parent.xz), FYI:
[ 77.717079][ T374] # ------------------------------------
[ 77.717083][ T374]
[ 77.727872][ T374] # running vmalloc stability smoke test
[ 77.727875][ T374]
[ 77.738707][ T374] # ------------------------------------
[ 77.738710][ T374]
[ 77.750038][ T374] # Run smoke test. Note, this test provides basic coverage.
[ 77.750056][ T374]
[ 77.763061][ T374] # Please check ./test_vmalloc.sh output how it can be used
[ 77.763065][ T374]
[ 77.776174][ T374] # for deep performance analysis as well as stress testing.
[ 77.776178][ T374]
[ 87.958239][ T9394] vmap allocation for size 8192 failed: use vmalloc=<size> to
increase size
[ 87.968300][ T9394] vmalloc_test/0: vmalloc error: size 4096, vm_struct allocation
failed, mode:0xdc0(GFP_KERNEL|__GFP_ZERO),
nodemask=(null),cpuset=/,mems_allowed=0
[ 87.985710][ T9394] CPU: 0 PID: 9394 Comm: vmalloc_test/0 Tainted: G I
5.15.0-rc4-00017-g96e77aa08178 #1
[ 87.997602][ T9394] Hardware name: Dell Inc. OptiPlex 7040/0Y7WYT, BIOS 1.1.1
10/07/2015
[ 88.006740][ T9394] Call Trace:
[ 88.010934][ T9394] dump_stack_lvl+0x45/0x59
[ 88.016326][ T9394] warn_alloc.cold+0x7b/0xdf
[ 88.021809][ T9394] ? __get_vm_area_node+0x150/0x180
[ 88.028995][ T9394] __vmalloc_node_range+0xc4/0x100
[ 88.035017][ T9394] __vmalloc_node+0x44/0x80
[ 88.040454][ T9394] ? test_func+0x110/0x2c0 [test_vmalloc]
[ 88.047121][ T9394] align_shift_alloc_test+0x36/0x80 [test_vmalloc]
[ 88.054584][ T9394] test_func+0x110/0x2c0 [test_vmalloc]
[ 88.061046][ T9394] ? fix_align_alloc_test+0x80/0x80 [test_vmalloc]
[ 88.068478][ T9394] kthread+0x14f/0x180
[ 88.073479][ T9394] ? set_kthread_struct+0x40/0x40
[ 88.079434][ T9394] ret_from_fork+0x22/0x30
[ 88.084813][ T9394] Mem-Info:
[ 88.088851][ T9394] active_anon:100 inactive_anon:54065 isolated_anon:0
[ 88.088851][ T9394] active_file:0 inactive_file:44 isolated_file:0
[ 88.088851][ T9394] unevictable:749383 dirty:12 writeback:0
[ 88.088851][ T9394] slab_reclaimable:32956 slab_unreclaimable:11286
[ 88.088851][ T9394] mapped:7202 shmem:2404 pagetables:630 bounce:0
[ 88.088851][ T9394] kernel_misc_reclaimable:0
[ 88.088851][ T9394] free:7300262 free_pcp:17806 free_cma:0
[ 88.137264][ T9394] Node 0 active_anon:400kB inactive_anon:216260kB active_file:0kB
inactive_file:176kB unevictable:2997532kB
isolated(anon):0kB isolated(file):0kB mapped:28808kB dirty:48kB writeback:0kB shmem:9616kB
shmem_thp: 0kB shmem_pmdmapped: 0kB an
on_thp: 92160kB writeback_tmp:0kB kernel_stack:2832kB pagetables:2520kB all_unreclaimable?
no
[ 88.171273][ T9394] Node 0
[ 88.171293][ T9394] general protection fault, probably for non-canonical address
0xffff11108d3d5060: 0000 [#1] PREEMPT SMP PTI
[ 88.187617][ T9394] CPU: 0 PID: 9394 Comm: vmalloc_test/0 Tainted: G I
5.15.0-rc4-00017-g96e77aa08178 #1
[ 88.199464][ T9394] Hardware name: Dell Inc. OptiPlex 7040/0Y7WYT, BIOS 1.1.1
10/07/2015
[ 88.208636][ T9394] RIP: 0010:show_free_areas.cold+0x75b/0x958
[ 88.215620][ T9394] Code: 0f 85 cb fc ff ff 8b 7d 50 e9 a2 fd ff ff e8 fd 12 07 00 41
8b 76 50 48 c7 c7 8b b4 a1 82 e8 9d 35 f
f ff 4d 8b be 10 06 00 00 <65> 49 8b 46 60 4d 8b ae 00 06 00 00 4d 8b a6 f8 05 00 00
48 89 44
[ 88.237297][ T9394] RSP: 0018:ffffc9000b867c40 EFLAGS: 00010246
[ 88.244326][ T9394] RAX: 0000000000000007 RBX: 0000000000000000 RCX: 0000000000000000
[ 88.253320][ T9394] RDX: 0000000000000000 RSI: ffffffff81258a57 RDI: ffffffff81258a57
[ 88.262575][ T9394] RBP: 0000000000000004 R08: 0000000000000003 R09: 0000000000000001
[ 88.271603][ T9394] R10: 0000000000000001 R11: 0000000000000000 R12: 000000000002f2e0
[ 88.280614][ T9394] R13: 0000000000000003 R14: ffff888871fd5000 R15: 0000000000000000
[ 88.289596][ T9394] FS: 0000000000000000(0000) GS:ffff88881b400000(0000)
knlGS:0000000000000000
[ 88.299478][ T9394] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 88.307029][ T9394] CR2: 0000555a546aada4 CR3: 0000000870026004 CR4: 00000000003706f0
[ 88.315948][ T9394] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 88.324862][ T9394] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 88.333811][ T9394] Call Trace:
[ 88.337997][ T9394] ? vprintk_emit+0x100/0x300
[ 88.343589][ T9394] ? _printk+0x58/0x6f
[ 88.348608][ T9394] show_mem+0x29/0xd9
[ 88.353486][ T9394] warn_alloc.cold+0xb7/0xdf
[ 88.358976][ T9394] ? __get_vm_area_node+0x150/0x180
[ 88.366122][ T9394] __vmalloc_node_range+0xc4/0x100
[ 88.372163][ T9394] __vmalloc_node+0x44/0x80
[ 88.377596][ T9394] ? test_func+0x110/0x2c0 [test_vmalloc]
[ 88.384227][ T9394] align_shift_alloc_test+0x36/0x80 [test_vmalloc]
[ 88.391660][ T9394] test_func+0x110/0x2c0 [test_vmalloc]
[ 88.398105][ T9394] ? fix_align_alloc_test+0x80/0x80 [test_vmalloc]
[ 88.405540][ T9394] kthread+0x14f/0x180
[ 88.410580][ T9394] ? set_kthread_struct+0x40/0x40
[ 88.416490][ T9394] ret_from_fork+0x22/0x30
[ 88.421808][ T9394] Modules linked in: test_vmalloc(+) btrfs blake2b_generic xor
zstd_compress raid6_pq libcrc32c ipmi_devintf
ipmi_msghandler sd_mod t10_pi sg intel_rapl_msr intel_rapl_common x86_pkg_temp_thermal
intel_powerclamp coretemp crct10dif_pclmu
l crc32_pclmul crc32c_intel ghash_clmulni_intel rapl intel_cstate i915 mei_wdt wmi_bmof
joydev ahci libahci intel_uncore intel_gt
t libata mei_me i2c_i801 mei intel_pch_thermal i2c_smbus ttm video wmi intel_pmc_core
acpi_pad ip_tables
[ 88.469135][ T9394] ---[ end trace e93dee9a3677a6e6 ]---
[ 88.475556][ T9394] RIP: 0010:show_free_areas.cold+0x75b/0x958
[ 88.482445][ T9394] Code: 0f 85 cb fc ff ff 8b 7d 50 e9 a2 fd ff ff e8 fd 12 07 00 41
8b 76 50 48 c7 c7 8b b4 a1 82 e8 9d 35 f
f ff 4d 8b be 10 06 00 00 <65> 49 8b 46 60 4d 8b ae 00 06 00 00 4d 8b a6 f8 05 00 00
48 89 44
[ 88.504166][ T9394] RSP: 0018:ffffc9000b867c40 EFLAGS: 00010246
[ 88.511208][ T9394] RAX: 0000000000000007 RBX: 0000000000000000 RCX: 0000000000000000
[ 88.520176][ T9394] RDX: 0000000000000000 RSI: ffffffff81258a57 RDI: ffffffff81258a57
[ 88.529164][ T9394] RBP: 0000000000000004 R08: 0000000000000003 R09: 0000000000000001
[ 88.538151][ T9394] R10: 0000000000000001 R11: 0000000000000000 R12: 000000000002f2e0
[ 88.547131][ T9394] R13: 0000000000000003 R14: ffff888871fd5000 R15: 0000000000000000
[ 88.556084][ T9394] FS: 0000000000000000(0000) GS:ffff88881b400000(0000)
knlGS:0000000000000000
[ 88.565996][ T9394] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 88.573636][ T9394] CR2: 0000555a546aada4 CR3: 0000000870026004 CR4: 00000000003706f0
[ 88.582633][ T9394] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 88.591664][ T9394] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 88.600643][ T9394] Kernel panic - not syncing: Fatal exception
[ 88.607720][ T9394] Kernel Offset: disabled
To reproduce:
git clone
https://github.com/intel/lkp-tests.git
cd lkp-tests
sudo bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
sudo bin/lkp run generated-yaml-file
# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.
---
0DAY/LKP+ Test Infrastructure Open Source Technology Center
https://lists.01.org/hyperkitty/list/lkp@lists.01.org Intel Corporation
Thanks,
Oliver Sang