FYI, we noticed the following commit:
git://bee.sh.intel.com/git/aaron/linux.git lck1376
commit 7361bbc30e1cb9ddee70ff6895cf29256ab35418 ("lck1376: avoid global atomic for huge_zero_page")
in testcase: ltp
with the following parameters: test=mm
on test machine: nhm-white2: 8 threads Nehalem with 4G memory
caused the changes below:
+------------------------------------------+------------+------------+
|                                          | d325ea8594 | 7361bbc30e |
+------------------------------------------+------------+------------+
| boot_successes                           | 46         | 19         |
| boot_failures                            | 4          | 4          |
| invoked_oom-killer:gfp_mask=0x           | 4          |            |
| Mem-Info                                 | 4          |            |
| Out_of_memory:Kill_process               | 4          |            |
| backtrace:vfs_write                      | 3          |            |
| backtrace:SyS_write                      | 3          |            |
| kernel_BUG_at_mm/huge_memory.c           | 0          | 4          |
| invalid_opcode:#[##]SMP                  | 0          | 4          |
| RIP:put_huge_zero_page                   | 0          | 4          |
| Kernel_panic-not_syncing:Fatal_exception | 0          | 4          |
| backtrace:do_group_exit                  | 0          | 4          |
| backtrace:SyS_exit_group                 | 0          | 4          |
+------------------------------------------+------------+------------+
[ 18.551742] Key type id_legacy registered
[ 19.981339] loop: module loaded
[ 24.980427] ------------[ cut here ]------------
[ 24.980806] kernel BUG at mm/huge_memory.c:257!
[ 24.981155] invalid opcode: 0000 [#1] SMP
[ 24.981478] Modules linked in: loop rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver netconsole sg sr_mod sd_mod cdrom dcdbas snd_hda_codec_realtek snd_hda_codec_generic coretemp kvm_intel kvm ata_generic pata_acpi irqbypass snd_hda_codec_hdmi crc32c_intel ata_piix i7core_edac serio_raw pcspkr edac_core libata usb_storage snd_hda_intel snd_hda_codec firewire_ohci snd_hda_core firewire_core crc_itu_t snd_hwdep snd_pcm snd_timer snd soundcore shpchp acpi_cpufreq broadcom bcm_phy_lib
[ 24.985517] CPU: 6 PID: 850 Comm: mmstress Not tainted 4.7.0-rc3-00056-g7361bbc #1
[ 24.985984] Hardware name: Dell Inc. Studio XPS 8000/0X231R, BIOS A01 08/11/2009
[ 24.986447] task: ffff880029b12480 ti: ffff88008cde8000 task.ti: ffff88008cde8000
[ 24.986908] RIP: 0010:[<ffffffff811e61d4>] [<ffffffff811e61d4>] put_huge_zero_page+0x14/0x20
[ 24.987475] RSP: 0018:ffff88008cdebe70 EFLAGS: 00010246
[ 24.987792] RAX: 0000000000000001 RBX: ffff88008d7326c0 RCX: 0000000180660064
[ 24.988179] RDX: 0000000180660065 RSI: ffffea00049f72c0 RDI: 0000000040000000
[ 24.988646] RBP: ffff88008cdebe70 R08: 0000000027dcb501 R09: 0000000180660064
[ 24.989041] R10: ffff88013fd9c000 R11: ffff880127dcb550 R12: ffff880029b12fb0
[ 24.989430] R13: ffff88008d7326c0 R14: 0000000000000001 R15: ffff88008d732730
[ 24.989828] FS: 0000000000000000(0000) GS:ffff88013fd80000(0000) knlGS:0000000000000000
[ 24.990321] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 24.990664] CR2: 0000000000610268 CR3: 0000000029a46000 CR4: 00000000000006e0
[ 24.991063] Stack:
[ 24.991233] ffff88008cdebe90 ffffffff8107aa8e ffff880029b12480 ffff880029b12fb0
[ 24.991944] ffff88008cdebf08 ffffffff81081685 ffff880029b12fb4 000000008cdebf58
[ 24.992571] ffff88008cdebf18 ffffffff8106612c ffff88008bff67e8 0000000000000008
[ 24.993197] Call Trace:
[ 24.993447] [<ffffffff8107aa8e>] mmput+0x11e/0x130
[ 24.993761] [<ffffffff81081685>] do_exit+0x335/0xb70
[ 24.994070] [<ffffffff8106612c>] ? __do_page_fault+0x21c/0x4d0
[ 24.994460] [<ffffffff81081f45>] do_group_exit+0x45/0xb0
[ 24.994792] [<ffffffff81081fc4>] SyS_exit_group+0x14/0x20
[ 24.995122] [<ffffffff81917372>] entry_SYSCALL_64_fastpath+0x1a/0xa4
[ 24.995484] Code: f0 49 ff 44 24 50 b8 01 00 00 00 5b 41 5c 41 5d 41 5e 5d c3 0f 1f 00 66 66 66 66 90 55 48 89 e5 f0 ff 0d 70 1b 1a 01 74 02 5d c3 <0f> 0b 66 2e 0f 1f 84 00 00 00 00 00 66 66 66 66 90 8b 47 58 85
[ 24.998924] RIP [<ffffffff811e61d4>] put_huge_zero_page+0x14/0x20
[ 24.999317] RSP <ffff88008cdebe70>
[ 24.999586] ---[ end trace 66e4ec04d53c3feb ]---
[ 24.999858] Kernel panic - not syncing: Fatal exception
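
A note for readers decoding the trace: in the Code: line, the bytes just before the trapping <0f> 0b (ud2) are f0 ff 0d ... (a locked decrement of a global counter) followed by 74 02 (jump if zero), so the BUG fires when put_huge_zero_page() takes that counter to zero. The sketch below is a minimal userspace model of that invariant, not the kernel implementation. The names zero_refcount, get_zero_page() and put_zero_page_hits_zero() are hypothetical, and the initial count of 2 (one caller reference plus one pinned reference) is an assumption about how ordinary puts are normally kept from reaching zero.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical userspace stand-in for the global counter that
 * put_huge_zero_page() decrements; this is not kernel code. */
static atomic_int zero_refcount;

/* Take a reference. Assumption: the first user installs the page with a
 * count of 2, i.e. one reference for the caller plus one pinned
 * reference that only a reclaim path is supposed to drop. */
static void get_zero_page(void)
{
    int old = atomic_load(&zero_refcount);

    /* Fast path: increment, but only while the count is non-zero. */
    while (old != 0) {
        if (atomic_compare_exchange_weak(&zero_refcount, &old, old + 1))
            return;
    }

    /* Slow path: install the page; if another thread won the race,
     * take an ordinary reference on the page it installed. */
    int expected = 0;
    if (!atomic_compare_exchange_strong(&zero_refcount, &expected, 2))
        atomic_fetch_add(&zero_refcount, 1);
}

/* Drop a reference. Returns true when this drop took the count to zero,
 * the same condition under which the locked decrement in the trace
 * falls through to ud2. */
static bool put_zero_page_hits_zero(void)
{
    int old = atomic_fetch_sub(&zero_refcount, 1);

    assert(old > 0); /* dropping a reference that was never taken */
    return old == 1;
}
```

In this model, balanced get/put pairs never bring the count below 1, so a true return means a reference was dropped that the caller did not hold or the pinned reference was lost. In the trace above, the put that reaches zero is called from mmput() during do_exit().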
To reproduce:
        git clone git://git.kernel.org/pub/scm/linux/kernel/git/wfg/lkp-tests.git
        cd lkp-tests
        bin/lkp install job.yaml  # job file is attached in this email
        bin/lkp run job.yaml
Thanks,
Xiaolong