Server Crash - Kernel Panic - linux-image-5.17.8-eve-ng-uksm-wg+

Moderator: mike

Post Reply
sflynn1509
Posts: 1
Joined: Sat May 01, 2021 1:25 pm

Server Crash - Kernel Panic - linux-image-5.17.8-eve-ng-uksm-wg+

Post by sflynn1509 » Tue Jan 02, 2024 3:36 pm

I updated to the latest release of EVE-NG Pro v5.0.1-120 from v5.0.1-106 this weekend.
Now, when starting any VM, the server crashes with a kernel panic and I must power off the server.

Code: Select all

Jan  2 14:55:48 labs kernel: [  490.231961] ------------[ cut here ]------------
Jan  2 14:55:48 labs kernel: [  490.231966] WARNING: CPU: 31 PID: 360 at arch/x86/kvm/../../../virt/kvm/kvm_main.c:649 kvm_mmu_notifier_change_pte+0x28a/0x2b0 [kvm]
Jan  2 14:55:48 labs kernel: [  490.232027] Modules linked in: cls_u32 sch_prio wireguard curve25519_x86_64 libcurve25519_generic libchacha20poly1305 chacha_x86_64 poly1305_x86_64 libchacha ip6_udp_tunnel udp_tunnel ebt_802_3 nft_compat nft_meta_bridge nf_tables xt_conntrack xt_MASQUERADE nf_conntrack_netlink xfrm_user xfrm_algo xt_addrtype iptable_filter iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 bpfilter br_netfilter cfg80211 nfnetlink overlay dummy bridge stp llc dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua binfmt_misc ipmi_ssif intel_rapl_msr intel_rapl_common sb_edac x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm joydev input_leds mei_me rapl ipmi_si mei intel_cstate ipmi_devintf ipmi_msghandler acpi_power_meter mac_hid sch_fq_codel ramoops efi_pstore msr reed_solomon ip_tables x_tables autofs4 btrfs blake2b_generic zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear hid_generic usbhid
Jan  2 14:55:48 labs kernel: [  490.232117]  hid mgag200 drm_shmem_helper drm_kms_helper syscopyarea sysfillrect sysimgblt crct10dif_pclmul igb fb_sys_fops crc32_pclmul ghash_clmulni_intel aesni_intel cec ahci dca crypto_simd cryptd drm megaraid_sas libahci lpc_ich i2c_algo_bit wmi
Jan  2 14:55:48 labs kernel: [  490.232136] CPU: 31 PID: 360 Comm: uksmd Not tainted 5.17.8-eve-ng-uksm-wg+ #1
Jan  2 14:55:48 labs kernel: [  490.232139] Hardware name: Dell Inc. PowerEdge T620/0F5XM3, BIOS 2.9.0 12/06/2019
Jan  2 14:55:48 labs kernel: [  490.232140] RIP: 0010:kvm_mmu_notifier_change_pte+0x28a/0x2b0 [kvm]
Jan  2 14:55:48 labs kernel: [  490.232175] Code: ff 4c 89 ef e8 67 15 c5 ef e9 5b ff ff ff 4c 89 ef 44 88 4d a0 e8 36 fd ff ff 44 0f b6 4d a0 45 84 c9 0f 84 41 ff ff ff eb d7 <0f> 0b e9 b8 fd ff ff 0f 0b e9 4d ff ff ff 0f 0b e9 37 ff ff ff e8
Jan  2 14:55:48 labs kernel: [  490.232177] RSP: 0018:ffffacb9ced67bb0 EFLAGS: 00010246
Jan  2 14:55:48 labs kernel: [  490.232179] RAX: 0000000000000000 RBX: ffffacb9e0966c18 RCX: 800000314118f805
Jan  2 14:55:48 labs kernel: [  490.232181] RDX: 00007f9c8a38f000 RSI: ffff8a37fa4d8cc0 RDI: ffffacb9e0966c18
Jan  2 14:55:48 labs kernel: [  490.232182] RBP: ffffacb9ced67c38 R08: 0000000000000000 R09: 0000000000000000
Jan  2 14:55:48 labs kernel: [  490.232183] R10: 0000000000000000 R11: 0000000000000000 R12: ffff8a37fa4d8cc0
Jan  2 14:55:48 labs kernel: [  490.232184] R13: 800000314118f805 R14: 00007f9c8a38f000 R15: 00007f9c8a38f000
Jan  2 14:55:48 labs kernel: [  490.232185] FS:  0000000000000000(0000) GS:ffff8a672fbc0000(0000) knlGS:0000000000000000
Jan  2 14:55:48 labs kernel: [  490.232187] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan  2 14:55:48 labs kernel: [  490.232188] CR2: 000000c0004f1000 CR3: 0000004822e0a002 CR4: 00000000001726e0
Jan  2 14:55:48 labs kernel: [  490.232190] Call Trace:
Jan  2 14:55:48 labs kernel: [  490.232192]  <TASK>
Jan  2 14:55:48 labs kernel: [  490.232195]  ? kvm_arch_mmu_notifier_invalidate_range+0x21/0x50 [kvm]
Jan  2 14:55:48 labs kernel: [  490.232236]  __mmu_notifier_change_pte+0x55/0x90
Jan  2 14:55:48 labs kernel: [  490.232241]  restore_uksm_page_pte+0x23e/0x250
Jan  2 14:55:48 labs kernel: [  490.232244]  cmp_and_merge_page+0x89c/0x2580
Jan  2 14:55:48 labs kernel: [  490.232248]  scan_vma_one_page+0x500/0x1720
Jan  2 14:55:48 labs kernel: [  490.232251]  uksm_do_scan+0x15a/0x3130
Jan  2 14:55:48 labs kernel: [  490.232253]  ? del_timer_sync+0x29/0x40
Jan  2 14:55:48 labs kernel: [  490.232258]  ? schedule_timeout+0x19c/0x290
Jan  2 14:55:48 labs kernel: [  490.232263]  ? uksm_do_scan+0x3130/0x3130
Jan  2 14:55:48 labs kernel: [  490.232265]  uksm_scan_thread+0x164/0x1a0
Jan  2 14:55:48 labs kernel: [  490.232268]  ? _raw_spin_lock_irqsave+0x2a/0x60
Jan  2 14:55:48 labs kernel: [  490.232270]  ? _raw_spin_unlock_irqrestore+0x29/0x3d
Jan  2 14:55:48 labs kernel: [  490.232273]  kthread+0xfd/0x130
Jan  2 14:55:48 labs kernel: [  490.232278]  ? kthread_complete_and_exit+0x20/0x20
Jan  2 14:55:48 labs kernel: [  490.232280]  ret_from_fork+0x1f/0x30
Jan  2 14:55:48 labs kernel: [  490.232286]  </TASK>
Jan  2 14:55:48 labs kernel: [  490.232287] ---[ end trace 0000000000000000 ]---
Hardware:
Dell PowerEdge T620 Server
  • Dual E5-2658 v2 CPUs
  • 384G DDR3 RAM
  • 6 x 1TB SSD (RAID6)

Any suggestions on how to fix this issue?
How do I go about rolling back the kernel or upgrade process ?

Uldis (UD)
Posts: 5086
Joined: Wed Mar 15, 2017 4:44 pm
Location: London
Contact:

Re: Server Crash - Kernel Panic - linux-image-5.17.8-eve-ng-uksm-wg+

Post by Uldis (UD) » Tue Jan 02, 2024 7:24 pm

System HDD is screwed, it is not kernel issue
revert back is impossible form such stage

Post Reply