Kernel Panic
This shouldn’t come as a surprise, but I assumed that all my problems would be solved and forgot to enable reboot-on-kernel-panic. I should have been more cynical and expected the worst. Three days into my two-week holiday, the server suddenly became unresponsive. I hoped it was just temporary, but after the second day I began to suspect a power failure combined with a bug in one of my scripts, while steeling myself for the possibility that it was the dreaded kernel panic. Due to illness in my family, we decided to cut our vacation short and traveled home earlier than expected. Once home, I was immediately greeted by the horrific sight of a kernel backtrace spewed across the monitor. I swiftly reset the server, and after chugging along for a couple of minutes because of the unclean shutdown, it was up and running normally, apart from a memory lapse of 8 days.
I’ve since enabled reboot-on-kernel-panic and upgraded Proxmox to 8.0, and with it the kernel to 6.2. I will monitor the server for another month before I fiddle with the BIOS once again to make sure there’s nothing I’ve missed. (C-states are a nasty source of bugs.)
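For anyone wanting to check the same setting: on Linux, reboot-on-kernel-panic is normally governed by the kernel.panic sysctl, where 0 means the kernel hangs forever, a positive value means reboot after that many seconds, and a negative value means reboot immediately. The post doesn't record the exact value or file used here, so the following is only a minimal sketch under assumptions: the function names and the sample timeout of 10 seconds are hypothetical, chosen purely for illustration.

```python
from pathlib import Path
from typing import Optional


def panic_timeout(sysctl_text: str) -> Optional[int]:
    """Return the last kernel.panic value found in sysctl-style text, or None."""
    timeout = None
    for raw in sysctl_text.splitlines():
        line = raw.strip()
        # Skip blank lines and comments (sysctl.conf allows both '#' and ';').
        if not line or line.startswith(("#", ";")):
            continue
        key, sep, value = line.partition("=")
        if sep and key.strip() == "kernel.panic":
            try:
                timeout = int(value.strip())
            except ValueError:
                pass  # Ignore malformed values rather than guessing.
    return timeout


def reboots_on_panic(timeout: Optional[int]) -> bool:
    """True if the kernel would reboot after a panic (any non-zero value)."""
    return timeout is not None and timeout != 0


if __name__ == "__main__":
    # The live value is exposed under /proc on a running Linux system.
    live = int(Path("/proc/sys/kernel/panic").read_text())
    print("kernel.panic =", live, "| reboots on panic:", reboots_on_panic(live))

    # Hypothetical config line; 10 seconds is an example, not the author's value.
    sample = "# reboot shortly after a panic\nkernel.panic = 10\n"
    print("sample config reboots:", reboots_on_panic(panic_timeout(sample)))
```

The parser exists only to sanity-check a config file before a long absence; the authoritative answer is always the live value under /proc.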