So far my record is 3 weeks. It was a hiesenbug triggered when two different ebpf based systems raced with each other. Ebpf is a great tool in the right place but is it ever a pain in the ass to debug.
The fix ended up being one character -> change the priority of an ebpf tc filter from 0 to 1.