Some analysts online have shown debugging data from crash dumps and minimal reverse engineering. By their account it's a null reference to a pointer in a system driver. That's something unit testing should have easily caught ... if used.
So here is what we know.
- Trivial error in the software, running as a system driver.
- Insufficient testing.
- Insufficient control over large scale rollouts.
- Not previously sharing release notes with customers.
- Not previously allowing customers to control timing of rollouts.
- Not previously allowing customers to use automated staged rollouts.
As someone working with governance in Enterprise IT, I am astonished they got this big without their customers challenging these things.
It's truly a WTF moment for the industry.