Resolved: Kerberos Server Outage

One of the three Kerberos servers, fido (128.118.2.6), crashed with a hardware failure at 4:56. The system was restarted and was functioning as of 6:56. The cause of the hardware failure is a bad memory DIMM and is offline. During the hardware failure users may have had issues with authenticating to services. After the fido was restarted, data replication of password changes were not completely caught up until 7:14. So, users may have received a failed authentication, if they changed their password from 4:56-7:14.

More information: Resolved: Kerberos Server Outage

Story added 6. September 2013, content source with full text you can find at link above.