server crashes

hi Im going to replace a server with a new one...
In the logs I find
Nov 22 02:20:54 sv01 kernel: Uhhuh. NMI received for unknown reason 31 on CPU 0.
Nov 22 02:20:54 sv01 kernel: Do you have a strange power saving mode enabled?
Nov 22 02:20:54 sv01 kernel: Dazed and confused, but trying to continue
Oct 8 04:17:03 sv01 syslogd 1.4.1: restart.

As you see, there is a false date Oct. 8

NMI seems to be watchdog. But the error 31 may be a faulty memory
The end of the log says
Oct 8 04:19:47 sv01 kernel: usb 1-1: USB disconnect, address 2
Then a reboot
Nov 23 14:11:03 sv01 named[1973]: *** POKED TIMER ***

and the server is again running.

Any ideas to what to think seeing these logentries? I checked other logs but no relevant lines in there.

Anyway, Ill pick up the new server today, restore the domains and replace the hardware tomorrow or so.

thank you

Status: 
Active

Comments

ronald's picture
Submitted by ronald on Mon, 11/23/2009 - 10:20 Pro Licensee

In the meantime I have indentified a power supply issue. the server is using 2A. The spikes may have also corrupted the BIOS or motherboard battery.

the 8 oct. logentries are dated 1973 .. lol

Looks like some kind of clock issue. There isn't much we can do to debug hardware problems though ..