Re: SunFire Server Hangs

From: De DBA <dedba_at_tpg.com.au>
Date: Wed, 11 Mar 2015 09:55:45 +1000
Message-ID: <54FF8481.30507_at_tpg.com.au>



Hi Ian,

Could it be that the OS is actually down, but the OpenBoot console is still running? I'm not sure whether that would answer a ping addressed to the OS, but it's worth a shot.

Cheers,
Tony

On 11/03/15 02:52, MacGregor, Ian A. wrote:
> Some more information:
>
> The machines are of two types: SunFire X4250 and X4270. /etc/release on one machine is
>
> Solaris 10 10/09 s10x_u8wos_08a X86
> Copyright 2009 Sun Microsystems, Inc. All Rights Reserved.
> Use is subject to license terms.
> Assembled 16 September 2009
>
> and on the other two
>
> cat /etc/release
> Solaris 10 5/09 s10x_u7wos_08 X86
> Copyright 2009 Sun Microsystems, Inc. All Rights Reserved.
> Use is subject to license terms.
> Assembled 30 March 2009
>
> We do apply Solaris patching quarterly.
>
> The machine definitely tried to halt. After bringing the machine back up, “last reboot” shows the system down time at the time of the “freeze”. The machine continued to respond to ping after that time. The response came from the server itself, not from any firewall. So despite the reported system down time, at least part of the OS was up.
>
> There is nothing in /var/log/messages, nor in any file that a monitoring utility might write; all I/O ceased at the time of the freeze. As far as we can tell, no resources were in any danger of exhaustion before the freeze.
> But that could be because it happened too quickly to be sampled. I don’t think resource exhaustion is at all likely here.
>
> There is a single RAID controller. I thought the system disks might not be under this controller, but it turns out they are.
> So the complete loss of the I/O system is looking more likely.
>
> I did find we were a bit back level on the BIOS.
>

... <snip>

--
http://www.freelists.org/webpage/oracle-l
Received on Wed Mar 11 2015 - 00:55:45 CET
