Re: Possible reasons for load average high and CPU 90% idle on RHEL 7.6

From: Tanel Poder <tanel_at_tanelpoder.com>
Date: Tue, 24 Aug 2021 18:43:38 -0400
Message-ID: <CAMHX9JLxMcP47dM5SqEcka3nQKaTvj+u4gwBn0XEHAKT2Du9Nw_at_mail.gmail.com>



It can also be Linux kernel bugs combined with asynchronous IO (or other reasons for spawning lots of kworker processes) that end up stuck for too long at some stage.

I wrote how to drill down into these problems on Linux here:

https://tanelpoder.com/posts/high-system-load-low-cpu-utilization-on-linux/

Tanel.

On Mon, Aug 23, 2021 at 7:29 AM Goti <aryan.goti_at_gmail.com> wrote:

> HI Listers,
>
> We had a situation where the load average spiked to more than 2000+ and
> continued the saw way for about 45 minutes. During this entire time we had
> CPU 95% idle. One thing that was observed was that we had a sudden increase
> in the number of connections from 1800 to around 3700. We had 90Gb of free
> memory throughout when the load average spiked.
>
> Server has 48 CPU with 512 Gb of RAM running on RHEL 7.6.
>
> Is this something expected?
>
>
> Thanks,
> Goti
>

--
http://www.freelists.org/webpage/oracle-l
Received on Wed Aug 25 2021 - 00:43:38 CEST

Original text of this message