Re: Large discrepancy between 'log file parallel write' and 'db file parallel write' times

From: John Clarke <john.clarke_at_centroid.com>
Date: Mon, 31 Dec 2012 02:01:28 +0000
Message-ID: <5EE1E2A57A88A740B5E3B5F945EFCDF40B2334E8_at_ORD2MBX01F.mex05.mlsrvr.com>



Redo log and control file writes are considered "high priority" I/O consumers and get lumped into the high priority background I/O queue, so cellsrv will satisfy these requests at a higher priority than anything else. DBWR writes (which your db file parallel write waits is associated with) are prioritized the same as user I/O, relatively speaking, and a handful of other background-related writes and lower than log file/control file writes.

Under heavy I/O load and saturation, this behavior should be what you may expect.

  • John

From: Matt McClernon <mccmx_at_hotmail.com<mailto:mccmx_at_hotmail.com>> Reply-To: "mccmx_at_hotmail.com<mailto:mccmx_at_hotmail.com>" <mccmx_at_hotmail.com<mailto:mccmx_at_hotmail.com>> Date: Sunday, December 30, 2012 5:51 PM
To: "oracle-l_at_freelists.org<mailto:oracle-l_at_freelists.org>" <oracle-l_at_freelists.org<mailto:oracle-l_at_freelists.org>> Subject: Large discrepancy between 'log file parallel write' and 'db file parallel write' times

Oracle 11.2.0.2 EE on Exadata X2-2 quarter rack with extra cells (7 cells in total).

Under heavy I/O load (i.e. when the cells are saturated) we are seeing a very large difference between the average latencies of 'log file parallel write' and 'db file parallel write' waits.

From a 30 minute AWR report in which each cell was performing its theoretical maximum 1500 IOPS we observe the following write latencies:

DBFPW = 263ms
LFPW = 2ms

when the I/O subsystem is not pushed to its limits the values for both wait events are in the order of single digit milliseconds.

So my question is why do the DBWR write times suffer so much during I/O bottlenecks when the LGWR write times do not..?

The LGWR writes will be significantly smaller and are sequential as opposed to the DBWR writes which are random writes, but are those differences enough to justify the large gap, or is it possible that we are losing 'db file parallel write' time to some other processing..?

and are the LGWR writes genuinely sequential..? because ultimately the redo logs are on the same physical disks as the datafiles because both the RECO and DATA diskgroups are carved from the same cell disks.

Matt --
http://www.freelists.org/webpage/oracle-l

--
http://www.freelists.org/webpage/oracle-l
Received on Mon Dec 31 2012 - 03:01:28 CET

Original text of this message