How do I find which SQL is causing high CPU in Oracle 19c?

Query V$SQL ordered by CPU_TIME descending, or use ASH (V$ACTIVE_SESSION_HISTORY) filtering on SESSION_STATE='ON CPU' to pinpoint the exact SQL_ID consuming the most CPU in real time.

Can parallel query cause Oracle high CPU usage?

Yes. When parallel degree of parallelism exceeds the number of available CPU cores, Oracle spawns more PX slave processes than the OS can schedule, causing context-switching overhead that manifests as runaway CPU consumption.

What is latch contention and how does it spike CPU in Oracle?

Latch contention occurs when multiple sessions spin-wait on library cache or shared pool latches. Spin-waiting burns CPU in a tight loop. A small shared pool relative to parsing workload or hard parsing of non-reusable SQL are the most common triggers.

How do I fix oracle high cpu usage fix 19c without restarting the database?

Kill the offending session with ALTER SYSTEM KILL SESSION, apply a SQL Profile or Baseline to fix a bad execution plan, limit parallel degree with ALTER SESSION or system-level PARALLEL_DEGREE_POLICY, and resize the shared pool online with ALTER SYSTEM SET SHARED_POOL_SIZE — all without requiring a database restart.

How do you read an Oracle AWR report in 19c quickly?

Skip the top-to-bottom read. Start with the header to confirm the snapshot window and DB Time, then jump straight to the Top 10 Foreground Wait Events. That section tells you where time was actually spent. Trace the biggest wait back to the Top SQL section, and you have your suspect in under five minutes.

What is the first section to check when you read an Oracle AWR report in 19c?

Check the report header and the DB Time figure first. The header confirms the instance, the snapshot interval, and the elapsed time. DB Time divided by elapsed time tells you roughly how many sessions were active. If DB Time is far higher than your CPU count, the instance was overloaded during that window.

What is a healthy DB Time to Elapsed Time ratio?

There is no single magic number, but a useful rule is to compare average active sessions against your CPU core count. If DB Time divided by elapsed time stays near or below the number of CPU cores, the instance had headroom. When it runs several times higher than the core count, sessions were queuing and the database was CPU or wait bound.

How long should the AWR snapshot window be?

Keep the window tight around the problem. A one-hour report that spans both a quiet period and a spike will average the spike away and hide it. For a sharp incident I generate a report covering only the two or three snapshots around the slowdown, often a 30 to 60 minute window, so the wait profile reflects the real event.

Is AWR free to use in Oracle 19c?

No. AWR, ASH, and the related advisors are part of the Oracle Diagnostics Pack, which is a separately licensed option on Enterprise Edition. Running awrrpt.sql on an unlicensed instance is a license violation. If you do not hold the pack, use Statspack instead, which is free but collects less detail than AWR.

Why are standby redo logs not applying in Oracle Data Guard?

Standby redo logs not applying is most often caused by: MRP process stopped or never started, SRL groups missing or misconfigured, incorrect VALID_FOR in LOG_ARCHIVE_DEST_2, or a redo sequence gap. Run SELECT PROCESS, STATUS FROM V$MANAGED_STANDBY WHERE PROCESS='MRP0' as your first diagnostic step.

How do I check if standby redo logs are applying in Oracle Data Guard?

Run SELECT PROCESS, STATUS, SEQUENCE# FROM V$MANAGED_STANDBY on the standby. Look for MRP0 with STATUS=APPLYING_LOG and an incrementing SEQUENCE#. Also check V$DATAGUARD_STATS WHERE NAME='apply lag' to confirm lag is near zero. MRP0 absent or WAIT_FOR_LOG means apply has stalled.

How many standby redo log groups should I create?

Oracle recommends at least one more SRL group than the number of online redo log groups per thread on the primary. For a single-instance primary with 3 online redo log groups, create 4 SRL groups. Each SRL member must be at least as large as the largest online redo log member, adding 10 percent extra is recommended.

What does ORA-16766 mean in a Data Guard environment?

ORA-16766 means the redo apply service is disabled on the standby. This fires when MRP is stopped or crashed. Confirm SRL groups exist at correct size, then issue ALTER DATABASE RECOVER MANAGED STANDBY DATABASE DISCONNECT FROM SESSION. Monitor V$MANAGED_STANDBY until MRP0 shows APPLYING_LOG.

Can I fix Data Guard apply lag without a full resync?

Yes. For most apply-lag incidents caused by MRP outages or SRL misconfiguration, a full resync is not required. Restart MRP, verify SRL status in V$STANDBY_LOG, and check for archive gaps using V$ARCHIVE_GAP. A full resync is only needed when the standby SCN has diverged from the primary after an incomplete recovery.

Why is the MRP process not running in Oracle Data Guard?

The MRP process stops in Oracle Data Guard for five main reasons: manual cancellation during maintenance, missing Standby Redo Logs preventing real-time apply, Data Guard Broker setting apply state to OFF, protection mode conflict causing MRP abort, or no auto-start after standby instance crash. Each cause has a different fix command.

How do I restart the MRP process in Oracle Data Guard 19c?

Run on the standby: ALTER DATABASE RECOVER MANAGED STANDBY DATABASE USING CURRENT LOGFILE DISCONNECT FROM SESSION; Then verify with: SELECT process, status FROM v$managed_standby WHERE process='MRP0'; If using Data Guard Broker, run in DGMGRL: EDIT DATABASE standby_db SET STATE='APPLY-ON';

What is the difference between MRP not running and ORA-16766?

They describe the same condition from different perspectives. ORA-16766 is the error code Oracle reports in DGMGRL SHOW CONFIGURATION output. MRP not running is the underlying technical reason for that error. Both mean the Managed Recovery Process on the standby has stopped and redo is no longer being applied.

Why does MRP stop immediately after I restart it?

Recurring MRP stop after restart points to three causes: Data Guard Broker has apply set to OFF and overrides your manual restart (fix via DGMGRL), missing Standby Redo Logs cause MRP to fail on start (create SRLs first), or Maximum Protection mode with an unstable standby connection causes repeated abort (lower to Maximum Availability).

How do I prevent MRP from stopping after a standby reboot?

Create an AFTER STARTUP database trigger on the standby that checks the database role and automatically runs ALTER DATABASE RECOVER MANAGED STANDBY DATABASE USING CURRENT LOGFILE DISCONNECT. Alternatively, enable Data Guard Broker with dg_broker_start=TRUE and set the database state to APPLY-ON, which makes the broker auto-restart MRP after any restart.

Showing posts with label Production DBA. Show all posts

Monday, June 8, 2026

Oracle High CPU Usage: Causes and Fix in 19c

Oracle 19c Performance Tuning

Oracle High CPU Usage: Causes and Fix in 19c

 Chetan Yadav — Senior Oracle DBA ⏱ 10 min read  Troubleshoot Guide

How to Read Oracle AWR Report in 19c: DBA Guide

How to Read an Oracle AWR Report in 19c

A practical reading order from real production incidents, not a feature tour.

Oracle 19c Performance Tuning AWR Report Production DBA

how to read oracle awr report 19c data center servers

An AWR report is a snapshot of where your instance spent its time. Reading it in the right order is half the battle.

02:14. The on-call page hit: checkout API p95 had jumped from 180 ms to 4.2 seconds. No errors. No node eviction. No failover. Just a database that had quietly gone slow under a normal load. The first artifact I pulled was a one-hour AWR report, and within four minutes it pointed straight at the cause.

If you have ever stared at a 30-page AWR report and not known where to look first, this guide is for you. Knowing how to read an Oracle AWR report in 19c is not about understanding every section. It is about reading a handful of sections in the right order so you can go from "the database is slow" to "this SQL on this object is the problem" in minutes. That is exactly what I did at 02:14, and it is the workflow I will walk you through here.

AWR (Automatic Workload Repository) takes regular snapshots of performance statistics and stores them in the SYSAUX tablespace. A report compares two snapshots and shows you the delta: what the instance did, where it waited, and which statements drove the load. The trick is to stop reading top to bottom and start reading by importance.

Who this guide is for

Junior and mid-level DBAs who can generate an AWR report but freeze when it comes to interpreting it, and senior engineers who want a tighter triage checklist for incidents. Examples use Oracle 19c, but the reading order applies to 12c and 18c as well.

Standby Redo Logs Not Applying Oracle Data Guard Fix

Standby Redo Logs Not Applying in Oracle Data Guard: Complete Fix Guide

MRP process troubleshooting, SRL configuration, apply lag resolution, and parallel apply tuning from 15 years of production Oracle environments.

Oracle 19c / 21c Data Guard Troubleshooting Senior DBA Production Guide

standby redo logs not applying oracle data guard server room terminal screens

Who This Is For: Junior to Senior Oracle DBAs managing Oracle 19c or 21c Data Guard environments who are troubleshooting standby redo logs not applying, MRP process failures, or unexplained apply lag. Freshers studying for Oracle certification will also benefit from the architecture walkthrough.

It was 2:41 AM. PagerDuty fires. The on-call message reads: "Data Guard apply lag 38 minutes and climbing." I SSH into the standby. ORA-16766 stares back at me from the alert log. A quick check of V$MANAGED_STANDBY confirms it -- MRP0 is gone. No standby redo logs are applying. The business had a 4-hour RPO commitment. We had maybe 90 minutes before the DBA team had a very uncomfortable conversation with the CTO.

Standby redo logs not applying in Oracle Data Guard is one of the highest-stress incidents a production DBA faces. It is also one of the most fixable -- if you know the exact diagnostic tree. In this post I am going to walk through every root cause I have encountered across 15 years of Oracle production work, the precise SQL to diagnose each one, and the fix you run to get apply moving again.

Environment Reference: Oracle 19c (19.19 RU), 2-node RAC primary + single-node physical standby, SYNC transport, 6 TB database, peak 12,000 TPS. All SQL verified in this environment. Steps also apply to Oracle 21c and Oracle 23ai Data Guard configurations.

MRP Process Not Running in Data Guard: Fix in Oracle 19c

MRP Process Not Running in Data Guard: Fix It Step-by-Step (Oracle 19c)

MRP Process Not Running in Data Guard? Fix It Step-by-Step (Oracle 19c)

5 Root Causes, DGMGRL Diagnosis and Exact Fix Commands for Every Scenario

📅 April 2026

👤 Chetan Yadav, Senior Oracle & Cloud DBA, Oracle ACE Apprentice

⏱️ 12,14 min read

⏱️ Estimated Reading Time: 12,14 minutes

MRP Not Running, ORA-16766, DGMGRL Fix, SRL Creation, Broker State, Auto-Start Trigger, Oracle 19c

mrp process not running data guard oracle 19c server infrastructure troubleshooting guide

⚙️ Environment Referenced in This Article

Oracle Database: 19.18.0.0.0 Enterprise Edition • Primary: 2-Node RAC, 4.8 TB OLTP, 2,800 TPS
Standby: Physical Standby with Active Data Guard enabled
Protection Mode: Maximum Availability (SYNC/AFFIRM) • Broker: Data Guard Broker enabled

The monitoring alert arrived at 2:48 AM: "Standby apply lag crossing 90 minutes." I connected to DGMGRL immediately. SHOW CONFIGURATION confirmed it: the MRP process was not running on the standby. Every transaction committed on the primary for the past 90 minutes was sitting unprocessed in Standby Redo Logs, and the gap was growing by the second.

In my 15 years managing Oracle production environments, a stopped MRP process is one of the most common Data Guard incidents I have resolved. It is not complicated once you know which of the five root causes you are dealing with. The problem is that each cause has a completely different fix, and applying the wrong one wastes critical time.

This guide gives you the exact decision path, the diagnostic commands to identify your specific cause, and the precise fix for each scenario. In most cases the MRP process not running in Data Guard is resolved in under 5 minutes.

SAP HANA Logging Behavior Explained: Commit, Savepoint & Crash Recovery (Real Production Guide)

⏱️ Estimated Reading Time: 12–14 minutes

SAP HANA Logging Behavior: What Really Happens During a Production Crash

It’s 2:13 AM. Your SAP application suddenly freezes. End users can’t post transactions. SAP HANA Studio shows the database restarting. Phone calls start coming in.

“Did we lose data?”

In real production environments, this question does not depend on luck. It depends entirely on how SAP HANA logging behavior works behind the scenes.

This article explains SAP HANA logging not as documentation theory, but as it behaves during real crashes, restarts, and recovery situations.

Oracle Listener Health Check: Preventing Silent Production Outages

This guide explains how Oracle Listener failures silently impact production systems Oracle Listener Health Check – Production Monitoring Guide | Chetan Yadav

⏱️ Estimated Reading Time: 14 minutes

Oracle Listener Health Check

It’s 2 AM. Your phone lights up with alerts. Applications are down, dashboards are red, and every connection attempt fails with TNS-12541: TNS:no listener. The database is up — but the business is still dead.

In real production environments, a failed Oracle Listener can block thousands of users, cause SLA breaches, and trigger revenue loss within minutes. We’ve seen P99 login latency jump from milliseconds to total outages.

This guide shows how to implement a production-grade Oracle Listener health check using scripts, monitoring logic, and automation — before the listener becomes your single point of failure.

Chetan Yadav

Pages

Monday, June 8, 2026

Oracle High CPU Usage: Causes and Fix in 19c

Oracle High CPU Usage: Causes and Fix in 19c

Monday, June 1, 2026

How to Read Oracle AWR Report in 19c: DBA Guide

How to Read an Oracle AWR Report in 19c

Monday, May 25, 2026

Standby Redo Logs Not Applying Oracle Data Guard Fix

Standby Redo Logs Not Applying in Oracle Data Guard: Complete Fix Guide

Monday, May 11, 2026

MRP Process Not Running in Data Guard: Fix in Oracle 19c

MRP Process Not Running in Data Guard? Fix It Step-by-Step (Oracle 19c)

Sunday, January 11, 2026

SAP HANA Logging Behavior Explained: Commit, Savepoint & Crash Recovery (Real Production Guide)

SAP HANA Logging Behavior: What Really Happens During a Production Crash

Thursday, January 8, 2026

Oracle Listener Health Check: Preventing Silent Production Outages

Oracle Listener Health Check