PostgreSQL ERROR: could not obtain lock — diagnosis and recovery
ERROR: could not obtain lock on row in relation and ERROR: canceling statement due to lock timeout mean a query requested a lock but PostgreSQL refused to wait. The database is not down; a session is holding a resource another transaction needs.
Three variants produce these errors:
SELECT ... FOR UPDATE NOWAITfails immediately if the row is locked.- A statement that exceeds
lock_timeoutfails after waiting. - DDL such as
ALTER TABLEorCREATE INDEXrequiresAccessExclusiveLockand will wait or fail depending on session configuration.
These often cascade: one long-running query blocks a schema change, the schema change queues behind it, and subsequent queries queue behind the DDL until the connection pool exhausts.
What this means
PostgreSQL’s MVCC keeps ordinary DML from blocking reads or writes. Explicit row-level locking (FOR UPDATE, FOR SHARE), DDL, and maintenance commands use heavyweight locks outside MVCC. When a session requests a conflicting lock, it enters a wait queue.
NOWAIT raises an error immediately instead of queuing. lock_timeout cancels the waiter after the set duration. Without either, the session waits indefinitely, tying up a connection and often blocking operations behind it.
DDL such as ALTER TABLE, TRUNCATE, and DROP TABLE requires AccessExclusiveLock, which conflicts with all other access. Once DDL queues for a table, every new conflicting lock request queues behind it in FIFO order. A single long-running SELECT becomes a table-wide jam.
Row-level contention is subtle. Because row locks are stored on disk, pg_locks often shows a waiting process blocked on the holder’s transactionid rather than on the tuple itself. This confuses naive lock diagnosis queries.
flowchart LR
A[Lock error] --> B{NOWAIT or timeout?}
B -->|NOWAIT| C[Check pg_stat_activity for row lock holder]
B -->|Timeout| D[Check pg_locks.waitstart]
D --> E[Use pg_blocking_pids to find root blocker]
C --> E
E --> F{Blocker state}
F -->|Active query| G[Evaluate terminate]
F -->|Idle in transaction| H[Kill immediately]
F -->|DDL queue| I[Cancel DDL or wait]Common causes
| Cause | What it looks like | First thing to check |
|---|---|---|
| NOWAIT row lock | Application logs show “could not obtain lock on row in relation” immediately after FOR UPDATE NOWAIT | pg_stat_activity for the PID holding the row lock |
| lock_timeout expiration | Logs show “canceling statement due to lock timeout” after a wait | pg_locks for ungranted locks with waitstart older than lock_timeout |
| DDL blocked by long query | ALTER TABLE or CREATE INDEX hangs or fails; subsequent queries on the same table also hang | pg_stat_activity for active queries on the target relation older than a few minutes |
| Lock queue behind DDL | Multiple sessions show wait_event_type = 'Lock' on the same relation after a DDL statement was issued | pg_locks ordered by granted to see the DDL statement at the head of the queue |
| Idle in transaction holding locks | pg_stat_activity shows idle in transaction with an old state_change; row or table locks are held | pg_stat_activity.state and xact_start age |
Quick checks
Run these in a psql session as a superuser or a user with pg_monitor privileges.
-- Check for sessions currently waiting on locks
SELECT pid, usename, state, wait_event_type, wait_event, query_start, query
FROM pg_stat_activity
WHERE wait_event_type = 'Lock';
-- Find blockers for current lock waiters
SELECT pid, pg_blocking_pids(pid) AS blockers, query
FROM pg_stat_activity
WHERE wait_event_type = 'Lock';
-- List ungranted locks with query details
SELECT l.pid, l.locktype, l.relation::regclass, l.mode, l.granted,
a.query_start, a.query
FROM pg_locks l
JOIN pg_stat_activity a ON l.pid = a.pid
WHERE NOT l.granted;
-- Find long-running active queries that may block DDL
SELECT pid, query_start, query
FROM pg_stat_activity
WHERE state = 'active'
AND query_start < NOW() - INTERVAL '5 minutes';
-- Check if lock wait logging is enabled
SHOW log_lock_waits;
-- Review lock timeout and related settings
SELECT name, setting, unit FROM pg_settings
WHERE name IN ('lock_timeout', 'statement_timeout', 'deadlock_timeout', 'idle_in_transaction_session_timeout');
How to diagnose it
Identify the variant from the error message. “Could not obtain lock on row in relation” indicates
NOWAIT. “Canceling statement due to lock timeout” indicates a timeout. DDL failures indicateAccessExclusiveLockcontention.Find the session in
pg_stat_activity. Note the PID and error time. If still connected,wait_event_typeisLock.Walk the blocker chain with
pg_blocking_pids(). The immediate blocker may itself be waiting; recurse until you reach the granted lock holder.Inspect the blocker in
pg_stat_activity. Checkstate,xact_start, andquery.idle in transactionmeans it holds locks while doing no work.activewith a long runtime may be a legitimate query.Determine lock type from
pg_locks. Filter by the waiting PID and checklocktypeandrelation::regclass. Row-level waits often surface astransactionidwaits, not tuple locks.Calculate wait duration. On PostgreSQL 16+, use
pg_locks.waitstart. On earlier versions, infer frompg_stat_activity.query_start, remembering thatquery_startis the query start time, not necessarily the lock wait start.Decide: terminate the blocker or cancel the waiter. Abandoned
idle in transactionsessions are usually safe to terminate. Critical long-running reports may require canceling the DDL orNOWAITquery instead. In a DDL queue cascade, canceling the DDL usually unblocks the queue fastest.Verify. Re-run the quick checks. Confirm
pg_locksshows no ungranted locks on the affected relation andpg_stat_activity.wait_event_typehas cleared.
Metrics and signals to monitor
| Signal | Why it matters | Warning sign |
|---|---|---|
pg_stat_activity.wait_event_type = 'Lock' | Indicates active lock contention | Any sustained count above baseline |
pg_locks ungranted with waitstart older than 30 seconds | Quantifies how long sessions have been stalled | Individual waits approaching lock_timeout |
pg_stat_database.deadlocks | Deadlocks force aborts and retries | Counter incrementing between samples |
pg_stat_activity idle in transaction age | Idle transactions hold row locks and block DDL and VACUUM | state_change older than 5 minutes |
log_lock_waits log entries | Audit trail showing which statements waited longer than deadlock_timeout | Recurring entries for the same relation or query pattern |
Fixes
NOWAIT row-level lock failures
If the business logic tolerates waiting, remove NOWAIT and rely on lock_timeout. For queue-like workloads, use FOR UPDATE SKIP LOCKED, which skips locked rows instead of failing.
Tradeoff: SKIP LOCKED skips rows. Use it for work queues, not for operations that must process every row in order.
lock_timeout cancellations
Do not retry with a longer timeout until you identify and resolve the blocker.
Set application lock_timeout lower than statement_timeout so lock waits fail fast while runaway queries are still caught. A production-safe starting point is lock_timeout = '10s' and statement_timeout = '30s'.
For DDL, set a brief lock_timeout (1 to 5 seconds) before ALTER TABLE. If the lock is unavailable, the statement fails instead of queuing and blocking traffic.
DDL blocked by long queries
Identify the long query on the target table. Terminate it only if non-critical. If the query is critical, cancel the DDL and reschedule it. If the DDL has already acquired its lock and queries queue behind it, canceling the DDL frees the queue immediately.
Use CREATE INDEX CONCURRENTLY instead of plain CREATE INDEX during traffic. It takes longer and cannot run inside a transaction, but it avoids blocking reads and writes.
For table reorganization, prefer pg_repack over VACUUM FULL. pg_repack rewrites the table online and only needs a brief ACCESS EXCLUSIVE lock at the final swap.
Tradeoff: pg_repack requires a primary key or unique NOT NULL index and roughly twice the table’s size in free disk space.
Idle in transaction blockers
WARNING: pg_terminate_backend() forcibly closes the connection and rolls back its transaction. Confirm the PID and its impact before running.
If pg_stat_activity shows idle in transaction as the blocker:
SELECT pg_terminate_backend(pid)
FROM pg_stat_activity
WHERE state = 'idle in transaction'
AND state_change < NOW() - INTERVAL '5 minutes';
Then set idle_in_transaction_session_timeout between 60 seconds and 5 minutes to prevent abandoned transactions from holding locks indefinitely.
Prevention
- Set short
lock_timeoutfor DDL. Run migrations with a brief timeout so they fail fast instead of queuing and serializing traffic. - Set
idle_in_transaction_session_timeoutto'2min'or'5min'. - Keep transactions short. Do not open a transaction, call an external service, then commit. Hold locks only for database work.
- Use
CREATE INDEX CONCURRENTLY. Never run plainCREATE INDEXon a production table during traffic. - Schedule heavy DDL in maintenance windows. Even with timeouts, some DDL requires exclusive locks. Run them when long queries are unlikely.
- Review long-running queries before they block maintenance. A 30-minute report is a blocker waiting to happen.
- Monitor
pg_stat_database.deadlocks. Rising counts indicate inconsistent lock ordering or locks held too long.
How Netdata helps
- Correlates PostgreSQL connection states with system metrics to distinguish lock contention from CPU or disk saturation.
- Surfaces
pg_stat_activitystate distributions (active,idle,idle in transaction) without manual polling. - Alerts on sustained
idle in transactiondurations that indicate abandoned lock holders. - Tracks query latency percentiles alongside lock wait events to confirm whether a latency spike is caused by contention or plan regression.
- Visualizes replication lag on replicas to rule out streaming delays when diagnosing primary-side lock issues.
Related guides
- How PostgreSQL actually works in production: a mental model for operators: /guides/postgres/how-postgres-works-in-production/
- PostgreSQL connection exhaustion: detection, diagnosis, and prevention: /guides/postgres/postgres-connection-exhaustion/
- PostgreSQL connection refused: pg_hba, listen_addresses, and TCP diagnosis: /guides/postgres/postgres-connection-refused/
- PostgreSQL idle in transaction: detecting and killing zombie sessions: /guides/postgres/postgres-idle-in-transaction/
- PostgreSQL monitoring checklist: the signals every production database needs: /guides/postgres/postgres-monitoring-checklist/
- PostgreSQL monitoring maturity model: from reactive to self-healing: /guides/postgres/postgres-monitoring-maturity-model/
- PgBouncer pool exhausted: how to diagnose and fix client waits: /guides/postgres/postgres-pgbouncer-pool-exhausted/
- PostgreSQL FATAL: too many connections - causes and fixes: /guides/postgres/postgres-too-many-connections/






