MySQL Waiting for table metadata lock: diagnosing the DDL stall
You run SHOW PROCESSLIST and see a wall of threads in Waiting for table metadata lock. An ALTER TABLE that should finish in seconds hangs for minutes. Queries against one table stop returning, the application connection pool drains, and CPU and disk look calm. This is a metadata lock (MDL) stall. It is a queueing failure, not resource exhaustion. Left alone, it cascades into connection exhaustion and a partial outage where every other table continues to work normally.
What this means
MySQL uses metadata locks to protect table definitions. When a transaction accesses a table, it acquires a shared MDL. DDL statements such as ALTER TABLE, DROP TABLE, RENAME TABLE, or TRUNCATE TABLE need an exclusive MDL. If a long-running transaction or an idle connection with an open transaction holds a shared MDL, the DDL waits. Once the DDL is waiting for the exclusive lock, every new DML query against that table queues behind the DDL request. The shared lock from the original transaction blocks the exclusive request, and the exclusive request blocks all subsequent shared requests. Within minutes the queue grows, Threads_connected climbs toward max_connections, and the Questions rate for that table drops to zero.
Metadata locks are separate from InnoDB row locks. They do not appear in SHOW ENGINE INNODB STATUS or data_lock_waits. That is why the stall is invisible to InnoDB lock monitoring, and why operators search for the exact state string Waiting for table metadata lock.
flowchart TD
T[Idle transaction with open trx] -->|holds SHARED MDL| Tab[Target table]
DDL[DDL e.g. ALTER TABLE] -->|needs EXCLUSIVE MDL| Tab
DML[Incoming DML] -->|needs SHARED MDL| Tab
T -->|blocks| DDL
DDL -->|blocks| DMLCommon causes
| Cause | What it looks like | First thing to check |
|---|---|---|
| DDL during active load | One thread running ALTER TABLE, DROP TABLE, RENAME TABLE, or TRUNCATE TABLE; all waiters name the same table | SHOW FULL PROCESSLIST filtered by the table name |
| Idle transaction holding shared MDL | Oldest row in INNODB_TRX has trx_query IS NULL and age in minutes or hours | SELECT trx_mysql_thread_id, trx_started, trx_query FROM information_schema.INNODB_TRX ORDER BY trx_started ASC LIMIT 5 |
| Backup or dump with table locking | Connection from a backup host running FLUSH TABLES WITH READ LOCK or LOCK TABLES | SHOW FULL PROCESSLIST for a backup user or LOCK TABLES statements |
| ORM or connection pool leak | Multiple application connections in Sleep with high Time values and open transactions | SHOW FULL PROCESSLIST for Command = 'Sleep' from application hosts |
Quick checks
# Count metadata lock waiters across all tables
mysql -e "SELECT OBJECT_SCHEMA, OBJECT_NAME, LOCK_TYPE, COUNT(*) AS waiting FROM performance_schema.metadata_locks WHERE LOCK_STATUS = 'PENDING' AND OBJECT_TYPE = 'TABLE' GROUP BY OBJECT_SCHEMA, OBJECT_NAME, LOCK_TYPE;"
# Compare active connections to the configured limit
mysql -e "SHOW GLOBAL STATUS LIKE 'Threads_connected'; SHOW GLOBAL VARIABLES LIKE 'max_connections';"
# Rule out InnoDB row lock contention
mysql -e "SHOW GLOBAL STATUS LIKE 'Innodb_row_lock_current_waits';"
# Find the oldest open transaction
mysql -e "SELECT trx_mysql_thread_id, trx_started, TIMESTAMPDIFF(SECOND, trx_started, NOW()) AS age_seconds, trx_rows_modified, trx_query FROM information_schema.INNODB_TRX ORDER BY trx_started ASC LIMIT 5;"
# Isolate metadata lock states and DDL in the processlist
mysql -e "SHOW FULL PROCESSLIST" | grep -E "Waiting for table metadata lock|ALTER TABLE|DROP TABLE|RENAME TABLE|TRUNCATE TABLE"
# Sample Questions to check for throughput collapse
mysql -e "SHOW GLOBAL STATUS LIKE 'Questions';"
# Take a second sample after 10 seconds and compute the rate.
How to diagnose it
Confirm the scope. Run
SHOW FULL PROCESSLIST. If every waiter names the same table and the state isWaiting for table metadata lock, you have a targeted MDL stall. If many unrelated tables are affected, look for a global read lock from a backup instead.Find the DDL. Look for a thread whose
InfocontainsALTER TABLE,DROP TABLE,RENAME TABLE, orTRUNCATE TABLE. Note itsIdandTime. If there is no DDL visible, a prior DDL may have timed out and left the queue in a confused state, or the stall is caused by aLOCK TABLESstatement.Find the blocking transaction. Query
information_schema.INNODB_TRXordered bytrx_started. The oldest transaction is almost always the blocker. Pay attention totrx_mysql_thread_idandtrx_query. Iftrx_queryisNULL, the connection is idle between statements but still holds the lock. This is the most common production scenario.Map the lock ownership in Performance Schema. Query
performance_schema.metadata_locksfor rows withLOCK_STATUS = 'GRANTED'andOBJECT_NAMEmatching the affected table. If this table is empty, themetadata_locksinstrument may not be enabled.Measure the cascade. Compare
Threads_runningandThreads_connectedtomax_connections. A risingThreads_connectedwith a flat or lowThreads_runningwhileQuestionsdrops indicates the queue is deepening. FlatThreads_runningmeans work is not progressing; the threads are parked.Exclude InnoDB row lock contention. Check
SHOW GLOBAL STATUS LIKE 'Innodb_row_lock_current_waits';. If this is zero or low while dozens of threads are stuck, the stall is purely at the MDL layer.Decide who to kill. If the long-running transaction is expendable, kill it. If it belongs to a critical reporting job or an uninterruptible backup, kill the DDL instead and reschedule the schema change. Consider whether killing a modifying transaction requires downstream reconciliation before you run
KILL.
Metrics and signals
| Signal | Why it matters | Warning sign |
|---|---|---|
performance_schema.metadata_locks PENDING count | Direct measure of MDL queue depth on specific objects | More than 3 sessions pending on the same table for over 30 seconds |
SHOW PROCESSLIST state Waiting for table metadata lock | Immediate user-facing symptom | Sustained presence during any DDL window |
Threads_running | Active work piling up behind the lock | Rising steadily while CPU and disk remain normal |
Threads_connected / max_connections | Proximity to connection exhaustion | Ratio above 0.80 and climbing |
Questions rate | Confirms throughput collapse | Drop greater than 50 percent from baseline while clients remain connected |
Innodb_row_lock_current_waits | Distinguishes MDL stalls from row lock contention | Near zero despite many stuck threads |
Fixes
Kill the DDL. If the schema change is not urgent, identify the DDL thread ID from SHOW FULL PROCESSLIST and run KILL <thread_id>;. If the DDL has not yet acquired the exclusive lock, this removes the request instantly. If it has already acquired the lock and is executing, KILL initiates rollback; relief is not instant on large tables and may hold the MDL until rollback completes. The tradeoff is that the schema change must be reattempted later, ideally during a maintenance window after confirming no long transactions.
Kill the blocking transaction. If the oldest transaction in INNODB_TRX is idle or expendable, run KILL <trx_mysql_thread_id>;. This rolls back uncommitted work and releases the shared MDL. The DDL then proceeds, followed by the queued DML. The tradeoff is application errors for whatever work was in flight, plus any downstream effects if the transaction had already modified data.
Do not kill the waiting queries. Sessions showing Waiting for table metadata lock are victims, not blockers. Killing them frees no locks and can trigger application retry loops that make the queue worse.
Address backup and monitoring connections. If a backup job caused the stall, stop the backup tool or switch to a non-locking method such as --single-transaction without --lock-tables or --master-data if consistency allows. For monitoring tools that open transactional connections, configure them to use READ COMMITTED and to commit or disconnect promptly.
Prevention
Inspect
INNODB_TRXbefore DDL. Always checkSELECT COUNT(*), MAX(TIMESTAMPDIFF(SECOND, trx_started, NOW())) AS max_age FROM information_schema.INNODB_TRX;before starting anALTER TABLE. If transactions exist, wait or notify owners.Keep application transactions short. Ensure ORMs and connection pools commit or rollback promptly. An idle connection with an open transaction is the most common blocker.
Reserve an admin connection. MySQL reserves one connection above
max_connectionsfor users withSUPERorCONNECTION_ADMIN. Ensure your on-call credential has this privilege so you can diagnose even during connection exhaustion.Schedule DDL during low-traffic windows. Even if the DDL itself is online, the exclusive MDL acquisition at the beginning and end can stall if transactions are active.
Monitor open transaction age. Alert on any transaction older than a few minutes in OLTP workloads. This catches idle blockers before they cause a cascade.
How Netdata helps
- A
Threads_runningspike with flat CPU and disk points to a lock stall rather than resource saturation. Threads_connectedagainstmax_connectionswarns before connection exhaustion.- A
Questionsrate drop during active hours is visible immediately. Innodb_row_lock_current_waitsdistinguishes MDL stalls from row lock contention.- Historical context shows when a DDL began and when the query backlog started forming.
Related guides
- How MySQL actually works in production: a mental model for operators
- MySQL Aborted_connects and Aborted_clients climbing: diagnosis
- MySQL connection exhaustion: detection, diagnosis, and prevention
- MySQL ERROR 1213: Deadlock found when trying to get lock; try restarting transaction
- MySQL gap locks and next-key locks: surprising deadlocks under REPEATABLE READ
- MySQL ERROR 1205: Lock wait timeout exceeded; try restarting transaction
- MySQL: got a packet bigger than ‘max_allowed_packet’ bytes - causes and fixes
- MySQL monitoring checklist: the signals every production instance needs
- MySQL monitoring maturity model: from survival to expert
- MySQL InnoDB row lock contention: finding who blocks whom
- MySQL Threads_created climbing: thread cache churn and missing pooling
- MySQL Threads_connected vs Threads_running: which one to actually alert on







