MySQL for Admins

Offer suggestions by opening an issue


Diction

  • MySQL -> relational database management system (RDBMS) - databases have schemas
  • Schema - table structures (columns, data types) + relationships through primary / foreign keys

| Feature Category | Community Edition | Enterprise Edition |
| --- | --- | --- |
| License | FOSS (Free and Open Source Software) | Commercial License |
| Kubernetes Support | Operator for Kubernetes (InnoDB & NDB clusters, full lifecycle management) | |
| Backup | Logical backup utils - mysqldump, mysqlbinlog for PITR | Physical backup util - mysqlbackup -> hot backup, faster, compression with heuristics, zero-storage streaming backup and restore, selective TTS-based backup and restore with table renaming |
| Availability | Manual failover and clustering setups | InnoDB Cluster: fault tolerance, data consistency; InnoDB ClusterSet: primary-replica clusters with automatic failover (replacement) |
| Scalability | Basic concurrency features | Thread pool for scalable thread management, reduced overhead |
| Stored Programs | Stored procedures/functions | JavaScript Stored Programs – run inside server, reduce client-server data movement |
| Security: Authentication | Native MySQL users/passwords | External authentication modules (Linux PAM, Windows AD), single sign-on, unified (OS - DBMS) credential management, enhanced password policies |
| Security: Encryption | Basic support (e.g., SSL/TLS) | Transparent Data Encryption (TDE) for data-at-rest, PCI DSS and GDPR compliance |
| Security: Firewall | x | mysql_firewall - query allowlist (per user/group of users), monitoring |
| Security: Auditing | x | MySQL Enterprise Audit Plugin |
| Monitoring/Management | SHOW PROCESSLIST, SHOW STATUS, EXPLAIN, SHOW ENGINE INNODB STATUS, MySQL Workbench, Percona Monitoring and Management | MySQL Enterprise Monitor, MySQL Enterprise Manager |

MySQL Architecture

Logical

Logical Architecture

Client

  • Contains server connectors and APIs that send connection requests to the server
  • CLI - mysql; GUI - MySQL Workbench

Server

  1. Authentication
'username'@'hostname' identified by 'password';
  2. Connection Manager

    1. Check thread cache;
    2. if thread available: provide thread; else create new thread, 1 per client
    3. Establish connection
  3. Security

    Verify if user has privilege for each query

  4. Parsing

    1. Lexer (Lexical Analyzer / Tokenizer / Scanner) - breaks the string into tokens (meaningful elements): keywords, identifiers, operators, literals
    2. Parser
      1. Checks if tokens follow syntax structure based on rules
      2. If valid, creates parse tree (Abstract Syntax Tree) - represents logical structure of query
        • Each node represents a SQL operation
        • Edges represent relationships between operations
  5. Optimizer

    1. Reads AST
    2. Generates multiple candidate execution plans compatible with storage engine:
      • Explores different table access methods - no index/full scan, single/multi-column index, Adaptive Hash Index
      • Evaluates possible primary/secondary index usage
      • Considers various join orders (sequence of joining tables)
      • Chooses join methods (e.g., nested loop, hash join)
      • Reorders operations (e.g., applies filters before or after joins) to improve efficiency
      • Considers data distribution and available indexes for join strategies
    • Can be influenced by index and join hints:

      USE INDEX, FORCE INDEX, IGNORE INDEX, STRAIGHT_JOIN

    3. Does cost-based optimization:
      • References the cost model (I/O, CPU, memory) for every operation in each plan
      • Uses data statistics (row counts, index selectivity, data distribution)
    4. Selects the plan with the lowest total estimated cost as the optimized query plan

    Display the query execution plan (worked example after the cache table below):

    EXPLAIN QUERY;

    EXPLAIN ANALYZE QUERY; (8.0.18+)

  6. Storage engine performs data lookup in caches & buffers

    1. if not found, fetch pages from disk
    2. modified pages are later flushed (written back) to disk

| Cache Name | Description |
| --- | --- |
| Table Open Cache | Caches file descriptors to avoid repeated table opening overhead |
| Metadata Cache | Caches schema and column metadata to reduce lookup cost in the data dictionary |
| Query Cache | Caches query strings and result sets; deprecated in 5.7.20, removed in 8.0; use Redis or app-level caching instead |
| Key Cache | Caches MyISAM index blocks to speed up index reads |
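
The worked example referenced above - a hypothetical table t1 with a secondary index on name (names are illustrative, not from this post), showing what the optimizer chose and how hints can override it:

# Hypothetical table used only to illustrate optimizer output
CREATE TABLE t1 (
  id INT AUTO_INCREMENT PRIMARY KEY,
  name VARCHAR(50),
  INDEX idx_name (name)
);

EXPLAIN SELECT * FROM t1 WHERE name = 'A';            # access type, chosen index, row estimate
EXPLAIN ANALYZE SELECT * FROM t1 WHERE name = 'A';    # 8.0.18+: runs the query, shows actual cost and timing

# Index hints that influence (or override) the optimizer's choice
SELECT * FROM t1 FORCE INDEX (idx_name) WHERE name = 'A';
SELECT * FROM t1 IGNORE INDEX (idx_name) WHERE name = 'A';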

Physical

Base Directory -> Executables (default /usr/bin)

| Client Apps | Use |
| --- | --- |
| mysql | CLI client |
| mysqladmin | CLI for quick server management - status, processlist, kill, flush (reload) tables/logs/privileges, create/drop db, shutdown |
| mysqlbinlog | Read binary logs |
| myisamlog | Read MyISAM log |
| mysqlcheck | -c check, -a analyze, -o optimize db [table_name] |
| mysql_config_editor | Store encrypted authentication credentials in .mylogin.cnf for secure passwordless login, useful for scripts |
| mysqldump | Logical backup |
| mysqlimport | Import CSV/TSV - text format data files directly into tables |
| mysql_migrate_keyring | Migrate encryption keys between keyring components |
| mysqlshow | Quick overview of databases, tables, columns, or indexes |
| mysqlslap | Simulate load from multiple clients and measure performance by the timing of each stage |

| Server Apps | Use |
| --- | --- |
| mysqld | Server |
| mysqld_pre_systemd | Initializes datadir (creates system tables). Systemd integration -> ExecStartPre - runs before the server starts |
| mysqldumpslow | Summarize slow query logs to prioritize queries for tuning |
| mysql_secure_installation | Set root password, remove anonymous users, disallow remote root login, remove test databases, reload privilege tables |
| mysql_tzinfo_to_sql | Load system time zones into the mysql schema |
| my_print_defaults | Print options in the option groups of option files |

| Additional Utils | Use |
| --- | --- |
| mysqltuner | Script to analyze and suggest MySQL server optimizations |
| mysqlreport | Generate readable reports from MySQL status and variables |
| mysqlauditadmin | Manage MySQL audit plugins and logs |
| mysqlauditgrep | Search/filter MySQL audit logs |
| mysqldbcompare | Compare the structure and data of two databases |
| mysqldbcopy | Copy databases or tables between MySQL servers |
| mysqldbexport | Export database objects (schema/data) |
| mysqldbimport | Import database objects (schema/data) |
| mysqldiff | Find differences between database schemas |
| mysqldiskusage | Estimate MySQL disk space usage |
| mysqlfailover | Automatic failover for MySQL replication setups |
| mysqlfrm | Recover or analyze table structure from .frm files |
| mysqlindexcheck | Check for duplicate or redundant indexes |
| mysqlmetagrep | Search for metadata patterns in database objects |
| mysqlprocgrep | Search/filter running MySQL processes |
| mysqlreplicate | Set up replication between MySQL servers |
| mysqlrpladmin | Administer and monitor MySQL replication |
| mysqlrplcheck | Check replication health and configuration |
| mysqlrplshow | Show replication topology and status |
| mysqlserverclone | Clone a MySQL server instance |
| mysqlserverinfo | Display detailed MySQL server information |
| mysqluc | Unified CLI for multiple MySQL utilities |
| mysqluserclone | Clone MySQL user accounts and privileges |

Schemas and their corresponding files in Data Directory (default /var/lib/mysql)

├── User Schemas (Subdirectory)
│   ├── Tables (InnoDB .ibd or MyISAM .FRM .MYI .MYD .PAR)
│   ├── Routines - reusable SQL statements
│   │   ├── Stored Procedures
│   │   └── Stored Functions
│   ├── Triggers - auto-execute procedures in response to events like DML (MyISAM .TRG)
│   └── Views - virtual tables (representing query result)
│
└── System Schemas
    ├── mysql
    │   ├── Objects
    │   │   ├── Data Dictionary Tables (DD)
    │   │   │   └── Internal InnoDB tables containing metadata about all database objects
    │   │   ├── Tables
    │   │   │   ├── user
    │   │   │   ├── db
    │   │   │   ├── tables_priv
    │   │   │   ├── columns_priv
    │   │   │   ├── schemata
    │   │   │   ├── tables
    │   │   │   ├── columns
    │   │   │   ├── indexes
    │   │   │   └── events
    │   │   ├── Routines
    │   │   ├── Triggers
    │   │   └── Views
    │   │
    │   └── Files
    │       └── mysql.ibd (InnoDB tablespace containing DD and all mysql schema tables)
    │
    └── Virtual Schemas
        ├── information_schema
        │   ├── Objects
        │   │   ├── Read-only views exposing metadata from DD and privileges from mysql schema
        │   │   └── Referred to by `SHOW` commands
        │   └── Files
        │       └── None (virtual)
        │
        ├── performance_schema
        │   ├── Objects
        │   │   └── In-memory tables of type performance_schema engine for runtime monitoring
        │   └── Files
        │       ├── *.sdi (metadata)
        │       └── None (in-memory)
        │
        └── sys
            ├── Objects
            │   ├── Views (actionable performance and info schema summaries for I/O latency, memory usage, schema info, etc)
            │   ├── Stored Procedures (for generating diagnostics reports, configuring perf schema)
            │   └── Stored Functions (querying/formatting perf schema data)
            └── Files
                └── sysconfig.ibd - configuration settings for sys schema
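
To map a schema object to the file that backs it on disk, the data dictionary can be queried through information_schema (a quick sketch; db1.t1 is a placeholder name, and the column set shown is the commonly used subset):

SELECT tablespace_name, file_name, file_type
FROM information_schema.FILES
WHERE tablespace_name LIKE 'db1/t1';    # file-per-table tablespaces are named db_name/table_name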

Non-schema files

/var/lib/mysql
├── ibdata1
│   └── Shared InnoDB tablespace (internal InnoDB metadata, doublewrite buffer, undo logs)
│
├── Logs
│   ├── General Query Log - all SQL statements
│   ├── Slow Query Log - queries > specified execution time
│   ├── DDL Log - DDL queries
│   ├── Binary Log - for PITR and replication - events that describe changes to DB 
│   └── Relay Log - for replication - events read from source's binlog
│
├── InnoDB Log Files
│   ├── `#innodb_redo/` - redo logs
│   └── `undo_001` - undo logs
│
├── mysql.sock
│   └── Unix socket for local client communication - created at server start, removed at shutdown
│
└── mysql.sock.lock - contains mysqld PID that owns the socket, useful for multiple instances 

/etc/my.cnf
└── MySQL config file
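
Most of the text logs above are disabled by default; a minimal /etc/my.cnf sketch that enables them (paths and thresholds are illustrative, adjust to taste):

[mysqld]
# General query log - every statement; heavy, enable only while debugging
general_log=ON
general_log_file=/var/log/mysql/general.log

# Slow query log - statements taking longer than long_query_time seconds
slow_query_log=ON
slow_query_log_file=/var/log/mysql/slow_query.log
long_query_time=1

# Binary log - needed for PITR and replication
log_bin=/var/lib/mysql/binlog
binlog_expire_logs_seconds=604800    # keep 7 days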

Storage Engines

NDBCluster

  • Clustered storage engine for high-availability and scalability

Memory

  • Store data in RAM for fast access
  • For temp, non-persistent data

CSV

  • Store and retrieve data from csv files

Blackhole

  • Discards all data written to it
  • For testing or logging purposes

Archive

  • Store and retrieve data from compressed files

MyISAM

  • Smaller and faster than InnoDB
  • More suitable for read-heavy applications
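
To check which engines this server offers and which one is the default (tables can be converted later, see Change storage engine under Table Management):

SHOW ENGINES;                        # Support column: YES / DEFAULT / NO
SELECT @@default_storage_engine;     # InnoDB since 5.5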

InnoDB

ACID Property InnoDB (Default in 5.5) MyISAM
Atomicity Each transaction is treated as a single unit, either fully completing (commit) or rollback using undo tablespaces/logs if any part of the transaction fails X
Consistency Supports foreign keys, referential integrity, and other constraints to maintain data consistency X
Isolation Supports row-level locking, transaction isolation levels, and prevents interference between transactions Table-level locking leads to lack of isolation
Durability Uses redo log (write-ahead logging), doublewrite buffer, and crash recovery for durability No recovery in case of crashes

InnoDB Architecture

In-Memory Data

Located completely in RAM

Buffer Pool

  • Default 128M; commonly sized up to ~80% of server RAM on a dedicated database host
  • Caches table and index data pages; modified pages not yet written to disk are dirty pages
  • Least Recently Used (LRU) algorithm
    • New (Young) Sublist (5/8)
      • Head
        • Most accessed pages
      • Tail
    • Old Sublist (3/8)
      • Head
        • New pages
        • Less accessed pages
      • Tail
      • Flushed to data files
  • Change Buffer (default 25% of the buffer pool, up to 50%)
    • Caches changes to secondary index pages not currently in buffer pool
    • Merged later when index pages are loaded by buffer pool
  • Adaptive Hash Index
    • Constructed dynamically by InnoDB
    • Stores frequently used indexes
    • Speeds up data retrieval from buffer pool
      • B-Tree index lookups -> faster hash-based search
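
A quick way to see how well the buffer pool is absorbing reads on a running server (status counters only, nothing is changed):

SHOW ENGINE INNODB STATUS\G                            # "BUFFER POOL AND MEMORY" section
SHOW GLOBAL STATUS LIKE 'Innodb_buffer_pool_read%';    # read requests vs. reads that had to hit disk
SELECT @@innodb_buffer_pool_size / 1024 / 1024 AS buffer_pool_mb;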

Log Buffer

  • Buffers redo log records for changes made to pages in the buffer pool
  • On transaction commit, when the log buffer reaches its threshold, or at regular intervals:
    • Flush to redo log files

On-Disk Data

Redo Logs

  • Write-ahead logging
    • Persistent log of changes, before applied to on-disk pages
  • Changes can be reapplied to data pages if system crashes before/during writing
  • Durability - committed transactions are not lost
  • Temporary redo logs (#ib_redoXXX_tmp) - internal, pre-created spare files to handle log resizing and rotation
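
How aggressively commits are flushed and synced to the redo log is controlled by innodb_flush_log_at_trx_commit; a quick check (values shown for reference, not as a tuning recommendation):

SHOW VARIABLES LIKE 'innodb_flush_log_at_trx_commit';
# 1 (default) - write + fsync at every commit: full durability
# 2 - write at commit, fsync ~once per second: may lose ~1s of commits on an OS/host crash
# 0 - write + fsync ~once per second: may lose ~1s of commits even on a mysqld crash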

Tablespaces

  • InnoDB (System) Tablespace (ibdata1)
    • Change buffer
      • Persists secondary index buffered changes across restarts (durability)
    • Doublewrite Buffer
      • Protects against partial page writes due to crash while writing pages to tables
    • Undo logs
      • In case instance isn't started with undo tablespace
    • Data Dictionary (8.0 -> mysql.ibd)
    • Table and Index Data (5.7 -> For tables without innodb_file_per_table option)
  • General Tablespace .ibd
    • Can host multiple tables
  • File-Per-Table Tablespace .ibd
    • Each table has its own .ibd file
  • Temporary Tablespace
    • Session (#innodb_temp dir)
      • User-created

        CREATE TEMPORARY TABLE table_name ();
        
      • Internal temp tables - auto created by optimizer for operations like sorting, grouping

        • Optimizer may materialize Common Table Expressions' (CTE - modular query, introduced in 8.0) result sets into temp table if they are frequently referenced in a query
    • Global (ibtmp1)
      • Stores rollback segments for changes to user-created temp tables
      • Redo logs not needed since not persistent
      • Auto-extending
      • Removed on normal shutdown, recreated on server startup
  • Undo Tablespaces
    • Store undo logs
      • Records original data before changes
      • Enable rollback of failed or uncommitted transactions and provide old row versions for consistent reads (MVCC)

Glossary

| Term | Definition |
| --- | --- |
| Page | Smallest writable unit of data. Default 16KB. |
| Clustered Key | PRIMARY KEY, UNIQUE NOT NULL key/column, or a hidden system-generated row ID |
| Clustered Index | A data structure (usually a B-tree) that physically stores the full row data ordered by the clustered key; there is exactly one clustered index per InnoDB table. |
| Secondary Key | Columns other than the clustered key used to speed up queries on those columns, e.g. SELECT * FROM t WHERE name='A'; |
| Secondary Index | A B-tree index built on secondary key columns that stores those columns plus the clustered key to locate full rows in the clustered index; multiple secondary indexes can exist per table. Explicitly created by the user to increase query performance. |
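
A small sketch tying the glossary together (table and column names are hypothetical): the PRIMARY KEY becomes the clustered key, and idx_name is a secondary index that stores (name, id) entries pointing back into the clustered index:

CREATE TABLE employees (
  id   INT NOT NULL PRIMARY KEY,     # clustered index: full rows stored ordered by id
  name VARCHAR(50),
  dept VARCHAR(30),
  INDEX idx_name (name)              # secondary index: (name, id) entries
);

# Secondary index finds matching ids, clustered index supplies the full rows
EXPLAIN SELECT * FROM employees WHERE name = 'A';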

Installation

Import gpg keys to RPM Database (/var/lib/rpm)

rpm --import https://repo.mysql.com/RPM-GPG-KEY-mysql
rpm --import https://repo.mysql.com/RPM-GPG-KEY-mysql-2022
rpm --import https://repo.mysql.com/RPM-GPG-KEY-mysql-2023

1. Using Yum Repository

Auto-installs missing dependencies

  1. Install repository
sudo yum install https://repo.mysql.com/mysql84-community-release-el8-1.noarch.rpm -y
  2. Install server and other required packages
sudo yum install mysql-community-server -y
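
After an RPM-based install, the server generates a temporary root password on first start; a short sketch to start the service and fetch it (assumes the default error-log location /var/log/mysqld.log):

sudo systemctl enable --now mysqld
sudo grep 'temporary password' /var/log/mysqld.log
mysql -u root -p    # then: ALTER USER 'root'@'localhost' IDENTIFIED BY 'NewP@55w0rd';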

2. Using packages

  • Automatically installs binaries or libraries in respective directories
  • Includes additional files for compatibility that the generic archive lacks, e.g. a systemd service file configured to initialize the server on first start
  1. Download required packages, older versions from archives

  2. Check .rpm package integrity

    rpm -K pkg.rpm

  3. Use package manager to install packages from dir

Dependency Tree

mysql-community-server (server apps, my.cnf, datadir w ownership, mysqld.service)

└── mysql-community-client (client apps)

    └── mysql-community-libs (shared libraries for MySQL client applications)
        ├── mysql-community-client-plugins
        └── mysql-community-common (common files for client and server binaries)
  • Optional:
    • icu-data-files - Unicode support
    • test - test apps for server
    • debuginfo - debugging symbols (function/variable names, file paths) that are usually stripped from compiled binaries (to hide internal workings); used for debugging crashes with detailed stack traces
    • devel - development header files and libraries
    • libs-compat - older versions of client libraries for legacy applications that require specific version or binary interface

3. Generic Linux - Tarball

  • Contains everything - prebuilt binaries (built against a specific glibc version), libs, docs,

    support-files

    • SysVinit service files for backward compatibility
  1. Download and extract to required directories
  2. Download and move the below config and service files to their respective directories
  3. Edit according to preference

my.cnf

Move to /etc/my.cnf

or

wget https://linked-blog-blush.vercel.app/md_assets/my.cnf -P /etc/

Systemd Service File

Systemd Service File for Multiple Instances

Used to start daemon(s) on boot

Move to /etc/systemd/system/mysqld.service, preferred over /usr/lib/systemd/system/mysqld.service to prevent overwriting during updates

or

wget https://linked-blog-blush.vercel.app/md_assets/mysqld.service https://linked-blog-blush.vercel.app/md_assets/mysqld@.service -P /etc/systemd/system/
sudo systemctl daemon-reload

Troubleshoot

  1. systemctl status
  2. journalctl -xe
  3. mysqld --verbose --help - lists referenced variables
  • Reset start limit

    sudo systemctl reset-failed mysqld

Security Management

firewalld

  • Filters incoming traffic based on Network (IP) and Transport (TCP) layer headers
  • Use rich rules to allow or block services based on source IPs and destination ports
    firewall-cmd --list-all    # services, ports
    firewall-cmd --permanent --add-service=mysql
    firewall-cmd --permanent --add-port=port_no/protocol
    firewall-cmd --reload
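
An example of such a rich rule, allowing MySQL's port only from one subnet (addresses are illustrative):

    firewall-cmd --permanent --add-rich-rule='rule family="ipv4" source address="192.168.8.0/24" port port="3306" protocol="tcp" accept'
    firewall-cmd --reload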
    

selinux

semanage [-h]

  • show ports enabled for specific service

    semanage port -l | grep mysql

  • add/delete port for specific service

    semanage port [-a][-d] -t mysqld_port_t -p tcp 3307

  • set file context for custom datadir

    semanage fcontext -a -t mysqld_db_t "/datadir(/.*)?"
    restorecon -Rv /datadir
    

Start background process

Exits if TTY (parent) closes

processname &

Daemonize process

Changes parent to init (PID=1) if parent dies

Partially detach process from TTY:

  • nohup (No Hang Up)

  • Sets process to ignore SIGHUP (hangup signal) TTY sends to its children when it closes

  • Closes stdin, redirects stdout and stderr to nohup.out

    nohup mysqld --defaults-group-suffix=1 &

Completely make process independent from TTY:

  • setsid (set session id)

  • creates a new session and process group and makes process its leader, fully independent from TTY, no accidental read or write to closed terminal

    setsid mysqld --defaults-group-suffix=1 &
    # or via a bash run command:
    setsid bash -c 'mysqld --defaults-group-suffix=1' &

Multiple Instances

| Parameter | Multiple Instances | Multiple Databases |
| --- | --- | --- |
| Data Integrity | Data is physically separate; relationships between data in different instances cannot be enforced | Logical controls like access controls, data classification, at-rest & in-transit encryption, regular monitoring & audits to comply with data protection laws |
| High Availability | Stock markets use instance-based failover (clusters) to prevent downtime during peak hours | x |
| Security | Different memory, configs, users. Banks create separate instances for savings, credit cards, loans and each region, isolating technical problems or security breaches and meeting strict risk and regulatory requirements. Government databases separate classified data by instance for strict access control | Smaller orgs like educational institutes may centralize restricted data for easier management of their platforms due to lower risk and compliance needs |
| Backup, Maintenance & Recovery | Enterprises like SaaS providers (prioritizing tenant isolation and high availability) have to back up, monitor, perform routine maintenance, update (patch) and recover each instance separately, or automate with a script. Avoids downtime for unaffected customers | Easier to manage. Many large social media platforms update the entire database at once, potentially disrupting all users for a short time |
| Cost Efficiency | A multinational retailer invests in separate instances for high-traffic countries | Small startups opt for multiple databases within one instance to cut costs |
| Performance | Each instance has its own dedicated resources (CPU, memory, storage). In a financial institution, a spike in mortgage processing won't slow down credit card transactions, as each runs on its own instance | x |
| Scalability | A SaaS provider gives large customers their own dedicated instances, allowing them to scale up or move independently, even to different servers or data centers without affecting others | Easy to add more databases, but all share the same instance limits |
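
A minimal sketch of how the --defaults-group-suffix used elsewhere in this post maps to option groups for multiple instances (ports, paths and suffixes are illustrative):

# /etc/my.cnf
[mysqld]
user=mysql

[mysqld1]                            # read by: mysqld --defaults-group-suffix=1
port=3307
datadir=/var/lib/mysql1
socket=/var/lib/mysql1/mysql.sock

[mysqld2]                            # read by: mysqld --defaults-group-suffix=2
port=3308
datadir=/var/lib/mysql2
socket=/var/lib/mysql2/mysql.sock

Each instance needs its own initialized datadir, port and socket; the mysqld@.service template from the installation section can be used to pass the matching suffix per systemd instance.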

Administration

Display

| Object Type | Query Example |
| --- | --- |
| Tables | SHOW TABLES FROM your_database; |
| Views | SHOW FULL TABLES IN your_database WHERE Table_type = 'VIEW'; |
| Procedures | SHOW PROCEDURE STATUS WHERE Db = 'your_database'; |
| Functions | SHOW FUNCTION STATUS WHERE Db = 'your_database'; |
| Triggers | SHOW TRIGGERS FROM your_database; |
| Events | SHOW EVENTS FROM your_database; |

Status

status or \s

mysqladmin [auth] status

Displays:

Connection/thread id, (server) uptime, connection (socket), threads, open tables, slow queries, query per sec avg

Show active threads (connections) and their processes

mysqladmin [auth] processlist -i 2    # interval = 2s

show processlist;

SELECT * FROM sys.processlist;

Each row displays:

Active client connection - connection id, username, hostname, db in use, time (duration of state), state (sorting result, waiting for table metadata lock)

kill process

mysqladmin kill pid

Identify high load users/problematic queries

SELECT * FROM sys.user_summary;

Change prompt

PROMPT (\u@\h) [\d]\

Query Table Data without USE Db

SELECT table_name
FROM information_schema.tables
WHERE table_schema = 'db_name' AND table_type = 'BASE TABLE';

Log client statements and output

TEE filename # appends

Stop

NOTEE

Performance Monitoring / Tuning

high I/O latency files

sys.user_summary_by_file_io

full table scan tables

sys.schema_tables_with_full_table_scans

Database Maintenance

Monitor buffer pool usage:

sys.innodb_buffer_stats_by_table

System Variables

Show variables

SHOW [GLOBAL | SESSION] VARIABLES [LIKE '%var%'];

Set for the session or globally

SET [GLOBAL | SESSION] variable_name = 'value';

Set persist - stored in data_dir/mysqld-auto.cnf

SET PERSIST variable_name = value;
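
A short worked example (the variable is arbitrary): persist a setting, see where it is tracked, then undo it:

SET PERSIST max_connections = 300;

SELECT * FROM performance_schema.persisted_variables;    # mirrors data_dir/mysqld-auto.cnf

RESET PERSIST max_connections;    # removes the persisted entry; the running value is unchanged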

User Management

Authentication

mysql_config_editor print --all

Set:

mysql_config_editor set --login-path=client --host=localhost --user=root --password

Remove:

mysql_config_editor remove --login-path=client

Login:

mysql --login-path=client

Privileges

SHOW GRANTS [FOR CURRENT_USER/'username'@'hostname']

or

SELECT user, host, select_priv, insert_priv, update_priv, delete_priv FROM mysql.user;

SELECT * FROM information_schema.table_privileges;

GRANT SELECT (column1, column2), INSERT, UPDATE, DELETE, CREATE, DROP ON db_name.table_name TO 'user'[@'hostname'];

GRANT ALL ON db_name.* to 'user'[@'hostname'] WITH GRANT OPTION;

Create/drop user

CREATE USER 'user'[@'hostname'] IDENTIFIED BY 'P@55w0rd';

DROP USER 'user1'[@'hostname'], 'user2'[@'hostname'];

Reset root password

systemctl stop mysqld

Run mysqld as the mysql user, not root (user=mysql in my.cnf or --user=mysql)

mysqld --skip-grant-tables --skip-networking &

FLUSH PRIVILEGES;    # Loads the grant tables
ALTER USER 'root'@'localhost' IDENTIFIED BY 'P@55w0rd';

EXIT

pkill mysql

Table Management

Stored Procedure

Create dummy test data

DELIMITER $$

CREATE PROCEDURE test.create_dummy_data()
BEGIN
  DECLARE n INT;
  -- Find next table number
  SELECT IFNULL(MAX(CAST(SUBSTRING(table_name,2) AS UNSIGNED)),0)+1 INTO n
    FROM information_schema.tables
   WHERE table_schema='test' AND table_name REGEXP '^t[0-9]+$';

  SET @tbl := CONCAT('test.t', n);
  -- Create new table
  SET @sql := CONCAT('CREATE TABLE ', @tbl, ' (id INT AUTO_INCREMENT PRIMARY KEY, hash CHAR(64))');
  PREPARE s FROM @sql; EXECUTE s; DEALLOCATE PREPARE s;

  -- Insert 1 million hashes using recursive CTE
  SET SESSION cte_max_recursion_depth = 1000000;
  SET @sql := CONCAT(
    'INSERT INTO ', @tbl, ' (hash) ',
    'WITH RECURSIVE cte(n) AS (SELECT 1 UNION ALL SELECT n+1 FROM cte WHERE n<1000000) ',
    'SELECT SHA2(n,256) FROM cte'
  );
  PREPARE s FROM @sql; EXECUTE s; DEALLOCATE PREPARE s;
END$$

DELIMITER ;

Change auto-increment value

ALTER TABLE table_name AUTO_INCREMENT = value; # if greater than max - next insertion starts w value, else no effect

Create general tablespace and add tables

CREATE TABLESPACE ts
    ADD DATAFILE 'ts.ibd'    # base dir is datadir by default
    ENGINE=InnoDB;

CREATE TABLE t1 (
    id INT PRIMARY KEY,
    name VARCHAR(50)
) ENGINE=InnoDB
  TABLESPACE ts;

Switch data dir after install

mkdir /newpath
chown -R mysql:mysql /newpath
chmod -R 750 /newpath

systemctl stop mysqld    # copying mid-insertion can cause partial or mismatched data & index files, or corruption if copied mid-modification
cp -r /var/lib/mysql /newpath

vi /etc/my.cnf    # datadir=/newpath
systemctl restart mysqld

Change storage engine

Backup before conversion

ALTER TABLE table_name ENGINE = InnoDB;

Backup and Restore

| Term | Meaning |
| --- | --- |
| Cold | DB offline (locked) |
| Warm | Table locks (partial access) |
| Hot | No locks (full access) |

| Term | Meaning |
| --- | --- |
| Restore | Return to original state |
| Recover | Salvage missing data using specialized tools, partial/full |

Logical

Produce a set of SQL statements (.sql, csv, other text) to restore the original database object definitions and data

mysqldump

--single-transaction - starts a transaction (a logical unit of work)

SET SESSION TRANSACTION ISOLATION LEVEL REPEATABLE READ;    # default level for InnoDB transactions
START TRANSACTION;
  • REPEATABLE READ: ensures the transaction sees a stable snapshot of committed data at its start, preventing dirty and non-repeatable reads.
  • At transaction start, InnoDB creates a Read View - maintains list of trx_ids that represent transactions whose changes should not be visible to the current transaction for consistent reads.
  • MVCC stores multiple row versions; undo logs keep old versions to reconstruct the snapshot dynamically without copying data pages.
  • Snapshot creation is logical and instant; readers (dump) and writers (other transactions) do not block each other.
  • Only transactional (InnoDB) tables get consistent snapshots; avoid DDL during dump to prevent inconsistencies.
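
A two-session demo of the snapshot behaviour --single-transaction relies on (db1.t1 is a placeholder table):

# Session A (plays the role of the dump)
START TRANSACTION WITH CONSISTENT SNAPSHOT;
SELECT COUNT(*) FROM db1.t1;      # e.g. 100

# Session B (concurrent writer)
INSERT INTO db1.t1 (id) VALUES (101);    # any write
COMMIT;

# Session A still sees the snapshot taken at its start
SELECT COUNT(*) FROM db1.t1;      # still 100
COMMIT;                           # releases the snapshot
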

| mysqldump flag | Description |
| --- | --- |
| -A, --all-databases | Dump all databases |
| -B, --databases | Dump specified databases, add CREATE DATABASE statement |
| -R | Include stored routines (procedures & functions) |
| -E | Include events (scheduled tasks) |
| --triggers | Include triggers |
| --single-transaction | Dump tables in a single transaction, disables 'LOCK TABLES', allows changes during dump |
| --lock-all-tables | Lock all tables across all databases before dumping |
| --set-gtid-purged=off | Exclude GTIDs in backup, creates new transaction IDs upon restore |
| --ignore-table=db1.tb1 | Ignore specified table (repeat the option for multiple tables) |
| --add-drop-database | Add 'DROP DATABASE IF EXISTS' before each database, useful for replication |
| --no-create-db | Do not include 'CREATE DATABASE IF NOT EXISTS' statement |
| --no-create-info | Do not include table creation info (no 'CREATE TABLE' statements) |
| --no-data | Do not include row information (only schema) |
| --source-data <1/2> | Include 1 - CHANGE REPLICATION SOURCE TO SOURCE_LOG_FILE, SOURCE_LOG_POS statement or 2 - comment (for information), used for replication. Replaces --master-data |
| --compact | Produce less verbose output by removing comments |
| (pipe to) pv -trb | Show dump progress: elapsed time, transfer rate, bytes transferred |
| (pipe to) gzip | Compress the dump stream |
| > $(date +"%F_%T").sql[.gz] | Redirect output to a timestamped file (optionally compressed if .gz is used) |
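
Putting several of these flags together, a hedged example of a full dump streamed through pv and gzip into a timestamped file (uses the login path set up under User Management; --source-data needs 8.0.26+, older versions use --master-data=2):

mysqldump --login-path=client --all-databases \
  --single-transaction --routines --events --triggers \
  --source-data=2 \
  | pv -trb | gzip > all_databases_$(date +"%F_%T").sql.gz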

mysqlpump

mysqlpump [auth] -B db1 db2 > pump.sql

# number of threads = physical cores should give most perf; if >, context switching will consume CPU time 
--parallel-schemas=4:db1,db2    # number of threads for a new queue for specific DBs

--default-parallelism=4    # number of threads for the default queue that processes all DBs not in separate queues, default 2

Prerequisites to Restoring on Production

Import on test server and check table integrity

mysql [auth] --database=test < data.sql
mysqlcheck [auth] [--databases] test

Run queries from stored procedure to verify data integrity

Verify backup (Enterprise)

mysqlbackup --backup-image=/path/to/backup.mbi validate

Point in Time Recovery (PITR - Incremental) using binlog

| Binary Log Type | Description |
| --- | --- |
| Statement | Logs SQL statements that modify data. Efficient, but can be unreliable for non-deterministic operations like NOW(). |
| Row (Default) | Logs actual row changes. Most reliable for replication, but uses more storage. |
| Mixed | Uses statement-based by default, switches to row-based for unsafe (non-deterministic or complex) statements. Efficient + safe. |

SHOW BINARY LOGS;

Reset binary logs and gtids, delete logs before a specific date / upto a log file

RESET BINARY LOGS AND GTIDS;

PURGE BINARY LOGS BEFORE '2025-05-01 00:00:00';    # or NOW()

PURGE BINARY LOGS TO 'mysql-bin.000123';
  1. View binary logs with -v
    • /*! ... */ is a MySQL versioned comment that specifies the minimum MySQL version required to execute the code inside it; other DBMSs ignore it
  2. Convert binary logs to SQL statements and pipe them into the server

mysqlbinlog [--start-datetime= --stop-datetime="2025-05-21 18:00:00" | --start-position= --stop-position=] binlog.000001 binlog.000002 | mysql [authentication]    # only replay changes for a specific time/position range

--read-from-remote-server    # if binlog encrypted
--include-gtids=server_uuid:tr_id --exclude-gtids=    # used for replication only, gtid_executed gtids aren't replayed

# modified output w flags may not be able to be used to replay changes
-v, --verbose [--base64-output=decode-rows]    # Comment reconstructed pseudo-SQL statements out of row events
--base64-output=    # whether to display base64-encoded binlog statements: never, decode-rows disables row-based events, default auto
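
A hedged end-to-end PITR sketch (backup file, binlog name, position and incident time are illustrative): restore the last full dump, then replay binlog events up to just before the bad statement:

# 1. Restore the most recent full backup
mysql --login-path=client < full_backup.sql

# 2. Inspect the binlog to find where the bad statement starts
mysqlbinlog -v --base64-output=decode-rows binlog.000042 | less

# 3. Replay changes made after the backup, stopping just before the incident
mysqlbinlog --start-position=157 --stop-datetime="2025-05-21 17:59:00" binlog.000042 | mysql --login-path=client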

mydumper / myloader

mydumper -u user -p pa55 [-t, --threads 4] [-d, --database dbname] -o [--outputdir] /backups/dbname
myloader -u user -p pa55 [-t] -d [--directory] /backups/dbname

MySQL Shell

\js

| Action/Utility | Syntax Example |
| --- | --- |
| Instance Backup | util.dumpInstance('/path/to/backup-directory') |
| Schema(s) Backup | util.dumpSchemas(['database_name'], '/path/to/backup-directory') |
| Table(s) Backup | util.dumpTables('database_name', ['table1', 'table2'], '/path/to/backup-directory') |
| With Options | util.dumpInstance('/backup/dir', {threads: 4, ocimds: true, consistent: true}) |
| Import Dump | util.loadDump('/path/to/backup-directory') |

| Option | Description |
| --- | --- |
| ocimds | Checks and prepares the dump for compatibility with MySQL HeatWave Service (Oracle Cloud). Use true if targeting HeatWave. |
| threads | Sets the number of parallel worker threads for the backup. Cannot assign threads to specific queues; only total count is set. |
| consistent | Ensures a consistent snapshot by locking tables during the dump (default: true). Disabling may cause inconsistencies. |

Physical

Percona XtraBackup (xtrabackup)

Backup

xtrabackup [auth] [--host=] --backup [--tables=<db.tb1>] [--databases<-exclude>=] --target-dir=</inc \| /full> --incremental-basedir=<prev-backup> [--encrypt] [--compress] [--no-timestamp] [--parallel=] [--throttle=]

mysqlbackup [auth] [--host=] --backup-dir= --incremental --incremental-base=<dir:/prev or history:/full> [--<include/exclude>-tables=db.tb1,] [--include-purge-gtids=off] [--no-locking] [--skip-binlog] [--encrypt] [--compress] [--with-timestamp] [--<process/read/write>-threads=] backup

Restore

xtrabackup --<prepare/apply-log> --target-dir=/full --incremental-dir=/inc [--apply-log-only] [--parallel=] [-use-memory=]
xtrabackup [auth] --copy-back --target-dir= --incremental-dir= --data-dir=<new_datadir>

mysqlbackup --backup-dir=<backup_dir> [--uncompress] [--decrypt] prepare
mysqlbackup [auth] [--host=] --backup-dir=<backup_dir> [--uncompress] [--decrypt] copy-back

Tables - Warm Backup

  • Source

    flush tables db.table_name for export;    # locks the table for export - copying mid-insertion can cause partial or mismatched data & index files, or corruption if copied mid-modification
    cp table_name.ibd table_name.cfg destination/
    unlock tables;
    
  • Destination

    create table db.table_name(exact table_definition);
    alter table db.table_name discard tablespace;
    alter table db.table_name import tablespace;
    

Compress and Delete Old Text Logs

Set a login path (mysql_config_editor set --login-path=client ...) so the postrotate command below can authenticate non-interactively

/etc/logrotate.d/mysql

/var/log/mysql/mysql_error.log /var/log/mysql/slow_query.log {
compress
create 660 mysql mysql
size 1G
dateext
missingok
notifempty
sharedscripts
postrotate
    /usr/bin/mysql --login-path=client -e 'FLUSH SLOW LOGS; FLUSH ERROR LOGS;'
endscript
rotate 30
}

Upgrade

Take logical backup for failsafe (can be used for downgrade too), physical backup for faster restoration

| Supported Paths | Method |
| --- | --- |
| < 5.7 -> 5.7 | Update binary, run mysql_upgrade |
| 5.7 -> 8.0.15 | Update binary, run mysql_upgrade |
| 5.7 -> 8.0.16+ | Update binary, start server (auto-upgrade) |
| 8.0.x -> 8.4 | Update binary, start server (auto-upgrade) |
| 8.4 -> 9.x | Update binary, start server (auto-upgrade) |

8.0.16+: mysql_upgrade (data dictionary/schema upgrade) is deprecated; its functions are embedded into the server

| mysqld --upgrade= (8.0.16+) | Upgrades |
| --- | --- |
| AUTO (default) | Data dictionary (DD), system schemas (mysql (incl. help tables), Performance Schema, INFORMATION_SCHEMA, sys), user schemas, if not already upgraded. |
| MINIMAL | Core metadata: DD, Performance Schema, INFORMATION_SCHEMA. Skips mysql (incl. help tables), sys schema, user schemas. Useful for faster startup and upgrading user schemas later |
| FORCE | All: DD, system schemas (mysql (incl. help tables), Performance Schema, INFORMATION_SCHEMA, sys), user schemas, even if previously upgraded. Useful for checking and forcing repairs. |
| NONE | Skip server auto-upgrade; the server will not start if a DD upgrade is required. Used for manual handling |
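
Before swapping binaries, MySQL Shell's upgrade checker utility can flag incompatibilities on the running (old) server; a sketch, connection details are illustrative:

mysqlsh --js root@localhost:3306 -e "util.checkForServerUpgrade()"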

Replication

Supports topologies such as ABBA (A->B and B->A, bidirectional) and ABCA (A->B->C->A, circular)

Replication Architecture

1. Source

a. Add to my.cnf:

server_id=1

b. Create a user only for replication

CREATE USER replica@'hostname' IDENTIFIED BY 'Redhat@1';
GRANT REPLICATION SLAVE ON *.* TO replica@'hostname';

c.

| Version | Command |
| --- | --- |
| < 8 | RESET MASTER; |
| > 8 | RESET BINARY LOGS AND GTIDS; |
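
d. Capture the source's current binlog coordinates; the replica's SOURCE_LOG_FILE / SOURCE_LOG_POS in step 2b come from this output (SHOW BINARY LOG STATUS replaces SHOW MASTER STATUS in newer releases):

SHOW MASTER STATUS;          # older versions
SHOW BINARY LOG STATUS;      # 8.2.0+ / 8.4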

2. Replica

a. Add to my.cnf:

server_id=2
relay_log=/var/lib/mysql/relaylog.log

b.

CHANGE REPLICATION FILTER REPLICATE_DO_DB = (db1);    # optional - replicate only specific databases

# < 8 statements for multi-source replica channels
# SET GLOBAL master_info_repository = 'TABLE';
# SET GLOBAL relay_log_info_repository = 'TABLE';

# < 8 - replace %SOURCE with MASTER and REPLICA with SLAVE
RESET BINARY LOGS AND GTIDS;    # < 8 - RESET MASTER
RESET REPLICA;

CHANGE REPLICATION SOURCE TO
  SOURCE_HOST = '192.168.8.2',
  SOURCE_USER = 'replica',
  SOURCE_PASSWORD = 'Redhat@1',
  #SOURCE_AUTO_POSITION = 1,    # If GTIDs enabled
  SOURCE_LOG_FILE = 'binlog.000001',
  SOURCE_LOG_POS = 157,
  GET_SOURCE_PUBLIC_KEY = 1;
#FOR CHANNEL 'channel_name';

START REPLICA;
SHOW REPLICA STATUS\G

Skip problematic statements

# < 8 - replace replica with slave
STOP REPLICA;
SET GLOBAL sql_replica_skip_counter = N;    # position-based replication only, not GTID auto-positioning
START REPLICA;

Automated crash recovery

  1. SQL thread checks the last committed position from mysql.slave_relay_log_info (or the GTID set in mysql.gtid_executed) for transactions already committed on the replica
  2. Applies remaining relay log statements, purges relay log files
  3. I/O thread requests new binlog events from the source starting from the last committed transaction recorded by SQL thread
relay_log_recovery=ON    # not dynamic - set in my.cnf and restart the server

Semi-sync replication

Source waits for a replica to confirm that the transaction has been written to the replica's relay log before completing the commit on its own end (AFTER_SYNC, the default)

Add to my.cnf:

Source

plugin_load_add='semisync_source.so'
rpl_semi_sync_source_enabled=1

Replica

plugin_load_add='semisync_replica.so'
rpl_semi_sync_replica_enabled=1

Restart server

Enable at runtime

Source

INSTALL PLUGIN rpl_semi_sync_source SONAME 'semisync_source.so';
SET GLOBAL rpl_semi_sync_source_enabled=1;

Replica

INSTALL PLUGIN rpl_semi_sync_replica SONAME 'semisync_replica.so';
SET GLOBAL rpl_semi_sync_replica_enabled=1;

# Verify if enabled
SHOW PLUGINS;
SHOW VARIABLES LIKE 'rpl_semi_sync_source_enabled';

# Restart I/O thread to load plugin settings
STOP REPLICA IO_THREAD;
START REPLICA IO_THREAD;
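
To confirm semi-sync is actually active (and has not silently fallen back to asynchronous replication), check the plugin status counters on the source:

SHOW GLOBAL STATUS LIKE 'Rpl_semi_sync%';
# Rpl_semi_sync_source_status = ON and Rpl_semi_sync_source_clients > 0
# mean at least one semi-sync replica is acknowledging commits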

Group Replication

Uses a synchronous, Paxos-based group consensus protocol: a transaction must be agreed on (certified) by the group before the primary can commit it to its own redo log.

Primary Member

1. Set hostnames for IPs (for correct resolution)

/etc/hosts

192.168.8.69 mysql1
192.168.8.96 mysql2

2. a. Add variables to my.cnf:

plugin_load_add='group_replication.so'
plugin_load_add='mysql_clone.so'

group_replication_communication_stack=MYSQL    # > 8.0.27
group_replication_group_name="744bce81-a89c-4526-8841-ec030bd1a8f7"    # uuidgen or select UUID();
group_replication_start_on_boot=off
# Change ports to 33061 for default (XCOM) stack
group_replication_local_address="mysql1:3306"    # Current host's IP
group_replication_group_seeds="mysql1:3306,mysql2:3306"    
group_replication_bootstrap_group=off

# for multi-primary
# group_replication_single_primary_mode=OFF
# group_replication_enforce_update_everywhere_checks=ON

server_id=1
gtid_mode=ON
enforce_gtid_consistency=ON

systemctl restart mysqld

b. Ensure gtid_mode=ON

SET PERSIST gtid_mode=ON_PERMISSIVE;
SET PERSIST gtid_mode=ON;

3. Temporarily change clone threshold

SET GLOBAL group_replication_clone_threshold=1;

4. Create rpl_user and grant privileges

# SET SQL_LOG_BIN=0;
CREATE USER rpl_user@'%' IDENTIFIED BY 'Redhat@1';
GRANT REPLICATION SLAVE ON *.* TO rpl_user@'%';
GRANT CONNECTION_ADMIN ON *.* TO rpl_user@'%';
GRANT BACKUP_ADMIN ON *.* TO rpl_user@'%';
GRANT GROUP_REPLICATION_STREAM ON *.* TO rpl_user@'%';
FLUSH PRIVILEGES;
# SET SQL_LOG_BIN=1;

5. Start

SET GLOBAL group_replication_bootstrap_group=ON;
START GROUP_REPLICATION USER='rpl_user', PASSWORD='Redhat@1';    # Include credentials to copy user during distributed recovery
SET GLOBAL group_replication_bootstrap_group=OFF;    # if ON, ^ spawns new groups with current host as primary member

Joining Member

1. a. Precheck:

SHOW BINLOG EVENTS

b. if GTIDs conflict

RESET BINARY LOGS AND GTIDS    # RESET MASTER

2. Follow previous steps 1-3

3. Start

START GROUP_REPLICATION USER='rpl_user', PASSWORD='Redhat@1';

Troubleshooting

View status

SELECT * FROM performance_schema.replication_group_members;
SHOW REPLICA STATUS FOR CHANNEL 'group_replication_recovery';
  • Create rpl_user

  • Connect each member to the others over SSH to exchange keys; otherwise:

  • Copy the RSA public key (used for distributed recovery authentication) to the joining member

    1. On donor
    scp /var/lib/mysql/public_key.pem mysql2:/etc/mysql
    
    2. On joining member
    SET PERSIST group_replication_recovery_public_key_path='/etc/mysql/public_key.pem'
    

Switch replication modes on an active setup

SELECT group_replication_switch_to_multi_primary_mode();
SELECT group_replication_switch_to_single_primary_mode();

Monitor switch progress

SELECT event_name, work_completed, work_estimated 
FROM performance_schema.events_stages_current 
WHERE event_name LIKE "%stage/group_rpl%";