This file contains strategies for backing up and restoring a PostgreSQL database. Many of the items were sourced from Magnus Hagander's PGCon 2013 presentation, PostgreSQL Backup Strategies [1].
Postgres database backups involve two main activities: dumping the data and restoring the data.
For databases smaller than 1TB, it is worthwhile to use pg_dump to dump all database values into an archival object. The size limitation is based on the time required not only to dump the data, but also to restore it. Restoring the data is the more time-sensitive activity: you are often rebuilding the database from scratch after a database issue, so you need to be up and running as quickly as possible.
Alternatively, you can use live replication of the database, creating a fail-over system. If the main database system fails, it can seamlessly fail over to the replicated system, preventing any downtime.
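As a rough sketch of what setting up such a fail-over replica can look like, the commands below use pg_basebackup to clone a primary into a streaming standby. The host name, role name, password, and data directory are all hypothetical placeholders, and the recovery-file layout assumes PostgreSQL 12 or later.

```shell
# Sketch: creating a streaming replica with pg_basebackup.
# primary.domain.com, replicator, and the paths below are hypothetical.

# 1. On the primary, create a role allowed to replicate (run once in psql):
#      CREATE ROLE replicator WITH REPLICATION LOGIN PASSWORD 'secret';
#    and permit it in pg_hba.conf:
#      host replication replicator standby.domain.com md5

# 2. On the standby, clone the primary's data directory. The -R flag writes
#    the recovery settings (standby.signal and primary_conninfo) automatically;
#    -X stream ships the WAL needed for a consistent copy during the backup.
pg_basebackup -h primary.domain.com -U replicator \
    -D /var/lib/postgresql/data -R -X stream -P
```

Once the standby's server process is started, it continuously replays WAL from the primary and can be promoted if the primary fails.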
The easiest way to back up a database is to dump the values into an object and store it on another machine. Using pg_dump, you can specify the object format, whether it should be compressed, and whether to parallelize the dump.
When using the custom format flag -Fc with pg_dump, the output is compressed and can be selectively restored with pg_restore. Additionally, the -Z<n> flag specifies the compression level between 0 (off; -Z0) and 9 (max; -Z9), with 6 being the default for -Fc.
When running pg_dump, you can choose which machine performs the compression, and where the IO impact lands, by running pg_dump either on the database server itself or from a remote backup server.
Run pg_dump on the database server directly.
Impacts the database server's CPU, while having minimal IO impact.
Best to store the dump on either a separate disk or on another machine; writing it to the same disk as the database causes random read/write contention.
Default level 6 compression:
pg_dump -Fc -U postgres mydb > mydb.dump
Maximum compression level 9:
pg_dump -Z9 -Fc -U postgres mydb > mydb.dump
Backing up the pySecMaster with default settings:
"C:\Program Files\PostgreSQL\9.5\bin\pg_dump" -Fc -U postgres pysecmaster > C:\Users\joshs\Desktop\pysecmaster_%date:~-4,4%%date:~-7,2%%date:~-10,2%t%time:~0,2%%time:~3,2%%time:~6,2%.dump
SSH into the database server to run pg_dump.
Allows the remote server's CPU to compress the dump, using SSH to securely transfer the uncompressed data (you must tell SSH not to compress the data). This does not impact the database server's CPU, but does impact its IO.
Maximum compression level 9:
ssh -o "Compression=no" db.domain.com "pg_dump -Z9 -Fc -U postgres mydb" > mydb.dump
Along with dumping the actual database values, it is important to also dump the database global objects, including the roles and tablespaces. This is done via the pg_dumpall -g command (the -g flag dumps global objects only).
Backing up the pySecMaster global objects:
"C:\Program Files\PostgreSQL\9.5\bin\pg_dumpall" -g -U postgres > C:\Users\joshs\Desktop\postgres_misc_%date:~-4,4%%date:~-7,2%%date:~-10,2%t%time:~0,2%%time:~3,2%%time:~6,2%.dump
Backing up databases is not only about dumping database data. The database dump must be able to be quickly and accurately restored. It is essential to test the database dump to ensure that in a crisis, you can get your database up and running as quickly as possible.
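One quick, server-free sanity check is to list a custom-format dump's table of contents with pg_restore -l. The helper below is only a sketch: check_dump is a hypothetical function name, and mydb.dump is the assumed file name from the earlier examples.

```shell
# Sketch: sanity-check a custom-format dump before you need it in a crisis.
# check_dump is a hypothetical helper; pg_restore -l lists the dump's table
# of contents (schemas, tables, indexes) without touching any server.
check_dump() {
    local dump="$1"
    if [ ! -f "$dump" ]; then
        echo "missing dump: $dump" >&2
        return 1
    fi
    pg_restore -l "$dump"
}

# Usage: check_dump mydb.dump
# A fuller test is a scripted restore into a scratch database, e.g.:
#   createdb mydb_restore_test && pg_restore -d mydb_restore_test mydb.dump
```

Listing the table of contents only proves the archive is readable; a periodic scripted restore into a scratch database is the stronger guarantee.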
To restore the database from scratch, first load the global objects (users, groups, tablespaces, etc.) into the new server, create the database object, and then restore the actual database values.
Change to the postgres user, open the psql command interface, and load the stored global objects.
su postgres
psql -U postgres -h localhost -p 5432 < <global object dump>
Simply create an empty database with the same name as the database being restored.
su postgres
psql -c "CREATE DATABASE database_name OWNER database_owner_name"
In Postgres, pg_restore allows for loading the previously dumped data into a fresh database structure. However, pg_restore becomes impractical with databases over 1TB because it can take far too long to restore the data dump. If this is the case, it is pertinent to use a replicated database system to avoid having to restore massive databases from scratch.
Use pg_restore to rebuild the database with the values stored in the dump file
su postgres
pg_restore -d database_name -1 mydb.dump
The -1 flag runs the full restore as a single transaction. With this flag, the restore is only committed if the entire restore succeeds, so you won't be left with a corrupt or partially-restored database structure.
The -j <n> flag parallelizes the restore; however, it is not compatible with the -1 flag. This is a very important flag for large databases that need to be restored as quickly as possible. The number of useful parallel processes is limited by the IO.
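As a sketch, a parallel restore might look like the following. The job count of 4 and the database and file names are assumptions to tune for your hardware; note that -j requires a custom-format (-Fc) or directory-format dump.

```shell
# Sketch: parallel restore with pg_restore -j. The job count (4) is an
# assumption; tune it to the server's cores and the disk's IO capacity.
DB=database_name     # target database (must already exist)
DUMP=mydb.dump       # custom-format (-Fc) dump; -j needs custom or directory format
JOBS=4

# The actual restore requires a running server, so it is shown commented out:
#   pg_restore -U postgres -d "$DB" -j "$JOBS" "$DUMP"
echo "pg_restore -U postgres -d $DB -j $JOBS $DUMP"
```

A common starting point is one job per CPU core, backed off if the disks saturate first.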