PostgreSQL configuration

There are different ways to set configuration parameters for PostgreSQL.

This is discussed in the Setting Parameters documentation.

The configurations below can be directly pasted into a PostgreSQL configuration file.
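
As a quick alternative to editing the file by hand, many of these parameters can also be changed from a psql session. A minimal sketch (the parameter chosen here is just illustrative; some parameters, such as shared_buffers, still require a restart):

    -- Write the setting to postgresql.auto.conf and reload the configuration.
    ALTER SYSTEM SET work_mem = '32MB';
    SELECT pg_reload_conf();
    -- Confirm the value seen by new sessions.
    SHOW work_mem;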

Connections and prepared transactions

    max_connections = 1000             # Default is 100
    max_prepared_transactions = 1000   # Default is 0

max_connections doesn’t impact query performance directly, but if a high client connection count is expected or required, it’s a good idea to raise this so that client requests are not rejected. If connections start getting rejected, overall performance will suffer.

Keep in mind that more client connections mean more resources consumed (especially memory). How much higher to set this depends entirely on the use case and requirements.
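
To see how close a running instance is to the configured limit, the current backend count can be compared with max_connections; a rough sketch from psql:

    -- Compare the number of current backends with the configured limit.
    SELECT count(*) AS current_connections,
           current_setting('max_connections') AS max_connections
    FROM pg_stat_activity;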

max_prepared_transactions is 0 by default, which means prepared transactions (two-phase commit via the PREPARE TRANSACTION command) cannot be used out of the box. To enable them, set max_prepared_transactions to a value greater than 0; a good starting point is a value at least as large as max_connections. In a test or development environment, it doesn’t hurt to set it to an even larger value (for example, 10000) to avoid errors.
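
A quick way to confirm that prepared transactions are enabled is to prepare and then commit a throwaway transaction; a minimal sketch (the transaction identifier is arbitrary):

    -- PREPARE TRANSACTION is rejected while max_prepared_transactions = 0.
    BEGIN;
    PREPARE TRANSACTION 'sanity_check';
    -- Prepared transactions stay visible here until committed or rolled back.
    SELECT gid, prepared, owner FROM pg_prepared_xacts;
    COMMIT PREPARED 'sanity_check';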

Separately, using stored procedures and functions can greatly improve performance by reducing round trips between the client and the server.

Memory configuration

    huge_pages = on                              # Default is try
    shared_buffers = <25%-40% of system memory>  # Default is 128MB
    work_mem = 32MB                              # Default is 4MB
    maintenance_work_mem = 2GB                   # Default is 64MB

Turning on huge_pages is not strictly required because the default, try, already uses huge pages when they are available.

However, explicitly setting it to on makes misconfiguration obvious: PostgreSQL will fail to start with an error if huge pages are not enabled in Linux, instead of silently falling back to regular pages.

shared_buffers is one of the most important configuration parameters that can be set. It determines how much memory can be used to store indexes and table data. It’s a cache that improves read/write latency and relieves pressure on storage. The PostgreSQL documentation suggests this be set to 25% - 40% of total system memory.
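
One way to judge whether shared_buffers is sized reasonably for a workload is the cache hit ratio tracked in pg_stat_database; a sketch:

    -- Approximate shared buffer hit ratio per database. A low ratio on a
    -- read-heavy workload can indicate that shared_buffers is undersized,
    -- although misses may still be served from the OS page cache.
    SELECT datname,
           round(100.0 * blks_hit / nullif(blks_hit + blks_read, 0), 2) AS hit_ratio_pct
    FROM pg_stat_database
    WHERE datname IS NOT NULL;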

work_mem is the maximum amount of memory a single query operation (such as a sort or hash) may use before spilling to temporary files on disk. Raising this significantly from the default value can help performance, but keep in mind that it applies per operation, so many concurrent queries can each consume multiples of this amount.
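
Whether work_mem is large enough for a particular query shows up in EXPLAIN ANALYZE output: sorts that fit in memory report an in-memory sort method, while undersized settings spill to disk. A sketch against a hypothetical table t:

    -- Hypothetical table and column. Look for "Sort Method: quicksort" (in memory)
    -- versus "Sort Method: external merge" (spilled to temporary files).
    SET work_mem = '32MB';
    EXPLAIN (ANALYZE, BUFFERS)
    SELECT * FROM t ORDER BY created_at;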

maintenance_work_mem is the memory used by maintenance operations such as VACUUM and CREATE INDEX. In a scenario where data is removed often, raising this can help performance.
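
maintenance_work_mem can also be raised for just the session that runs a heavy maintenance task instead of instance-wide; a sketch (the table name is hypothetical):

    -- Give this session extra memory for the vacuum, then run it.
    SET maintenance_work_mem = '2GB';
    VACUUM (VERBOSE, ANALYZE) orders;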

Processing and process count

    deadlock_timeout = 10s                      # Default is 1s
    max_worker_processes = <num_system_cpus>    # Default is 8

deadlock_timeout is how long a process waits on a lock before checking whether it is stuck in a deadlock. The documentation states that this check is expensive in CPU cycles, and that the default of 1s is probably the smallest value that should be used. Consider raising this timeout much higher to save some CPU cycles.

max_worker_processes is a key parameter for performance. It sets the maximum number of background worker processes the instance can run. A good starting point is to set this to the number of cores present on the PostgreSQL node.
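
All server processes, including the background workers that count against max_worker_processes, can be inspected in pg_stat_activity; a sketch:

    -- Breakdown of current processes: client backends, parallel workers,
    -- autovacuum workers, background writers, and so on.
    SELECT backend_type, count(*)
    FROM pg_stat_activity
    GROUP BY backend_type
    ORDER BY count(*) DESC;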

Write Ahead Log (WAL) configuration

    synchronous_commit = off    # Default is on
    max_wal_size = 20GB         # Default is 1GB
    min_wal_size = 1GB          # Default is 80MB
    wal_recycle = off           # Default is on

When synchronous_commit is on (the default), a commit does not report success to the client until its WAL records have been flushed to disk. Turning this off means the PostgreSQL instance reports success to clients sooner, which improves performance. It is safe to turn this off in most cases, but keep in mind that it increases the risk of losing the most recently committed transactions if there is a crash. It does not increase the risk of data corruption.
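
synchronous_commit can also be relaxed selectively rather than instance-wide, which limits the potential for lost transactions to the sessions or transactions that opt in; a sketch (the table is hypothetical):

    -- Keep the instance-wide default, but let this one transaction report
    -- success before its WAL has been flushed to disk.
    BEGIN;
    SET LOCAL synchronous_commit TO off;
    INSERT INTO events (payload) VALUES ('example');  -- hypothetical table
    COMMIT;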

In high load scenarios, checkpointing can happen very often. In fact, in testing with HammerDB, there may be so much checkpointing that PostgreSQL reports warnings. One way to reduce how often checkpoints occur is to increase max_wal_size. Setting it to 20GB can make the excessive checkpointing warnings go away. min_wal_size can also be increased to help absorb spikes in WAL usage under high load.
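
Whether checkpoints are being forced by WAL growth (rather than triggered by the regular timeout) can be checked in the checkpointer statistics; a sketch for releases up to PostgreSQL 16, where these counters are in pg_stat_bgwriter (newer releases move them to pg_stat_checkpointer):

    -- A high requested-to-timed ratio suggests checkpoints are driven by WAL
    -- volume and that max_wal_size may be too small for the workload.
    SELECT checkpoints_timed, checkpoints_req
    FROM pg_stat_bgwriter;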

wal_recycle does not impact performance directly. However, in scenarios where a large amount of data is being loaded (for example, restoring a database), turning this off will speed up the data load and reduce the chance of replication errors occurring if streaming replication is used.

Planner/Optimizer configuration

The optimizer (also called the planner) is responsible for working out the fastest way to execute each query. To do this, it combines statistics gathered about the data with cost estimates for operations such as sequential scans, index scans, and random page reads, and it takes into account how likely data is to be found in cache. Below are some parameters that affect the optimizer.

    effective_cache_size = <80% of system memory>    # Default is 4GB
    random_page_cost = 1.1                           # Default is 4.0

One key piece of information that a PostgreSQL instance will not have access to is the size of the OS page cache. effective_cache_size provides a way to inform PostgreSQL of the page cache size. Assuming the host is dedicated to running PostgreSQL, a good starting value is to set this to about 80% of total system memory. The value of this parameter should roughly be the shared buffer size and the OS page cache size combined. Use a tool like free while PostgreSQL is running to understand how much memory is being used for the OS page cache. This can help further refine the value from the suggested 80%. Also note that this parameter does not affect memory allocations.

How does effective_cache_size affect the optimizer and help performance?

When data is loaded into the PostgreSQL shared buffer, the same data may also be present in the page cache. It is also possible that data that isn’t in the shared buffer is present in the page cache. This second case creates a scenario where tuning effective_cache_size can help improve performance.

Sometimes PostgreSQL needs to read data that is not in the shared buffer, but it is in the page cache. From the perspective of PostgreSQL, there will be a shared buffer miss when it tries to read the data. When this happens, the PostgreSQL instance will assume that reading this data will be slow because it will come from disk. It assumes the data will come from disk because PostgreSQL has no way to know if the data is in the page cache. However, if it turns out that the data is present in the page cache, the data will be read faster than if it was read from disk.

If the page cache is large, it is far more likely that the data will in fact be in the page cache. effective_cache_size gives us a way to tell PostgreSQL that there is a lot of system memory used for the page cache, and thus, even if there is a shared buffer miss, it’s very possible it will be “saved” by the page cache. The bigger effective_cache_size is set, the more likely PostgreSQL is to favor doing something like trying to read an index that is not present in the shared buffer, over doing a sequential scan of a table that is in the shared buffer. Even with the overhead of moving the index to the shared buffer from the page cache, the index scan will likely be faster than a sequential scan from the shared buffer. On average, this should improve performance.

random_page_cost has a similar effect to effective_cache_size.

random_page_cost tells the optimizer the relative cost of fetching a page from storage with random access. The default of 4.0 is fairly conservative and is more appropriate for HDD based storage or when the shared buffer hit rate is below 90%. If the underlying storage is SSD based, it’s best to reduce this number; the same applies if the shared buffer hit rate is very high (90%+). The documentation suggests 1.1 for these cases.

How does random_page_cost affect the optimizer and help performance?

The effect on the planner/optimizer is similar to that of effective_cache_size. Basically, a lower random_page_cost tells the optimizer to favor something like reading an index from disk over doing a sequential table scan from the shared buffer. Also keep in mind that when PostgreSQL tries to read from disk, it may actually be reading from the page cache, which is faster.
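
The effect can be observed directly by comparing plans for the same query under different settings; a sketch against a hypothetical table with an index on created_at:

    -- With the conservative default the planner may prefer a sequential scan;
    -- with a lower random page cost it is more likely to choose the index.
    SET random_page_cost = 4.0;
    EXPLAIN SELECT * FROM orders WHERE created_at > now() - interval '7 days';
    SET random_page_cost = 1.1;
    EXPLAIN SELECT * FROM orders WHERE created_at > now() - interval '7 days';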

Concurrency configuration

Increasing parallelism uses available resources more efficiently. It’s always a good idea to look at parameters related to parallel execution.

    max_parallel_workers = <num_system_cpus>    # Default is 8
    max_parallel_workers_per_gather = 4         # Default is 2
    max_parallel_maintenance_workers = 4        # Default is 2
    effective_io_concurrency = 300              # Default is 1

max_parallel_workers sets how many parallel workers can be active at the same time. A good starting point for this parameter is to set it to match the number of cores in the system.

Doubling max_parallel_workers_per_gather and max_parallel_maintenance_workers from their default of 2 to 4 seems to provide the most benefit.
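
Whether a query actually uses the extra parallel workers shows up in its plan as a Gather node with a planned worker count; a sketch (the table is hypothetical):

    -- Look for "Gather" with "Workers Planned: 4" in the output; if the planner
    -- declines to parallelize, the per-gather limit or the table size may be why.
    SET max_parallel_workers_per_gather = 4;
    EXPLAIN (ANALYZE)
    SELECT count(*) FROM measurements;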

effective_io_concurrency affects how many parallel IO requests can be sent to storage. Modern storage technologies tend to allow a large number of IOPS, so setting this higher is advised. Also note that this parameter mainly affects bitmap heap scans. A bitmap heap scan is an “in between” method for processing a query: index scans (and index-only scans) are typically the fastest way to access data, sequential scans are typically the slowest, and a bitmap heap scan falls between these extremes.
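
For reference, a bitmap heap scan typically shows up for a range predicate on an indexed column that matches a moderate fraction of the table; a sketch (table and column are hypothetical):

    -- A "Bitmap Heap Scan" node fed by a "Bitmap Index Scan" is the access
    -- path whose prefetching is governed by effective_io_concurrency.
    EXPLAIN
    SELECT * FROM readings WHERE sensor_id BETWEEN 100 AND 500;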
