What causes the planner to use a sequential scan instead of an index?

The planner prefers sequential scans when the table is small, the indexed column has low selectivity, statistics are stale, or a function is applied to the indexed column. Refresh statistics with ANALYZE and create functional indexes when needed.

How does the N+1 query problem appear in SQL applications?

The N+1 problem occurs when an application fetches N records then executes one additional query per record for related data. Fix it by fetching all needed data with a single JOIN or by using an IN query followed by in-application joining.

← Back to SQL Mastery

SQL Performance Tuning: Master of EXPLAIN ANALYZE

Q: What is the difference between EXPLAIN and EXPLAIN ANALYZE?

EXPLAIN shows the query plan without executing the query, displaying estimated costs based on statistics. EXPLAIN ANALYZE actually runs the query and shows both estimated and actual row counts and execution times for real performance diagnosis.

In your first 23 modules, you learned how to write SQL. Now, you learn to make it Scream.

Most developers treat a query as a static command. But the database treats it as a Problem to Solve. The software that solves this problem is the Query Planner. To be a Senior Architect, you must be able to open the planner's head and read its thoughts. In SQL, we do this with EXPLAIN ANALYZE.

1. Hardware-Mirror: The Planner's Cost Model

The database doesn't just "Run" your query. It generates 100 different ways to run it and picks the one with the lowest Cost.

The Math of Cost

Every operation in a database has a physical cost:

seq_page_cost (1.0): Reading a 1MB file block sequentially.
random_page_cost (4.0): Jumping to a random place on the disk to read 1MB.
cpu_tuple_cost (0.01): The work the CPU does to process one row.

The Mirror Physics: The Planner uses the pg_statistic catalog. It knows how many rows are in your table and how many "Distinct Values" are in your columns. It does the math: "If I use the index, I'll have 500 random reads. If I read the table, I'll have 1,000 sequential reads. Random reads are 4x slower, so I'll just read the table."

2. Parsing the Plan: Scans and Joins

When you run EXPLAIN ANALYZE, you get a "Tree" of operations. Read it from the Inside Out.

The Scan Hierarchy

Sequential Scan (The Red Flag): The engine is reading every single page of the table. Usually means a missing index or low selectivity.
Index Scan: The engine finds a row in the index and then jumps to the heap for the data. Good for small result sets.
Index Only Scan (The Gold Standard): The engine stays in the index. No heap jump. Extremely fast.
Bitmap Heat Scan: A hybrid. The engine creates a "Map" of all rows from the index and then reads the heap in bulk. Great for medium-sized results.

The Join Physics

Nested Loop: For every row in Table A, search Table B. Perfect for small tables.
Hash Join: The engine creates a "Hash Table" from Table A in RAM and then scans Table B. Excellent for giant equality joins.
Merge Join: If both tables are already sorted by the join key, the engine "Steps" through them together. The fastest possible join.

3. Buffer Physics: Hit vs. Read

Speed is all about the Buffer Cache.

sql

EXPLAIN (ANALYZE, BUFFERS) 
SELECT * FROM orders WHERE user_id = 99;

Look for the Buffers line:

Shared Hit: The data was already in the database's RAM. Latency: Microseconds.
Shared Read: The data had to be fetched from the SSD. Latency: Milliseconds.
Shared Dirtied: Your query forced the database to "Clear out" some old data to make room. This indicates "Cache Pressure."

Architect's Standard: A well-tuned production database should have a Shared Hit Ratio of >95%. If yours is lower, you either have too little RAM or your queries are "Evicting" the cache with giant, unnecessary table scans.

4. Parallel Workers: The Multi-Threaded Mirror

In 2026, we have 64-core CPUs. Why run a query on just 1 core? Modern SQL engines use Parallel Query Execution.

When the Planner sees a giant table, it spawns "Worker Processes."
Each worker reads a chunk of the table.
A "Gather" node collects the results and merges them.
The Physics: If your EXPLAIN shows "Workers Planned: 4," your query is physically running 4x faster by utilizing the hardware lanes.

5. Case Study: The "Sorting" Bottleneck

A client had a report that was taking 10 seconds.

The EXPLAIN: It showed a Sort node with the comment Method: external merge Disk.
The Physics: The database didn't have enough RAM (work_mem) to sort the data in memory. It was writing temporary files to the SSD, sorting them there, and reading them back.
The Fix: We increased work_mem to 256MB.
The Result: The Sort node changed to Method: quicksort Memory. Accuracy: 50ms. We traded a tiny bit of RAM for a 200x increase in speed.

6. Summary: The Tuning Checklist

Always use (ANALYZE, BUFFERS): EXPLAIN alone is just a guess.
Hardware Alignment: Tune random_page_cost to match your actual SSD performance.
Eliminate Disk Sorts: Increase work_mem to ensure sorts stay in the high-speed CPU/RAM boundary.
Index Only is the Goal: Use INCLUDE to create "Fat Indexes" that eliminate heap-jumps.
Watch the WAL: Monitor WAL write timing for high-concurrency mutation queries.
JIT Compilation: For multi-column analytical aggregates, ensure JIT is compiling your SQL for maximum silicon efficiency.

Performance tuning is the "Art of the Engine." By mastering the cost model, the buffer cache, and the execution tree, you gain the power to make your systems faster than your competitors while using half the hardware. You graduate from "Building applications" to "Engineering High-Speed Infrastructure."

Phase 24: Tuning Mastery Action Items

Take a query that feels "Slow" (>200ms) and identify the innermost nodes of its EXPLAIN ANALYZE tree.
Measure the Shared Hit Ratio of your most frequent query.
Update your random_page_cost based on your production SSD performance.
Convert a heavy "Index Scan" into an Index Only Scan using the INCLUDE keyword.
Advanced Goal: Use pg_stat_statements to find the "Top 5 most expensive" queries in your entire database by total execution time.

Frequently Asked Questions

Q: What is the difference between EXPLAIN and EXPLAIN ANALYZE?

EXPLAIN shows the query plan the optimiser chose without actually executing the query - it displays estimated costs and row counts based on table statistics. EXPLAIN ANALYZE actually runs the query and shows both the estimated and actual row counts and execution times. Always use EXPLAIN ANALYZE for real performance diagnosis - the estimates in plain EXPLAIN can be significantly wrong if statistics are stale. For queries that modify data, wrap in a transaction and roll back: BEGIN; EXPLAIN ANALYZE UPDATE ...; ROLLBACK;.

Q: What causes a query plan to use a sequential scan instead of an index?

Several situations cause the planner to prefer a sequential scan: the table is small (seq scan is faster than index lookup for tiny tables), the index column has low selectivity (if 80% of rows match your WHERE condition, scanning all rows is faster than random index lookups), statistics are stale (ANALYZE to refresh), or the query uses a function on the indexed column (WHERE UPPER(email) = 'X' cannot use an index on email - create a functional index instead). When you believe an index should be used but is not, check selectivity with SELECT COUNT(*) FILTER (WHERE condition) * 100.0 / COUNT(*) FROM table.

Q: How does the N+1 query problem manifest in SQL applications and how do I fix it?

The N+1 problem occurs when an application fetches a list of N records and then executes one additional query per record to load related data - 1 query to fetch orders, then 1 query per order to fetch order lines, totalling N+1 database round trips. Fix it with a single JOIN that fetches all needed data at once, or with a single IN query for related records followed by in-application joining. ORMs often cause this automatically - enable SQL logging to detect it and use eager loading (JOIN FETCH in JPA, `includes` in Rails) to fix it.

Part of the SQL Mastery Course - engineering the speed.