Understanding Indexes in SQL Server — What, Why, and When to Use Them?

Vishal Gami
2h
73
0
0

Article

Introduction

When an application is small, every SQL query feels fast. But as data grows into thousands, millions, or even billions of rows, performance problems begin appearing. A SELECT query that took 20 milliseconds earlier may suddenly take 5 seconds or more. Developers often assume something is wrong with code or infrastructure, but the real bottleneck is frequently the absence (or misuse) of indexes.

Indexes are one of the most important performance optimization features in SQL Server. They allow the database engine to find rows faster, just like an index in a book helps you quickly locate a topic instead of reading every page.

This article explains indexes from a practical angle:

What indexes are
How they work internally
When to use them
When not to use them
Real-world examples and measurable impact

Real-World Context

Consider a ServiceRecord table storing service history for a large automobile service chain:

CREATE TABLE ServiceRecord (
    ServiceId INT PRIMARY KEY,
    CarId INT,
    ServiceDate DATE,
    Cost DECIMAL(10,2),
    Description NVARCHAR(500)
);

After 20 customers, everything is fast.

After 200,000 records, queries start slowing.

After 2 million records, basic SELECT queries may become painfully slow without indexing.

What Is an Index?

An index is a data structure that SQL Server uses to quickly locate rows without scanning the entire table.

Example

Query without an index:

SELECT * FROM ServiceRecord 
WHERE CarId = 101;

SQL Server must scan the whole table row by row. This is called a Table Scan.

Query with an index on CarId:

CREATE INDEX IX_ServiceRecord_CarId ON ServiceRecord(CarId);

Now SQL Server performs an Index Seek, which is significantly faster.

How Indexes Work Internally

SQL Server primarily uses a B-Tree structure.

Root Level
    ↓
Intermediate Nodes
    ↓
Leaf Level (actual row pointers)

Instead of searching every record, SQL follows this sorted tree structure, making lookups significantly faster.

Performance Comparison

Operation Type	With Index	Without Index
Table scan	Yes	Yes
Index seek	Yes	No
Lookup time	Fast (logarithmic)	Slow (linear)
Useful for large tables	Yes	No

Types of Indexes in SQL Server

1. Clustered Index

A clustered index defines how data is physically stored in the table.
A table can have only one clustered index.

By default, the Primary Key becomes the clustered index.

Example

CREATE CLUSTERED INDEX IX_ServiceRecord_ServiceId
ON ServiceRecord(ServiceId);

Clustered indexes are ideal for:

Primary keys
Columns used in sorting and range searches (BETWEEN, <, >, ORDER BY)

2. Non-Clustered Index

A non-clustered index contains a copy of the indexed column and a pointer to the actual data.

Example

CREATE NONCLUSTERED INDEX IX_ServiceRecord_ServiceDate
ON ServiceRecord(ServiceDate);

Use non-clustered indexes for:

Frequently searched columns
JOIN conditions
WHERE filters

3. Composite Index

Indexes on multiple columns.

Example

CREATE INDEX IX_ServiceRecord_CarId_ServiceDate
ON ServiceRecord(CarId, ServiceDate);

Useful when queries combine both fields.

4. Included Column Index

Adds non-key columns to improve coverage and avoid extra lookups.

Example

CREATE INDEX IX_ServiceRecord_CarId
ON ServiceRecord(CarId)
INCLUDE (Cost, ServiceDate);

Now SQL Server does not need to return to the table for those values.

When Should You Create an Index?

Create an index when:

A column is frequently used in WHERE
A column is frequently used in JOIN
Data retrieval is slow
Reports filter or sort data
The table contains large volumes of data

When You Should NOT Create an Index

Indexes are not free. They cost:

Disk storage
Memory usage
Slower INSERT, UPDATE, DELETE operations (because SQL must update indexes too)

Avoid indexing

Column Type	Reason
Columns with high data change frequency	Slows down writes
Very small tables	Table scan is already fast
Columns with many duplicate values	Poor selectivity
Boolean fields (Yes/No)	Not enough uniqueness

Case Study

A car service company noticed a dashboard query took 14 seconds:

SELECT CarId, SUM(Cost) AS TotalSpent
FROM ServiceRecord
GROUP BY CarId;

After creating an index on CarId:

CREATE INDEX IX_ServiceRecord_CarId 
ON ServiceRecord(CarId);

Execution dropped to 0.2 seconds.

This was a 70x performance improvement without changing a single line of application code.

Identifying Missing Indexes

SQL Server provides a dynamic view to suggest missing indexes:

SELECT 
    migs.user_seeks,
    mid.statement,
    mid.equality_columns,
    mid.included_columns
FROM sys.dm_db_missing_index_details mid
JOIN sys.dm_db_missing_index_group_stats migs 
    ON mid.index_handle = migs.group_handle
ORDER BY migs.user_seeks DESC;

This helps identify real-world performance gaps.

How to Check Performance Execution Plan

Always review the execution plan in SQL Server Management Studio using:

Right-click Query → Select "Include Actual Execution Plan"

Look for:

Index Seek (Good)
Table Scan (Bad on large tables)
Key Lookup (May indicate missing include columns)

Best Practices for Indexing

Rule	Best Practice
Primary Key	Always indexed (clustered recommended)
Foreign Keys	Should have non-clustered index
Frequently filtered columns	Add index
Composite index	Use most selective column first
Maintenance	Rebuild or reorganize indexes periodically

Example maintenance

ALTER INDEX ALL ON ServiceRecord REBUILD;

Common Beginner Mistakes

Mistake	Fix
Indexing every field	Only index frequently queried columns
Ignoring composite indexes	Use when multiple conditions are common
Forgetting INCLUDE	Use to avoid key lookups
No performance testing	Always check execution plans

Summary

Indexes are essential for database performance optimization. They help SQL Server find data faster and avoid scanning large table volumes.

You should now understand:

What indexes are
Types of indexes and when to use them
Real examples and performance impact
Risks of over-indexing
Best practices for production environments

Once you begin working with large datasets or enterprise reporting systems, indexing is no longer optional but necessary for efficient system performance.