Git Internals

Working Tree, Index, and Object Store

Separating the working tree, index, and object database is the clearest way to understand add, commit, restore, and reset.

Written by Lance MQ · 12 years of Git

Who This Is For

Readers building a durable Git mental model
Developers who keep running into history, ref, or recovery confusion

Prerequisites

Comfort reading basic Git output
A rough idea of commits, branches, and HEAD

Common Risks

Learning low-level terms without connecting them to commands
Collapsing objects, refs, and working state into one concept

Data & Performance

3 stagesduring a merge conflict the index holds base (1), ours (2), and theirs (3) per fileSource: gitglossary(7) / Pro Git §7.2

Key Quotes

The index is a binary file (generally kept in .git/index) that stores information about what is currently going into your next commit.
— Pro Git, 2nd Ed., §10.2 Git Objects

Citations & Further Reading

Git Basics Recording Changes to the Repository [Book]
Git Tools Reset Demystified [Book]

What you will learn

Understand the core purpose of Working Tree, Index, and Object Store
Master the basic usage and common options of Working Tree, Index, and Object Store
Separating the working tree, index, and object database is the clearest way to understand add, commit, restore, and reset.
Understand key concepts: Why this matters
Know when to use this feature and when to avoid it

Many Git problems become easier once these three layers are separated:

working tree: files you are editing now
index: the next snapshot being prepared
object store: durable Git objects already written

Working tree, index, and object storeMost Git confusion is really layer confusion. Once you know which command touches which layer, the behavior becomes much easier to predict.

Working treecheckout.tsedited and visible on disk

Indexnext snapshotstaged content for the next commit

Object storeblob / tree / commitdurable history recorded by Git

Start with a problem

You use Git commands daily, but occasionally encounter 'strange' behavior — like being told a file changed when you didn't touch it, or unexpected conflicts during a rebase. You want to understand how Git works under the hood.

Why this matters

The real difference between many commands is simply which layer they touch:

git add updates the index
git commit writes index state into durable objects
git restore can restore the working tree or the index
git reset can move refs and may also realign the index and working tree

If you treat the index as “the candidate snapshot for the next commit,” Git behavior becomes much more predictable.

The three layers in more detail

Working tree

The working tree is what you can see and edit directly in your filesystem right now.

It includes:

modified files you have not staged
untracked files
the current checked-out version of tracked files

This layer is for active editing, not for durable history.

Index

The index, also called the staging area, sits between the working tree and committed history.

Its job is to represent the exact snapshot Git will use for the next commit.

That is why git add is not just "save this file." It is more precisely:

take current content from the working tree
write it into the index as part of the next candidate snapshot

Object store

The object store contains durable Git objects such as:

blobs
trees
commits
tags

These are the objects that become part of repository history and internal structure.

Once data has been written into objects and connected by refs, it is in a much more durable layer than the working tree.

Why the index is the layer people most often miss

A lot of Git confusion comes from mentally collapsing the working tree and the index into one thing.

But Git keeps them separate on purpose:

the working tree can stay messy while you experiment
the index can be kept clean as the next precise commit snapshot

That is what makes partial staging and deliberate commit shaping possible.

Use case 1: why `git add` does not create a commit

After git add, your working tree may still contain more changes than the index does.

That is because the index is its own layer. You staged a snapshot candidate, not a finalized history entry.

Use case 2: why `git status` is so informative

git status becomes much easier to read once you realize it is mostly comparing these layers:

working tree vs index
index vs current commit

That is why it can separately tell you:

what is staged
what is modified but unstaged
what is untracked

Use case 3: why `git restore` and `git reset` can feel tricky

These commands can affect different layers depending on flags and mode.

That is why they feel complicated until you start asking:

is this changing the ref?
is this changing the index?
is this changing the working tree?

Once you ask those three questions, their behavior becomes much less mysterious.

Special case: the index is not just a temporary clipboard

Some people imagine the index as a vague buffer. It is more precise than that. It is Git's structured representation of the next commit snapshot.

That is why it matters so much for partial staging and commit crafting.

Common misconceptions

"Working tree and index are basically the same"

No. They are separate layers, and many commands are defined by which one they touch.

"`git add` saves work permanently"

Not by itself. It stages content into the index, but it does not create commit history yet.

"`git reset` is one single kind of action"

Not really. Reset can move refs and may also realign the index and working tree depending on the mode.

Why this helps you understand commands

Once the three-layer model is clear, it becomes much easier to reason about:

staging vs committing
unstaged vs staged changes
restore behavior
reset behavior
why status outputs are structured the way they are

Suggested follow-up

It pairs especially well with:

git add
git status
git restore
git reset
git commit

Try it yourself

Practice the index-and-working-tree command in a test repository and observe state changes before and after
Experiment with different options and compare the output differences
Simulate a real scenario where you would need to use this, and walk through the full process

Previous / Next

PreviousPlumbing and PorcelainGit Internals NextRepository Layout, .git, and GIT_DIRGit Internals