Git Internals

Commit Objects, Parents, and Messages

Show how commit objects connect trees, parent commits, and messages into the history graph.

Written by Lance MQ · 12 years of Git

Who This Is For

Readers building a durable Git mental model
Developers who keep running into history, ref, or recovery confusion

Prerequisites

Comfort reading basic Git output
A rough idea of commits, branches, and HEAD

Common Risks

Learning low-level terms without connecting them to commands
Collapsing objects, refs, and working state into one concept

Citations & Further Reading

Git Internals Git Objects [Book]
Git Basics Viewing the Commit History [Book]

What you will learn

Understand the core purpose of Commit Objects, Parents, and Messages
Master the basic usage and common options of Commit Objects, Parents, and Messages
Show how commit objects connect trees, parent commits, and messages into the history graph.
Understand key concepts: How commits link through parents
Know when to use this feature and when to avoid it

A commit is not just "some saved changes plus a message." It is the object that ties together a repository snapshot, parent history, authorship metadata, and message text into one node of the history graph.

Start with a problem

You use Git commands daily, but occasionally encounter 'strange' behavior — like being told a file changed when you didn't touch it, or unexpected conflicts during a rebase. You want to understand how Git works under the hood.

How commits link through parents

How Commits Link Through Parents to Form HistoryEach commit object contains a reference to a tree, one or more parent commits, author and committer metadata, and a commit message. Parent commits are the key links in the history graph.

Normal commit (one parent)

ABCD

HEAD points to: main

Merge commit (two parents)

ABCM

BEF

Rebased commits (new ID chain)

ABCE'F'

HEAD points to: feature

What a commit object contains

A commit object typically includes:

a pointer to a root tree
one or more parent commits
author and committer metadata
the commit message

Those fields are what let Git treat a commit as both:

a snapshot of repository state
a connected point in the broader history graph

Why parent links matter so much

The parent list is what gives Git history its shape.

a normal commit usually has one parent
an initial commit has none
a merge commit usually has two or more parents

When Git walks history, computes ancestry, decides whether a fast-forward is possible, or draws a graph, it is following these parent links.

Without parent relationships, Git would have snapshots but not a meaningful notion of history structure.

Why the message is part of the object identity

The commit message is not just an annotation outside the commit. It is part of the commit object content.

That means if you change the message, you change the object content, and therefore the commit ID changes too.

This is why commands like git commit --amend or history rewriting operations produce new commit IDs even if the file snapshot barely changed or did not change at all.

Why changing one commit can ripple through later history

If a commit gets a new identity, its child commits may also need new identities because their parent pointer changed.

That is one reason rebases and amends can rewrite a whole stretch of visible history:

one commit changes
descendant commits now point to a different parent
so they become new objects too

Use case 1: why amend creates a new commit

People often talk about amend as if it edits a commit in place.

Internally, it is better understood as:

create a new commit object
point the branch at the new object
leave the old commit behind until it becomes unreachable and eventually collectible

That mental model explains both the new SHA and the recovery patterns around reflog.

Use case 2: why merge commits look different

A merge commit is different because it records more than one parent.

That is how Git knows the merge commit joined multiple history lines. The commit message may explain the intent to humans, but the parent list is what makes the merge real structurally.

Use case 3: why log and graph tools can reconstruct history

Commands like git log --graph are not guessing from timestamps or branch names. They are traversing parent links stored in commit objects.

That is why parent structure is much more important than branch labels when it comes to how Git understands history.

Special case: author and committer are not always the same

Commit metadata can distinguish between:

who originally authored the change
who actually created the commit object in its current form

This becomes especially relevant in workflows involving rebases, cherry-picks, or patch application.

Special case: message-only rewrites still rewrite history

Because the message is part of the commit content, even a pure message rewrite still produces a new commit object. That is why changing "just the wording" is still history rewriting from Git's point of view.

Common misconceptions

"A commit is just a patch plus a label"

Not really. A commit points to a tree, records parents, and stores metadata and message content.

"Changing only the message should keep the same commit ID"

No. The message is part of the object content.

"Branch names define the history graph"

No. Branches are refs. Parent links inside commits define the graph.

Why this helps you understand commands

Once commit structure becomes clearer, it is easier to understand:

why amend changes SHAs
why rebase rewrites descendant commits
why merge commits are structurally different
why log commands can reconstruct ancestry
why branch names are pointers, not the history itself

Suggested follow-up

It pairs especially well with:

git commit
git commit --amend
git rebase
git log --graph
git cat-file

Try it yourself

Practice the commit-message-and-parents command in a test repository and observe state changes before and after
Experiment with different options and compare the output differences
Simulate a real scenario where you would need to use this, and walk through the full process

Previous / Next

PreviousBlob Objects and Content AddressingGit Internals NextRefspecs and Ref UpdatesGit Internals