Update documentation related to sparsity and the skip-worktree bit

Make several small updates, to address a few documentation issues
I spotted:
  * sparse-checkout focused on "patterns" even though the inputs (and
    outputs in the case of `list`) are directories in cone-mode
  * The description section of the sparse-checkout documentation
    was a bit sparse (no pun intended), and focused more on internal
    mechanics rather than end user usage.  This made sense in the
    early days when the command was even more experimental, but let's
    adjust a bit to try to make it more approachable to end users who
    may want to consider using it.  Keep the scary backward
    compatibility warning, though; we're still hard at work trying to
    fix up commands to behave reasonably in sparse checkouts.
  * both read-tree and update-index tried to describe how to use the
    skip-worktree bit, but both predated the sparse-checkout command.
    The sparse-checkout command is a far easier mechanism to use and
    for users trying to reduce the size of their working tree, we
    should recommend users to look at it instead.
  * The update-index documentation pointed out that assume-unchanged
    and skip-worktree sounded similar but had different purposes.
    However, it made no attempt to explain the differences, only to
    point out that they were different.  Explain the differences.
  * The update-index documentation focused much more on (internal?)
    implementation details than on end-user usage.  Try to explain
    its purpose better for users of update-index, rather than
    fellow developers trying to work with the SKIP_WORKTREE bit.
  * Clarify that when core.sparseCheckout=true, we treat a file's
    presence in the working tree as being an override to the
    SKIP_WORKTREE bit (i.e. in sparse checkouts when the file is
    present we ignore the SKIP_WORKTREE bit).

Note that this commit, like many touching documentation, is best viewed
with the `--color-words` option to diff/log.

Signed-off-by: Elijah Newren <newren@gmail.com>
Signed-off-by: Junio C Hamano <gitster@pobox.com>
This commit is contained in:
Elijah Newren 2022-01-14 15:59:42 +00:00 committed by Junio C Hamano
parent af6a51875a
commit 9023535bd3
3 changed files with 97 additions and 46 deletions

View File

@ -375,9 +375,14 @@ have finished your work-in-progress), attempt the merge again.
SPARSE CHECKOUT SPARSE CHECKOUT
--------------- ---------------
Note: The `update-index` and `read-tree` primitives for supporting the
skip-worktree bit predated the introduction of
linkgit:git-sparse-checkout[1]. Users are encouraged to use
`sparse-checkout` in preference to these low-level primitives.
"Sparse checkout" allows populating the working directory sparsely. "Sparse checkout" allows populating the working directory sparsely.
It uses the skip-worktree bit (see linkgit:git-update-index[1]) to tell It uses the skip-worktree bit (see linkgit:git-update-index[1]) to
Git whether a file in the working directory is worth looking at. tell Git whether a file in the working directory is worth looking at.
'git read-tree' and other merge-based commands ('git merge', 'git 'git read-tree' and other merge-based commands ('git merge', 'git
checkout'...) can help maintaining the skip-worktree bitmap and working checkout'...) can help maintaining the skip-worktree bitmap and working
@ -385,7 +390,8 @@ directory update. `$GIT_DIR/info/sparse-checkout` is used to
define the skip-worktree reference bitmap. When 'git read-tree' needs define the skip-worktree reference bitmap. When 'git read-tree' needs
to update the working directory, it resets the skip-worktree bit in the index to update the working directory, it resets the skip-worktree bit in the index
based on this file, which uses the same syntax as .gitignore files. based on this file, which uses the same syntax as .gitignore files.
If an entry matches a pattern in this file, skip-worktree will not be If an entry matches a pattern in this file, or the entry corresponds to
a file present in the working tree, then skip-worktree will not be
set on that entry. Otherwise, skip-worktree will be set. set on that entry. Otherwise, skip-worktree will be set.
Then it compares the new skip-worktree value with the previous one. If Then it compares the new skip-worktree value with the previous one. If

View File

@ -3,9 +3,7 @@ git-sparse-checkout(1)
NAME NAME
---- ----
git-sparse-checkout - Initialize and modify the sparse-checkout git-sparse-checkout - Reduce your working tree to a subset of tracked files
configuration, which reduces the checkout to a set of paths
given by a list of patterns.
SYNOPSIS SYNOPSIS
@ -17,8 +15,20 @@ SYNOPSIS
DESCRIPTION DESCRIPTION
----------- -----------
Initialize and modify the sparse-checkout configuration, which reduces This command is used to create sparse checkouts, which means that it
the checkout to a set of paths given by a list of patterns. changes the working tree from having all tracked files present, to only
have a subset of them. It can also switch which subset of files are
present, or undo and go back to having all tracked files present in the
working copy.
The subset of files is chosen by providing a list of directories in
cone mode (which is recommended), or by providing a list of patterns
in non-cone mode.
When in a sparse-checkout, other Git commands behave a bit differently.
For example, switching branches will not update paths outside the
sparse-checkout directories/patterns, and `git commit -a` will not record
paths outside the sparse-checkout directories/patterns as deleted.
THIS COMMAND IS EXPERIMENTAL. ITS BEHAVIOR, AND THE BEHAVIOR OF OTHER THIS COMMAND IS EXPERIMENTAL. ITS BEHAVIOR, AND THE BEHAVIOR OF OTHER
COMMANDS IN THE PRESENCE OF SPARSE-CHECKOUTS, WILL LIKELY CHANGE IN COMMANDS IN THE PRESENCE OF SPARSE-CHECKOUTS, WILL LIKELY CHANGE IN
@ -28,7 +38,7 @@ THE FUTURE.
COMMANDS COMMANDS
-------- --------
'list':: 'list'::
Describe the patterns in the sparse-checkout file. Describe the directories or patterns in the sparse-checkout file.
'set':: 'set'::
Enable the necessary config settings Enable the necessary config settings
@ -38,20 +48,26 @@ COMMANDS
list of arguments following the 'set' subcommand. Update the list of arguments following the 'set' subcommand. Update the
working directory to match the new patterns. working directory to match the new patterns.
+ +
When the `--stdin` option is provided, the patterns are read from When the `--stdin` option is provided, the directories or patterns are
standard in as a newline-delimited list instead of from the arguments. read from standard in as a newline-delimited list instead of from the
arguments.
+ +
When `--cone` is passed or `core.sparseCheckoutCone` is enabled, the When `--cone` is passed or `core.sparseCheckoutCone` is enabled, the
input list is considered a list of directories instead of input list is considered a list of directories. This allows for
sparse-checkout patterns. This allows for better performance with a better performance with a limited set of patterns (see 'CONE PATTERN
limited set of patterns (see 'CONE PATTERN SET' below). Note that the SET' below). The input format matches the output of `git ls-tree
set command will write patterns to the sparse-checkout file to include --name-only`. This includes interpreting pathnames that begin with a
all files contained in those directories (recursively) as well as double quote (") as C-style quoted strings. Note that the set command
files that are siblings of ancestor directories. The input format will write patterns to the sparse-checkout file to include all files
matches the output of `git ls-tree --name-only`. This includes contained in those directories (recursively) as well as files that are
interpreting pathnames that begin with a double quote (") as C-style siblings of ancestor directories. This may become the default in the
quoted strings. This may become the default in the future; --no-cone future; --no-cone can be passed to request non-cone mode.
can be passed to request non-cone mode. +
When `--no-cone` is passed or `core.sparseCheckoutCone` is not enabled,
the input list is considered a list of patterns. This mode is harder
to use and less performant, and is thus not recommended. See the
"Sparse Checkout" section of linkgit:git-read-tree[1] and the "Pattern
Set" sections below for more details.
+ +
Use the `--[no-]sparse-index` option to use a sparse index (the Use the `--[no-]sparse-index` option to use a sparse index (the
default is to not use it). A sparse index reduces the size of the default is to not use it). A sparse index reduces the size of the
@ -69,11 +85,10 @@ understand the sparse directory entries index extension and may fail to
interact with your repository until it is disabled. interact with your repository until it is disabled.
'add':: 'add'::
Update the sparse-checkout file to include additional patterns. Update the sparse-checkout file to include additional directories
By default, these patterns are read from the command-line arguments, (in cone mode) or patterns (in non-cone mode). By default, these
but they can be read from stdin using the `--stdin` option. When directories or patterns are read from the command-line arguments,
`core.sparseCheckoutCone` is enabled, the given patterns are interpreted but they can be read from stdin using the `--stdin` option.
as directory names as in the 'set' subcommand.
'reapply':: 'reapply'::
Reapply the sparsity pattern rules to paths in the working tree. Reapply the sparsity pattern rules to paths in the working tree.
@ -117,13 +132,14 @@ decreased in utility.
SPARSE CHECKOUT SPARSE CHECKOUT
--------------- ---------------
"Sparse checkout" allows populating the working directory sparsely. "Sparse checkout" allows populating the working directory sparsely. It
It uses the skip-worktree bit (see linkgit:git-update-index[1]) to tell uses the skip-worktree bit (see linkgit:git-update-index[1]) to tell Git
Git whether a file in the working directory is worth looking at. If whether a file in the working directory is worth looking at. If the
the skip-worktree bit is set, then the file is ignored in the working skip-worktree bit is set, and the file is not present in the working tree,
directory. Git will avoid populating the contents of those files, which then its absence is ignored. Git will avoid populating the contents of
makes a sparse checkout helpful when working in a repository with many those files, which makes a sparse checkout helpful when working in a
files, but only a few are important to the current user. repository with many files, but only a few are important to the current
user.
The `$GIT_DIR/info/sparse-checkout` file is used to define the The `$GIT_DIR/info/sparse-checkout` file is used to define the
skip-worktree reference bitmap. When Git updates the working skip-worktree reference bitmap. When Git updates the working

View File

@ -351,6 +351,10 @@ unchanged". Note that "assume unchanged" bit is *not* set if
the index (use `git update-index --really-refresh` if you want the index (use `git update-index --really-refresh` if you want
to mark them as "assume unchanged"). to mark them as "assume unchanged").
Sometimes users confuse the assume-unchanged bit with the
skip-worktree bit. See the final paragraph in the "Skip-worktree bit"
section below for an explanation of the differences.
EXAMPLES EXAMPLES
-------- --------
@ -392,22 +396,47 @@ M foo.c
SKIP-WORKTREE BIT SKIP-WORKTREE BIT
----------------- -----------------
Skip-worktree bit can be defined in one (long) sentence: When reading Skip-worktree bit can be defined in one (long) sentence: Tell git to
an entry, if it is marked as skip-worktree, then Git pretends its avoid writing the file to the working directory when reasonably
working directory version is up to date and read the index version possible, and treat the file as unchanged when it is not
instead. present in the working directory.
To elaborate, "reading" means checking for file existence, reading Note that not all git commands will pay attention to this bit, and
file attributes or file content. The working directory version may be some only partially support it.
present or absent. If present, its content may match against the index
version or not. Writing is not affected by this bit, content safety The update-index flags and the read-tree capabilities relating to the
is still first priority. Note that Git _can_ update working directory skip-worktree bit predated the introduction of the
file, that is marked skip-worktree, if it is safe to do so (i.e. linkgit:git-sparse-checkout[1] command, which provides a much easier
working directory version matches index version) way to configure and handle the skip-worktree bits. If you want to
reduce your working tree to only deal with a subset of the files in
the repository, we strongly encourage the use of
linkgit:git-sparse-checkout[1] in preference to the low-level
update-index and read-tree primitives.
The primary purpose of the skip-worktree bit is to enable sparse
checkouts, i.e. to have working directories with only a subset of
paths present. When the skip-worktree bit is set, Git commands (such
as `switch`, `pull`, `merge`) will avoid writing these files.
However, these commands will sometimes write these files anyway in
important cases such as conflicts during a merge or rebase. Git
commands will also avoid treating the lack of such files as an
intentional deletion; for example `git add -u` will not not stage a
deletion for these files and `git commit -a` will not make a commit
deleting them either.
Although this bit looks similar to assume-unchanged bit, its goal is Although this bit looks similar to assume-unchanged bit, its goal is
different from assume-unchanged bit's. Skip-worktree also takes different. The assume-unchanged bit is for leaving the file in the
precedence over assume-unchanged bit when both are set. working tree but having Git omit checking it for changes and presuming
that the file has not been changed (though if it can determine without
stat'ing the file that it has changed, it is free to record the
changes). skip-worktree tells Git to ignore the absence of the file,
avoid updating it when possible with commands that normally update
much of the working directory (e.g. `checkout`, `switch`, `pull`,
etc.), and not have its absence be recorded in commits. Note that in
sparse checkouts (setup by `git sparse-checkout` or by configuring
core.sparseCheckout to true), if a file is marked as skip-worktree in
the index but is found in the working tree, Git will clear the
skip-worktree bit for that file.
SPLIT INDEX SPLIT INDEX
----------- -----------