Merge branch 'mm/gc-safety-doc'

Doc update.

* mm/gc-safety-doc:
  git-gc.txt: expand discussion of races with other processes
This commit is contained in:
Junio C Hamano 2017-01-10 15:24:25 -08:00
commit 979b82f19f
2 changed files with 29 additions and 9 deletions

View File

@ -1409,7 +1409,9 @@ gc.pruneExpire::
Override the grace period with this config variable. The value Override the grace period with this config variable. The value
"now" may be used to disable this grace period and always prune "now" may be used to disable this grace period and always prune
unreachable objects immediately, or "never" may be used to unreachable objects immediately, or "never" may be used to
suppress pruning. suppress pruning. This feature helps prevent corruption when
'git gc' runs concurrently with another process writing to the
repository; see the "NOTES" section of linkgit:git-gc[1].
gc.worktreePruneExpire:: gc.worktreePruneExpire::
When 'git gc' is run, it calls When 'git gc' is run, it calls

View File

@ -63,11 +63,10 @@ automatic consolidation of packs.
--prune=<date>:: --prune=<date>::
Prune loose objects older than date (default is 2 weeks ago, Prune loose objects older than date (default is 2 weeks ago,
overridable by the config variable `gc.pruneExpire`). overridable by the config variable `gc.pruneExpire`).
--prune=all prunes loose objects regardless of their age (do --prune=all prunes loose objects regardless of their age and
not use --prune=all unless you know exactly what you are doing. increases the risk of corruption if another process is writing to
Unless the repository is quiescent, you will lose newly created the repository concurrently; see "NOTES" below. --prune is on by
objects that haven't been anchored with the refs and end up default.
corrupting your repository). --prune is on by default.
--no-prune:: --no-prune::
Do not prune any loose objects. Do not prune any loose objects.
@ -138,17 +137,36 @@ default is "2 weeks ago".
Notes Notes
----- -----
'git gc' tries very hard to be safe about the garbage it collects. In 'git gc' tries very hard not to delete objects that are referenced
anywhere in your repository. In
particular, it will keep not only objects referenced by your current set particular, it will keep not only objects referenced by your current set
of branches and tags, but also objects referenced by the index, of branches and tags, but also objects referenced by the index,
remote-tracking branches, refs saved by 'git filter-branch' in remote-tracking branches, refs saved by 'git filter-branch' in
refs/original/, or reflogs (which may reference commits in branches refs/original/, or reflogs (which may reference commits in branches
that were later amended or rewound). that were later amended or rewound).
If you are expecting some objects to be deleted and they aren't, check
If you are expecting some objects to be collected and they aren't, check
all of those locations and decide whether it makes sense in your case to all of those locations and decide whether it makes sense in your case to
remove those references. remove those references.
On the other hand, when 'git gc' runs concurrently with another process,
there is a risk of it deleting an object that the other process is using
but hasn't created a reference to. This may just cause the other process
to fail or may corrupt the repository if the other process later adds a
reference to the deleted object. Git has two features that significantly
mitigate this problem:
. Any object with modification time newer than the `--prune` date is kept,
along with everything reachable from it.
. Most operations that add an object to the database update the
modification time of the object if it is already present so that #1
applies.
However, these features fall short of a complete solution, so users who
run commands concurrently have to live with some risk of corruption (which
seems to be low in practice) unless they turn off automatic garbage
collection with 'git config gc.auto 0'.
HOOKS HOOKS
----- -----