52fe41ff1c
The previous change cleaned up loose objects using the 'loose-objects' that can be run safely in the background. Add a similar job that performs similar cleanups for pack-files. One issue with running 'git repack' is that it is designed to repack all pack-files into a single pack-file. While this is the most space-efficient way to store object data, it is not time or memory efficient. This becomes extremely important if the repo is so large that a user struggles to store two copies of the pack on their disk. Instead, perform an "incremental" repack by collecting a few small pack-files into a new pack-file. The multi-pack-index facilitates this process ever since 'git multi-pack-index expire' was added in19575c7
(multi-pack-index: implement 'expire' subcommand, 2019-06-10) and 'git multi-pack-index repack' was added ince1e4a1
(midx: implement midx_repack(), 2019-06-10). The 'incremental-repack' task runs the following steps: 1. 'git multi-pack-index write' creates a multi-pack-index file if one did not exist, and otherwise will update the multi-pack-index with any new pack-files that appeared since the last write. This is particularly relevant with the background fetch job. When the multi-pack-index sees two copies of the same object, it stores the offset data into the newer pack-file. This means that some old pack-files could become "unreferenced" which I will use to mean "a pack-file that is in the pack-file list of the multi-pack-index but none of the objects in the multi-pack-index reference a location inside that pack-file." 2. 'git multi-pack-index expire' deletes any unreferenced pack-files and updaes the multi-pack-index to drop those pack-files from the list. This is safe to do as concurrent Git processes will see the multi-pack-index and not open those packs when looking for object contents. (Similar to the 'loose-objects' job, there are some Git commands that open pack-files regardless of the multi-pack-index, but they are rarely used. Further, a user that self-selects to use background operations would likely refrain from using those commands.) 3. 'git multi-pack-index repack --bacth-size=<size>' collects a set of pack-files that are listed in the multi-pack-index and creates a new pack-file containing the objects whose offsets are listed by the multi-pack-index to be in those objects. The set of pack- files is selected greedily by sorting the pack-files by modified time and adding a pack-file to the set if its "expected size" is smaller than the batch size until the total expected size of the selected pack-files is at least the batch size. The "expected size" is calculated by taking the size of the pack-file divided by the number of objects in the pack-file and multiplied by the number of objects from the multi-pack-index with offset in that pack-file. The expected size approximates how much data from that pack-file will contribute to the resulting pack-file size. The intention is that the resulting pack-file will be close in size to the provided batch size. The next run of the incremental-repack task will delete these repacked pack-files during the 'expire' step. In this version, the batch size is set to "0" which ignores the size restrictions when selecting the pack-files. It instead selects all pack-files and repacks all packed objects into a single pack-file. This will be updated in the next change, but it requires doing some calculations that are better isolated to a separate change. These steps are based on a similar background maintenance step in Scalar (and VFS for Git) [1]. This was incredibly effective for users of the Windows OS repository. After using the same VFS for Git repository for over a year, some users had _thousands_ of pack-files that combined to up to 250 GB of data. We noticed a few users were running into the open file descriptor limits (due in part to a bug in the multi-pack-index fixed byaf96fe3
(midx: add packs to packed_git linked list, 2019-04-29). These pack-files were mostly small since they contained the commits and trees that were pushed to the origin in a given hour. The GVFS protocol includes a "prefetch" step that asks for pre-computed pack- files containing commits and trees by timestamp. These pack-files were grouped into "daily" pack-files once a day for up to 30 days. If a user did not request prefetch packs for over 30 days, then they would get the entire history of commits and trees in a new, large pack-file. This led to a large number of pack-files that had poor delta compression. By running this pack-file maintenance step once per day, these repos with thousands of packs spanning 200+ GB dropped to dozens of pack- files spanning 30-50 GB. This was done all without removing objects from the system and using a constant batch size of two gigabytes. Once the work was done to reduce the pack-files to small sizes, the batch size of two gigabytes means that not every run triggers a repack operation, so the following run will not expire a pack-file. This has kept these repos in a "clean" state. [1] https://github.com/microsoft/scalar/blob/master/Scalar.Common/Maintenance/PackfileMaintenanceStep.cs Signed-off-by: Derrick Stolee <dstolee@microsoft.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>
666 lines
19 KiB
Bash
Executable File
666 lines
19 KiB
Bash
Executable File
#!/bin/sh
|
|
|
|
test_description='multi-pack-indexes'
|
|
. ./test-lib.sh
|
|
|
|
GIT_TEST_MULTI_PACK_INDEX=0
|
|
objdir=.git/objects
|
|
|
|
midx_read_expect () {
|
|
NUM_PACKS=$1
|
|
NUM_OBJECTS=$2
|
|
NUM_CHUNKS=$3
|
|
OBJECT_DIR=$4
|
|
EXTRA_CHUNKS="$5"
|
|
{
|
|
cat <<-EOF &&
|
|
header: 4d494458 1 $NUM_CHUNKS $NUM_PACKS
|
|
chunks: pack-names oid-fanout oid-lookup object-offsets$EXTRA_CHUNKS
|
|
num_objects: $NUM_OBJECTS
|
|
packs:
|
|
EOF
|
|
if test $NUM_PACKS -ge 1
|
|
then
|
|
ls $OBJECT_DIR/pack/ | grep idx | sort
|
|
fi &&
|
|
printf "object-dir: $OBJECT_DIR\n"
|
|
} >expect &&
|
|
test-tool read-midx $OBJECT_DIR >actual &&
|
|
test_cmp expect actual
|
|
}
|
|
|
|
test_expect_success 'setup' '
|
|
test_oid_cache <<-EOF
|
|
idxoff sha1:2999
|
|
idxoff sha256:3739
|
|
|
|
packnameoff sha1:652
|
|
packnameoff sha256:940
|
|
|
|
fanoutoff sha1:1
|
|
fanoutoff sha256:3
|
|
EOF
|
|
'
|
|
|
|
test_expect_success "don't write midx with no packs" '
|
|
test_must_fail git multi-pack-index --object-dir=. write &&
|
|
test_path_is_missing pack/multi-pack-index
|
|
'
|
|
|
|
test_expect_success "Warn if a midx contains no oid" '
|
|
cp "$TEST_DIRECTORY"/t5319/no-objects.midx $objdir/pack/multi-pack-index &&
|
|
test_must_fail git multi-pack-index verify &&
|
|
rm $objdir/pack/multi-pack-index
|
|
'
|
|
|
|
generate_objects () {
|
|
i=$1
|
|
iii=$(printf '%03i' $i)
|
|
{
|
|
test-tool genrandom "bar" 200 &&
|
|
test-tool genrandom "baz $iii" 50
|
|
} >wide_delta_$iii &&
|
|
{
|
|
test-tool genrandom "foo"$i 100 &&
|
|
test-tool genrandom "foo"$(( $i + 1 )) 100 &&
|
|
test-tool genrandom "foo"$(( $i + 2 )) 100
|
|
} >deep_delta_$iii &&
|
|
{
|
|
echo $iii &&
|
|
test-tool genrandom "$iii" 8192
|
|
} >file_$iii &&
|
|
git update-index --add file_$iii deep_delta_$iii wide_delta_$iii
|
|
}
|
|
|
|
commit_and_list_objects () {
|
|
{
|
|
echo 101 &&
|
|
test-tool genrandom 100 8192;
|
|
} >file_101 &&
|
|
git update-index --add file_101 &&
|
|
tree=$(git write-tree) &&
|
|
commit=$(git commit-tree $tree -p HEAD</dev/null) &&
|
|
{
|
|
echo $tree &&
|
|
git ls-tree $tree | sed -e "s/.* \\([0-9a-f]*\\) .*/\\1/"
|
|
} >obj-list &&
|
|
git reset --hard $commit
|
|
}
|
|
|
|
test_expect_success 'create objects' '
|
|
test_commit initial &&
|
|
for i in $(test_seq 1 5)
|
|
do
|
|
generate_objects $i
|
|
done &&
|
|
commit_and_list_objects
|
|
'
|
|
|
|
test_expect_success 'write midx with one v1 pack' '
|
|
pack=$(git pack-objects --index-version=1 $objdir/pack/test <obj-list) &&
|
|
test_when_finished rm $objdir/pack/test-$pack.pack \
|
|
$objdir/pack/test-$pack.idx $objdir/pack/multi-pack-index &&
|
|
git multi-pack-index --object-dir=$objdir write &&
|
|
midx_read_expect 1 18 4 $objdir
|
|
'
|
|
|
|
midx_git_two_modes () {
|
|
git -c core.multiPackIndex=false $1 >expect &&
|
|
git -c core.multiPackIndex=true $1 >actual &&
|
|
if [ "$2" = "sorted" ]
|
|
then
|
|
sort <expect >expect.sorted &&
|
|
mv expect.sorted expect &&
|
|
sort <actual >actual.sorted &&
|
|
mv actual.sorted actual
|
|
fi &&
|
|
test_cmp expect actual
|
|
}
|
|
|
|
compare_results_with_midx () {
|
|
MSG=$1
|
|
test_expect_success "check normal git operations: $MSG" '
|
|
midx_git_two_modes "rev-list --objects --all" &&
|
|
midx_git_two_modes "log --raw" &&
|
|
midx_git_two_modes "count-objects --verbose" &&
|
|
midx_git_two_modes "cat-file --batch-all-objects --batch-check" &&
|
|
midx_git_two_modes "cat-file --batch-all-objects --batch-check --unordered" sorted
|
|
'
|
|
}
|
|
|
|
test_expect_success 'write midx with one v2 pack' '
|
|
git pack-objects --index-version=2,0x40 $objdir/pack/test <obj-list &&
|
|
git multi-pack-index --object-dir=$objdir write &&
|
|
midx_read_expect 1 18 4 $objdir
|
|
'
|
|
|
|
compare_results_with_midx "one v2 pack"
|
|
|
|
test_expect_success 'corrupt idx not opened' '
|
|
idx=$(test-tool read-midx $objdir | grep "\.idx\$") &&
|
|
mv $objdir/pack/$idx backup-$idx &&
|
|
test_when_finished "mv backup-\$idx \$objdir/pack/\$idx" &&
|
|
|
|
# This is the minimum size for a sha-1 based .idx; this lets
|
|
# us pass perfunctory tests, but anything that actually opens and reads
|
|
# the idx file will complain.
|
|
test_copy_bytes 1064 <backup-$idx >$objdir/pack/$idx &&
|
|
|
|
git -c core.multiPackIndex=true rev-list --objects --all 2>err &&
|
|
test_must_be_empty err
|
|
'
|
|
|
|
test_expect_success 'add more objects' '
|
|
for i in $(test_seq 6 10)
|
|
do
|
|
generate_objects $i
|
|
done &&
|
|
commit_and_list_objects
|
|
'
|
|
|
|
test_expect_success 'write midx with two packs' '
|
|
git pack-objects --index-version=1 $objdir/pack/test-2 <obj-list &&
|
|
git multi-pack-index --object-dir=$objdir write &&
|
|
midx_read_expect 2 34 4 $objdir
|
|
'
|
|
|
|
compare_results_with_midx "two packs"
|
|
|
|
test_expect_success 'write progress off for redirected stderr' '
|
|
git multi-pack-index --object-dir=$objdir write 2>err &&
|
|
test_line_count = 0 err
|
|
'
|
|
|
|
test_expect_success 'write force progress on for stderr' '
|
|
GIT_PROGRESS_DELAY=0 git multi-pack-index --object-dir=$objdir --progress write 2>err &&
|
|
test_file_not_empty err
|
|
'
|
|
|
|
test_expect_success 'write with the --no-progress option' '
|
|
GIT_PROGRESS_DELAY=0 git multi-pack-index --object-dir=$objdir --no-progress write 2>err &&
|
|
test_line_count = 0 err
|
|
'
|
|
|
|
test_expect_success 'add more packs' '
|
|
for j in $(test_seq 11 20)
|
|
do
|
|
generate_objects $j &&
|
|
commit_and_list_objects &&
|
|
git pack-objects --index-version=2 $objdir/pack/test-pack <obj-list
|
|
done
|
|
'
|
|
|
|
compare_results_with_midx "mixed mode (two packs + extra)"
|
|
|
|
test_expect_success 'write midx with twelve packs' '
|
|
git multi-pack-index --object-dir=$objdir write &&
|
|
midx_read_expect 12 74 4 $objdir
|
|
'
|
|
|
|
compare_results_with_midx "twelve packs"
|
|
|
|
test_expect_success 'verify multi-pack-index success' '
|
|
git multi-pack-index verify --object-dir=$objdir
|
|
'
|
|
|
|
test_expect_success 'verify progress off for redirected stderr' '
|
|
git multi-pack-index verify --object-dir=$objdir 2>err &&
|
|
test_line_count = 0 err
|
|
'
|
|
|
|
test_expect_success 'verify force progress on for stderr' '
|
|
git multi-pack-index verify --object-dir=$objdir --progress 2>err &&
|
|
test_file_not_empty err
|
|
'
|
|
|
|
test_expect_success 'verify with the --no-progress option' '
|
|
git multi-pack-index verify --object-dir=$objdir --no-progress 2>err &&
|
|
test_line_count = 0 err
|
|
'
|
|
|
|
# usage: corrupt_midx_and_verify <pos> <data> <objdir> <string>
|
|
corrupt_midx_and_verify() {
|
|
POS=$1 &&
|
|
DATA="${2:-\0}" &&
|
|
OBJDIR=$3 &&
|
|
GREPSTR="$4" &&
|
|
COMMAND="$5" &&
|
|
if test -z "$COMMAND"
|
|
then
|
|
COMMAND="git multi-pack-index verify --object-dir=$OBJDIR"
|
|
fi &&
|
|
FILE=$OBJDIR/pack/multi-pack-index &&
|
|
chmod a+w $FILE &&
|
|
test_when_finished mv midx-backup $FILE &&
|
|
cp $FILE midx-backup &&
|
|
printf "$DATA" | dd of="$FILE" bs=1 seek="$POS" conv=notrunc &&
|
|
test_must_fail $COMMAND 2>test_err &&
|
|
grep -v "^+" test_err >err &&
|
|
test_i18ngrep "$GREPSTR" err
|
|
}
|
|
|
|
test_expect_success 'verify bad signature' '
|
|
corrupt_midx_and_verify 0 "\00" $objdir \
|
|
"multi-pack-index signature"
|
|
'
|
|
|
|
HASH_LEN=$(test_oid rawsz)
|
|
NUM_OBJECTS=74
|
|
MIDX_BYTE_VERSION=4
|
|
MIDX_BYTE_OID_VERSION=5
|
|
MIDX_BYTE_CHUNK_COUNT=6
|
|
MIDX_HEADER_SIZE=12
|
|
MIDX_BYTE_CHUNK_ID=$MIDX_HEADER_SIZE
|
|
MIDX_BYTE_CHUNK_OFFSET=$(($MIDX_HEADER_SIZE + 4))
|
|
MIDX_NUM_CHUNKS=5
|
|
MIDX_CHUNK_LOOKUP_WIDTH=12
|
|
MIDX_OFFSET_PACKNAMES=$(($MIDX_HEADER_SIZE + \
|
|
$MIDX_NUM_CHUNKS * $MIDX_CHUNK_LOOKUP_WIDTH))
|
|
MIDX_BYTE_PACKNAME_ORDER=$(($MIDX_OFFSET_PACKNAMES + 2))
|
|
MIDX_OFFSET_OID_FANOUT=$(($MIDX_OFFSET_PACKNAMES + $(test_oid packnameoff)))
|
|
MIDX_OID_FANOUT_WIDTH=4
|
|
MIDX_BYTE_OID_FANOUT_ORDER=$((MIDX_OFFSET_OID_FANOUT + 250 * $MIDX_OID_FANOUT_WIDTH + $(test_oid fanoutoff)))
|
|
MIDX_OFFSET_OID_LOOKUP=$(($MIDX_OFFSET_OID_FANOUT + 256 * $MIDX_OID_FANOUT_WIDTH))
|
|
MIDX_BYTE_OID_LOOKUP=$(($MIDX_OFFSET_OID_LOOKUP + 16 * $HASH_LEN))
|
|
MIDX_OFFSET_OBJECT_OFFSETS=$(($MIDX_OFFSET_OID_LOOKUP + $NUM_OBJECTS * $HASH_LEN))
|
|
MIDX_OFFSET_WIDTH=8
|
|
MIDX_BYTE_PACK_INT_ID=$(($MIDX_OFFSET_OBJECT_OFFSETS + 16 * $MIDX_OFFSET_WIDTH + 2))
|
|
MIDX_BYTE_OFFSET=$(($MIDX_OFFSET_OBJECT_OFFSETS + 16 * $MIDX_OFFSET_WIDTH + 6))
|
|
|
|
test_expect_success 'verify bad version' '
|
|
corrupt_midx_and_verify $MIDX_BYTE_VERSION "\00" $objdir \
|
|
"multi-pack-index version"
|
|
'
|
|
|
|
test_expect_success 'verify bad OID version' '
|
|
corrupt_midx_and_verify $MIDX_BYTE_OID_VERSION "\02" $objdir \
|
|
"hash version"
|
|
'
|
|
|
|
test_expect_success 'verify truncated chunk count' '
|
|
corrupt_midx_and_verify $MIDX_BYTE_CHUNK_COUNT "\01" $objdir \
|
|
"missing required"
|
|
'
|
|
|
|
test_expect_success 'verify extended chunk count' '
|
|
corrupt_midx_and_verify $MIDX_BYTE_CHUNK_COUNT "\07" $objdir \
|
|
"terminating multi-pack-index chunk id appears earlier than expected"
|
|
'
|
|
|
|
test_expect_success 'verify missing required chunk' '
|
|
corrupt_midx_and_verify $MIDX_BYTE_CHUNK_ID "\01" $objdir \
|
|
"missing required"
|
|
'
|
|
|
|
test_expect_success 'verify invalid chunk offset' '
|
|
corrupt_midx_and_verify $MIDX_BYTE_CHUNK_OFFSET "\01" $objdir \
|
|
"invalid chunk offset (too large)"
|
|
'
|
|
|
|
test_expect_success 'verify packnames out of order' '
|
|
corrupt_midx_and_verify $MIDX_BYTE_PACKNAME_ORDER "z" $objdir \
|
|
"pack names out of order"
|
|
'
|
|
|
|
test_expect_success 'verify packnames out of order' '
|
|
corrupt_midx_and_verify $MIDX_BYTE_PACKNAME_ORDER "a" $objdir \
|
|
"failed to load pack"
|
|
'
|
|
|
|
test_expect_success 'verify oid fanout out of order' '
|
|
corrupt_midx_and_verify $MIDX_BYTE_OID_FANOUT_ORDER "\01" $objdir \
|
|
"oid fanout out of order"
|
|
'
|
|
|
|
test_expect_success 'verify oid lookup out of order' '
|
|
corrupt_midx_and_verify $MIDX_BYTE_OID_LOOKUP "\00" $objdir \
|
|
"oid lookup out of order"
|
|
'
|
|
|
|
test_expect_success 'verify incorrect pack-int-id' '
|
|
corrupt_midx_and_verify $MIDX_BYTE_PACK_INT_ID "\07" $objdir \
|
|
"bad pack-int-id"
|
|
'
|
|
|
|
test_expect_success 'verify incorrect offset' '
|
|
corrupt_midx_and_verify $MIDX_BYTE_OFFSET "\377" $objdir \
|
|
"incorrect object offset"
|
|
'
|
|
|
|
test_expect_success 'git-fsck incorrect offset' '
|
|
corrupt_midx_and_verify $MIDX_BYTE_OFFSET "\377" $objdir \
|
|
"incorrect object offset" \
|
|
"git -c core.multipackindex=true fsck"
|
|
'
|
|
|
|
test_expect_success 'repack progress off for redirected stderr' '
|
|
GIT_PROGRESS_DELAY=0 git multi-pack-index --object-dir=$objdir repack 2>err &&
|
|
test_line_count = 0 err
|
|
'
|
|
|
|
test_expect_success 'repack force progress on for stderr' '
|
|
GIT_PROGRESS_DELAY=0 git multi-pack-index --object-dir=$objdir --progress repack 2>err &&
|
|
test_file_not_empty err
|
|
'
|
|
|
|
test_expect_success 'repack with the --no-progress option' '
|
|
GIT_PROGRESS_DELAY=0 git multi-pack-index --object-dir=$objdir --no-progress repack 2>err &&
|
|
test_line_count = 0 err
|
|
'
|
|
|
|
test_expect_success 'repack removes multi-pack-index' '
|
|
test_path_is_file $objdir/pack/multi-pack-index &&
|
|
GIT_TEST_MULTI_PACK_INDEX=0 git repack -adf &&
|
|
test_path_is_missing $objdir/pack/multi-pack-index
|
|
'
|
|
|
|
compare_results_with_midx "after repack"
|
|
|
|
test_expect_success 'multi-pack-index and pack-bitmap' '
|
|
git -c repack.writeBitmaps=true repack -ad &&
|
|
git multi-pack-index write &&
|
|
git rev-list --test-bitmap HEAD
|
|
'
|
|
|
|
test_expect_success 'multi-pack-index and alternates' '
|
|
git init --bare alt.git &&
|
|
echo $(pwd)/alt.git/objects >.git/objects/info/alternates &&
|
|
echo content1 >file1 &&
|
|
altblob=$(GIT_DIR=alt.git git hash-object -w file1) &&
|
|
git cat-file blob $altblob &&
|
|
git rev-list --all
|
|
'
|
|
|
|
compare_results_with_midx "with alternate (local midx)"
|
|
|
|
test_expect_success 'multi-pack-index in an alternate' '
|
|
mv .git/objects/pack/* alt.git/objects/pack &&
|
|
test_commit add_local_objects &&
|
|
git repack --local &&
|
|
git multi-pack-index write &&
|
|
midx_read_expect 1 3 4 $objdir &&
|
|
git reset --hard HEAD~1 &&
|
|
rm -f .git/objects/pack/*
|
|
'
|
|
|
|
compare_results_with_midx "with alternate (remote midx)"
|
|
|
|
# usage: corrupt_data <file> <pos> [<data>]
|
|
corrupt_data () {
|
|
file=$1
|
|
pos=$2
|
|
data="${3:-\0}"
|
|
printf "$data" | dd of="$file" bs=1 seek="$pos" conv=notrunc
|
|
}
|
|
|
|
# Force 64-bit offsets by manipulating the idx file.
|
|
# This makes the IDX file _incorrect_ so be careful to clean up after!
|
|
test_expect_success 'force some 64-bit offsets with pack-objects' '
|
|
mkdir objects64 &&
|
|
mkdir objects64/pack &&
|
|
for i in $(test_seq 1 11)
|
|
do
|
|
generate_objects 11
|
|
done &&
|
|
commit_and_list_objects &&
|
|
pack64=$(git pack-objects --index-version=2,0x40 objects64/pack/test-64 <obj-list) &&
|
|
idx64=objects64/pack/test-64-$pack64.idx &&
|
|
chmod u+w $idx64 &&
|
|
corrupt_data $idx64 $(test_oid idxoff) "\02" &&
|
|
midx64=$(git multi-pack-index --object-dir=objects64 write) &&
|
|
midx_read_expect 1 63 5 objects64 " large-offsets"
|
|
'
|
|
|
|
test_expect_success 'verify multi-pack-index with 64-bit offsets' '
|
|
git multi-pack-index verify --object-dir=objects64
|
|
'
|
|
|
|
NUM_OBJECTS=63
|
|
MIDX_OFFSET_OID_FANOUT=$((MIDX_OFFSET_PACKNAMES + 54))
|
|
MIDX_OFFSET_OID_LOOKUP=$((MIDX_OFFSET_OID_FANOUT + 256 * $MIDX_OID_FANOUT_WIDTH))
|
|
MIDX_OFFSET_OBJECT_OFFSETS=$(($MIDX_OFFSET_OID_LOOKUP + $NUM_OBJECTS * $HASH_LEN))
|
|
MIDX_OFFSET_LARGE_OFFSETS=$(($MIDX_OFFSET_OBJECT_OFFSETS + $NUM_OBJECTS * $MIDX_OFFSET_WIDTH))
|
|
MIDX_BYTE_LARGE_OFFSET=$(($MIDX_OFFSET_LARGE_OFFSETS + 3))
|
|
|
|
test_expect_success 'verify incorrect 64-bit offset' '
|
|
corrupt_midx_and_verify $MIDX_BYTE_LARGE_OFFSET "\07" objects64 \
|
|
"incorrect object offset"
|
|
'
|
|
|
|
test_expect_success 'setup expire tests' '
|
|
mkdir dup &&
|
|
(
|
|
cd dup &&
|
|
git init &&
|
|
test-tool genrandom "data" 4096 >large_file.txt &&
|
|
git update-index --add large_file.txt &&
|
|
for i in $(test_seq 1 20)
|
|
do
|
|
test_commit $i
|
|
done &&
|
|
git branch A HEAD &&
|
|
git branch B HEAD~8 &&
|
|
git branch C HEAD~13 &&
|
|
git branch D HEAD~16 &&
|
|
git branch E HEAD~18 &&
|
|
git pack-objects --revs .git/objects/pack/pack-A <<-EOF &&
|
|
refs/heads/A
|
|
^refs/heads/B
|
|
EOF
|
|
git pack-objects --revs .git/objects/pack/pack-B <<-EOF &&
|
|
refs/heads/B
|
|
^refs/heads/C
|
|
EOF
|
|
git pack-objects --revs .git/objects/pack/pack-C <<-EOF &&
|
|
refs/heads/C
|
|
^refs/heads/D
|
|
EOF
|
|
git pack-objects --revs .git/objects/pack/pack-D <<-EOF &&
|
|
refs/heads/D
|
|
^refs/heads/E
|
|
EOF
|
|
git pack-objects --revs .git/objects/pack/pack-E <<-EOF &&
|
|
refs/heads/E
|
|
EOF
|
|
git multi-pack-index write &&
|
|
cp -r .git/objects/pack .git/objects/pack-backup
|
|
)
|
|
'
|
|
|
|
test_expect_success 'expire does not remove any packs' '
|
|
(
|
|
cd dup &&
|
|
ls .git/objects/pack >expect &&
|
|
git multi-pack-index expire &&
|
|
ls .git/objects/pack >actual &&
|
|
test_cmp expect actual
|
|
)
|
|
'
|
|
|
|
test_expect_success 'expire progress off for redirected stderr' '
|
|
(
|
|
cd dup &&
|
|
git multi-pack-index expire 2>err &&
|
|
test_line_count = 0 err
|
|
)
|
|
'
|
|
|
|
test_expect_success 'expire force progress on for stderr' '
|
|
(
|
|
cd dup &&
|
|
GIT_PROGRESS_DELAY=0 git multi-pack-index --progress expire 2>err &&
|
|
test_file_not_empty err
|
|
)
|
|
'
|
|
|
|
test_expect_success 'expire with the --no-progress option' '
|
|
(
|
|
cd dup &&
|
|
GIT_PROGRESS_DELAY=0 git multi-pack-index --no-progress expire 2>err &&
|
|
test_line_count = 0 err
|
|
)
|
|
'
|
|
|
|
test_expect_success 'expire removes unreferenced packs' '
|
|
(
|
|
cd dup &&
|
|
git pack-objects --revs .git/objects/pack/pack-combined <<-EOF &&
|
|
refs/heads/A
|
|
^refs/heads/C
|
|
EOF
|
|
git multi-pack-index write &&
|
|
ls .git/objects/pack | grep -v -e pack-[AB] >expect &&
|
|
git multi-pack-index expire &&
|
|
ls .git/objects/pack >actual &&
|
|
test_cmp expect actual &&
|
|
ls .git/objects/pack/ | grep idx >expect-idx &&
|
|
test-tool read-midx .git/objects | grep idx >actual-midx &&
|
|
test_cmp expect-idx actual-midx &&
|
|
git multi-pack-index verify &&
|
|
git fsck
|
|
)
|
|
'
|
|
|
|
test_expect_success 'repack with minimum size does not alter existing packs' '
|
|
(
|
|
cd dup &&
|
|
rm -rf .git/objects/pack &&
|
|
mv .git/objects/pack-backup .git/objects/pack &&
|
|
test-tool chmtime =-5 .git/objects/pack/pack-D* &&
|
|
test-tool chmtime =-4 .git/objects/pack/pack-C* &&
|
|
test-tool chmtime =-3 .git/objects/pack/pack-B* &&
|
|
test-tool chmtime =-2 .git/objects/pack/pack-A* &&
|
|
ls .git/objects/pack >expect &&
|
|
MINSIZE=$(test-tool path-utils file-size .git/objects/pack/*pack | sort -n | head -n 1) &&
|
|
git multi-pack-index repack --batch-size=$MINSIZE &&
|
|
ls .git/objects/pack >actual &&
|
|
test_cmp expect actual
|
|
)
|
|
'
|
|
|
|
test_expect_success 'repack respects repack.packKeptObjects=false' '
|
|
test_when_finished rm -f dup/.git/objects/pack/*keep &&
|
|
(
|
|
cd dup &&
|
|
ls .git/objects/pack/*idx >idx-list &&
|
|
test_line_count = 5 idx-list &&
|
|
ls .git/objects/pack/*.pack | sed "s/\.pack/.keep/" >keep-list &&
|
|
test_line_count = 5 keep-list &&
|
|
for keep in $(cat keep-list)
|
|
do
|
|
touch $keep || return 1
|
|
done &&
|
|
git multi-pack-index repack --batch-size=0 &&
|
|
ls .git/objects/pack/*idx >idx-list &&
|
|
test_line_count = 5 idx-list &&
|
|
test-tool read-midx .git/objects | grep idx >midx-list &&
|
|
test_line_count = 5 midx-list &&
|
|
THIRD_SMALLEST_SIZE=$(test-tool path-utils file-size .git/objects/pack/*pack | sort -n | sed -n 3p) &&
|
|
BATCH_SIZE=$((THIRD_SMALLEST_SIZE + 1)) &&
|
|
git multi-pack-index repack --batch-size=$BATCH_SIZE &&
|
|
ls .git/objects/pack/*idx >idx-list &&
|
|
test_line_count = 5 idx-list &&
|
|
test-tool read-midx .git/objects | grep idx >midx-list &&
|
|
test_line_count = 5 midx-list
|
|
)
|
|
'
|
|
|
|
test_expect_success 'repack creates a new pack' '
|
|
(
|
|
cd dup &&
|
|
ls .git/objects/pack/*idx >idx-list &&
|
|
test_line_count = 5 idx-list &&
|
|
THIRD_SMALLEST_SIZE=$(test-tool path-utils file-size .git/objects/pack/*pack | sort -n | head -n 3 | tail -n 1) &&
|
|
BATCH_SIZE=$(($THIRD_SMALLEST_SIZE + 1)) &&
|
|
git multi-pack-index repack --batch-size=$BATCH_SIZE &&
|
|
ls .git/objects/pack/*idx >idx-list &&
|
|
test_line_count = 6 idx-list &&
|
|
test-tool read-midx .git/objects | grep idx >midx-list &&
|
|
test_line_count = 6 midx-list
|
|
)
|
|
'
|
|
|
|
test_expect_success 'expire removes repacked packs' '
|
|
(
|
|
cd dup &&
|
|
ls -al .git/objects/pack/*pack &&
|
|
ls -S .git/objects/pack/*pack | head -n 4 >expect &&
|
|
git multi-pack-index expire &&
|
|
ls -S .git/objects/pack/*pack >actual &&
|
|
test_cmp expect actual &&
|
|
test-tool read-midx .git/objects | grep idx >midx-list &&
|
|
test_line_count = 4 midx-list
|
|
)
|
|
'
|
|
|
|
test_expect_success 'expire works when adding new packs' '
|
|
(
|
|
cd dup &&
|
|
git pack-objects --revs .git/objects/pack/pack-combined <<-EOF &&
|
|
refs/heads/A
|
|
^refs/heads/B
|
|
EOF
|
|
git pack-objects --revs .git/objects/pack/pack-combined <<-EOF &&
|
|
refs/heads/B
|
|
^refs/heads/C
|
|
EOF
|
|
git pack-objects --revs .git/objects/pack/pack-combined <<-EOF &&
|
|
refs/heads/C
|
|
^refs/heads/D
|
|
EOF
|
|
git multi-pack-index write &&
|
|
git pack-objects --revs .git/objects/pack/a-pack <<-EOF &&
|
|
refs/heads/D
|
|
^refs/heads/E
|
|
EOF
|
|
git multi-pack-index write &&
|
|
git pack-objects --revs .git/objects/pack/z-pack <<-EOF &&
|
|
refs/heads/E
|
|
EOF
|
|
git multi-pack-index expire &&
|
|
ls .git/objects/pack/ | grep idx >expect &&
|
|
test-tool read-midx .git/objects | grep idx >actual &&
|
|
test_cmp expect actual &&
|
|
git multi-pack-index verify
|
|
)
|
|
'
|
|
|
|
test_expect_success 'expire respects .keep files' '
|
|
(
|
|
cd dup &&
|
|
git pack-objects --revs .git/objects/pack/pack-all <<-EOF &&
|
|
refs/heads/A
|
|
EOF
|
|
git multi-pack-index write &&
|
|
PACKA=$(ls .git/objects/pack/a-pack*\.pack | sed s/\.pack\$//) &&
|
|
touch $PACKA.keep &&
|
|
git multi-pack-index expire &&
|
|
ls -S .git/objects/pack/a-pack* | grep $PACKA >a-pack-files &&
|
|
test_line_count = 3 a-pack-files &&
|
|
test-tool read-midx .git/objects | grep idx >midx-list &&
|
|
test_line_count = 2 midx-list
|
|
)
|
|
'
|
|
|
|
test_expect_success 'repack --batch-size=0 repacks everything' '
|
|
(
|
|
cd dup &&
|
|
rm .git/objects/pack/*.keep &&
|
|
ls .git/objects/pack/*idx >idx-list &&
|
|
test_line_count = 2 idx-list &&
|
|
git multi-pack-index repack --batch-size=0 &&
|
|
ls .git/objects/pack/*idx >idx-list &&
|
|
test_line_count = 3 idx-list &&
|
|
test-tool read-midx .git/objects | grep idx >midx-list &&
|
|
test_line_count = 3 midx-list &&
|
|
git multi-pack-index expire &&
|
|
ls -al .git/objects/pack/*idx >idx-list &&
|
|
test_line_count = 1 idx-list &&
|
|
git multi-pack-index repack --batch-size=0 &&
|
|
ls -al .git/objects/pack/*idx >new-idx-list &&
|
|
test_cmp idx-list new-idx-list
|
|
)
|
|
'
|
|
|
|
test_done
|