How much space in Git does a git-annex file use?

The experiment

  1. Make an empty git-annex repository
  2. Gather disk usage and other repository size information after each change
  3. Add small files in batches of 1000
  4. Copy to a special remote
  5. Draw graphs
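
The steps above can be sketched roughly as follows. This is a minimal sketch, not the actual scripts: the function names, the file naming, the remote name "demo" and the choice of a directory special remote are all assumptions made for illustration. It also assumes git and git-annex are installed and that a git user name and email are configured.

```shell
#!/bin/sh
# Hypothetical sketch of the experiment loop; every name here is an assumption.

# Print "<label> <KiB used by .git> <number of files under .git>"
# for the repository at path $1.
measure() {
    m_repo=$1
    m_label=$2
    m_kib=$(du -sk "$m_repo/.git" | cut -f1)
    m_files=$(find "$m_repo/.git" -type f | wc -l | tr -d ' ')
    printf '%s %s %s\n' "$m_label" "$m_kib" "$m_files"
}

# Create $2 small files named batch-$1-N in the current directory.
make_batch() {
    mb_batch=$1
    mb_count=$2
    mb_i=1
    while [ "$mb_i" -le "$mb_count" ]; do
        printf 'file %s of batch %s\n' "$mb_i" "$mb_batch" > "batch-$mb_batch-$mb_i"
        mb_i=$((mb_i + 1))
    done
}

# Run the whole experiment: $1 = repository path,
# $2 = ABSOLUTE path for the directory special remote, $3 = number of batches.
run_experiment() {
    re_repo=$1
    re_remote=$2
    re_batches=$3
    mkdir -p "$re_repo" "$re_remote"
    cd "$re_repo" || return 1
    git init -q .
    git annex init "size-test"
    measure . "init"
    re_b=1
    while [ "$re_b" -le "$re_batches" ]; do
        make_batch "$re_b" 1000
        git annex add .
        git commit -q -m "add batch $re_b"
        measure . "batch-$re_b"
        re_b=$((re_b + 1))
    done
    # A directory special remote is assumed here as the simplest kind to set up.
    git annex initremote demo type=directory directory="$re_remote" encryption=none
    git annex copy --to demo .
    measure . "copied"
}
```

Something like `run_experiment /tmp/annex-test /tmp/annex-remote 10 > sizes.txt` would then leave one line per measurement, ready for whatever draws the graphs.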

What does it look like?

Configuration to store files efficiently?

I haven’t tried any annex.largefiles settings yet, or worked out what sort of experiment would test them.
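
For reference, a setting of this kind might look like the .gitattributes fragment below. It is untested in this experiment, and the 100kb threshold is an arbitrary assumption chosen only to show the shape of the expression: with it, `git annex add` would put files larger than the threshold in the annex and add smaller ones to git directly.

```
* annex.largefiles=(largerthan=100kb)
```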

Are the simplifications realistic?

Other caveats

How did you do it?

Or “Can I repeat the experiment?”

I did it with some grubby shell scripts and Perl, and filled in the gaps with one-liners pasted from the documentation.

Here is the git bundle of it.