Git VS git-filter-repo

Compare Git vs git-filter-repo and see what are their differences.

Git

Git Source Code Mirror - This is a publish-only repository but pull requests can be turned into patches to the mailing list via GitGitGadget (https://gitgitgadget.github.io/). Please follow Documentation/SubmittingPatches procedure for any of your improvements. (by git)

git-filter-repo

Quickly rewrite git repository history (filter-branch replacement) (by newren)
Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
Git git-filter-repo
285 50
49,964 7,411
2.0% -
10.0 1.0
5 days ago 2 months ago
C Python
GNU General Public License v3.0 or later GNU General Public License v3.0 or later
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

Git

Posts with mentions or reviews of Git. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-02-13.
  • GitHub Git Mirror Down
    1 project | news.ycombinator.com | 11 Apr 2024
  • Four ways to solve the "Remote Origin Already Exists" error.
    1 project | dev.to | 28 Mar 2024
  • So You Think You Know Git – Git Tips and Tricks by Scott Chacon
    6 projects | news.ycombinator.com | 13 Feb 2024
    Boy, I can't find this either (but also, the kernel mailing list is _really_ difficult to search). I really remember Linus saying something like "it's not a real SCM, but maybe someone could build one on top of it someday" or something like that, but I cannot figure out how to find that.

    You _can_ see, though, that in his first README, he refers to what he's building as not a "real SCM":

    https://github.com/git/git/commit/e83c5163316f89bfbde7d9ab23...

  • Maintain-Git.txt
    1 project | news.ycombinator.com | 6 Feb 2024
  • Git Commit Messages by Jeff King
    2 projects | news.ycombinator.com | 1 Feb 2024
    Here is the direct link, as HN somehow removes the query string: https://github.com/git/git/commits?author=peff&since=2023-10...
  • Git commit messages by Jeff King
    1 project | news.ycombinator.com | 1 Feb 2024
  • My favourite Git commit (2019)
    8 projects | news.ycombinator.com | 1 Feb 2024
  • Do we think of Git commits as diffs, snapshots, and/or histories?
    1 project | news.ycombinator.com | 6 Jan 2024
    I understand all that.

    I'm saying, if you write a survey and one of the possible answers is "diff", but you don't clearly define what you mean by "diff", then don't be surprised if respondents use any reasonable definition that makes sense to them. Ask an ambiguous question, get a mishmash of answers.

    The thing that Git uses for packfiles is called a "delta" by Git, but it's also reasonable to call it a "diff". After all, Git's delta algorithm is "greatly inspired by parts of LibXDiff from Davide Libenzi"[1]. Not LibXDelta but LibXDiff.

    Yes, how Git stores blobs (using deltas) is orthogonal to how Git uses blobs. But while that orthogonality is useful for reasoning about Git, it's not wrong to think of a commit as the totality of what Git does, including that optimization. (Some people, when learning Git, stumble over the way it's described as storing full copies, think it's wasteful. For them to wrap their heads around Git, they have to understand that the optimization exists. Which makes sense because Git probably wouldn't be practical if it lacked that optimization.)

    The reason I'm bringing all this up is, if you're trying to explain Git, which is what the original article is about, then it's very important to keep in mind that someone who is learning Git needs to know what you mean when you say "diff". Most people who already know Git would tend to gravitate toward the definition of "diff" that you're assuming (the thing that Git computes on the fly and never stores), but people who already know Git aren't the target audience when you're teaching Git.

    ---

    [1] https://github.com/git/git/blob/master/diff-delta.c

  • The State of Merging Technology
    1 project | news.ycombinator.com | 13 Dec 2023
    Didn't Git have a new default merge strategy, `ort` https://github.com/git/git/blob/master/Documentation/RelNote... ?
  • The bash book to rule them all
    5 projects | news.ycombinator.com | 25 Nov 2023
    Yes, but you are referring to standalone scripts, not functions defined within a Bash script.

    Compare for example the following helper code used for git command completion inside Bash and inside PowerShell.

    Bash: https://github.com/git/git/blob/master/contrib/completion/gi...

git-filter-repo

Posts with mentions or reviews of git-filter-repo. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-19.
  • Cleaning Your Git History: Safely Removing Sensitive Data
    2 projects | dev.to | 19 Dec 2023
    **WARNING**: git-filter-branch has a glut of gotchas generating mangled history rewrites. Hit Ctrl-C before proceeding to abort, then use an alternative filtering tool such as 'git filter-repo' (https://github.com/newren/git-filter-repo/) instead. See the filter-branch manual page for more details; to squelch this warning, set FILTER_BRANCH_SQUELCH_WARNING=1. Proceeding with filter-branch... Rewrite a3a48b09e282854c80bf4ad02a017e249e161fd8 (2/8) (0 seconds passed, remaining 0 predicted) rm 'config.js' Rewrite 6e788e83a338e45b348d93d682b32c816ee2fbff (3/8) (0 seconds passed, remaining 0 predicted) rm 'config.js' Rewrite 7a378a0145bce70bea213ca5f9062138544db5f2 (4/8) (0 seconds passed, remaining 0 predicted) rm 'config.js' Rewrite 0637c9659623644cfceb35be10f2a1fe5c468e04 (5/8) (0 seconds passed, remaining 0 predicted) rm 'config.js' Rewrite 6c421eb99adc6b987cff7f3cada31e9313638072 (6/8) (0 seconds passed, remaining 0 predicted) rm 'config.js' Rewrite 98001e5b97270efa4a8ab5bd0452be56dd76883d (7/8) (0 seconds passed, remaining 0 predicted) rm 'config.js' Rewrite 2ca4e161a4af2b8f38c46faf848fdbb3e550f23c (8/8) (0 seconds passed, remaining 0 predicted) rm 'config.js' Ref 'refs/heads/secret_keys' was rewritten.
  • (RE not sharing inputs) PSA: "deleting" and committing to git doesn't actually remove it
    2 projects | /r/adventofcode | 9 Dec 2023
    Yup you need https://github.com/newren/git-filter-repo Take a look at https://github.com/newren/git-filter-repo/blob/main/INSTALL.md for instructions
  • How I teach Git
    1 project | dev.to | 26 Nov 2023
    git filter-repo: a third-party command actually, as a replacement to Git's own filter-branch, that allows rewriting the whole history of a repository to remove a mistakenly added file, or help extract part of the repository to another.
  • Merging old repos into a monolithic git repo archive
    1 project | dev.to | 12 Jul 2023
    I needed to archive some old repositories into a monorepo and of course I gave myself the requirement of maintaining git history, in some way. I tried a couple of solutions but it wasn't until I stumbled upon the git-filter-repo project at https://github.com/newren/git-filter-repo and another article which I've since lost (which was badly documented anyway) that I was able to figure out how to do this.
  • Mass edit of .git/objects
    1 project | /r/git | 20 May 2023
    Git objects are not designed to be changed, they are immutable blobs. This is not a problem if you are making a reader, but is a problem when you want to change things, tools like old git-filter-branch or the newer filter repo abstract all reference updating away for you
  • Question about Git LFS
    1 project | /r/gamedev | 4 May 2023
    Make sure your gitignore is setup right (GitHub has a repo of good defaults). If you messed that up, you could rewrite git history to remove the big stuff. Use git-filter-repo. Not sure how that works for LFS.
  • How to open source code from a private monorepo
    3 projects | dev.to | 3 May 2023
    git-filter-repo
  • How to Push Files Over 100MB to GitHub: A Step-by-Step Guide with Git Large File Storage (LFS)
    2 projects | /r/github | 21 Apr 2023
    Check out git filter repo https://github.com/newren/git-filter-repo
  • Large initial push.
    2 projects | /r/git | 6 Apr 2023
    I personally prefer git-bfg ( https://rtyley.github.io/bfg-repo-cleaner/ ) ... though git-filter-repo ( https://github.com/newren/git-filter-repo ) is quite popular. The difference for me was that git-bfg is JVM based and my work machine has Java on it while git-filter-repo is python based... and my work machine is without python.
  • Is there a way to scrub certain info from a repo's history? I wanna make a repo public, but at one point I stored my API client credentials in the code. Presumably that makes it technically unsafe to ever share that repo. What to do?
    2 projects | /r/learnprogramming | 18 Feb 2023

What are some alternatives?

When comparing Git and git-filter-repo you can also consider the following projects:

scalar - Scalar: A set of tools and extensions for Git to allow very large monorepos to run on Git without a virtualization layer

bfg-repo-cleaner - Removes large or troublesome blobs like git-filter-branch does, but faster. And written in Scala

PineappleCAS - A generic computer algebra system targeted for the TI-84+ CE calculators

trufflehog - Find and verify credentials

Subversion - Mirror of Apache Subversion

josh - Just One Single History

vscode-gitlens - Supercharge Git inside VS Code and unlock untapped knowledge within each repository — Visualize code authorship at a glance via Git blame annotations and CodeLens, seamlessly navigate and explore Git repositories, gain valuable insights via rich visualizations and powerful comparison commands, and so much more

gh-action-pypi-publish - The blessed :octocat: GitHub Action, for publishing your :package: distribution files to PyPI: https://github.com/marketplace/actions/pypi-publish

linux - Linux kernel source tree

roadmap - GitHub public roadmap

chromebrew - Package manager for Chrome OS [Moved to: https://github.com/chromebrew/chromebrew]

git-internals-pdf - PDF on Git Internals