Git vs gh-ost

Compare Git and gh-ost and see how they differ.

Git

Git Source Code Mirror - This is a publish-only repository but pull requests can be turned into patches to the mailing list via GitGitGadget (https://gitgitgadget.github.io/). Please follow Documentation/SubmittingPatches procedure for any of your improvements. (by git)

gh-ost

GitHub's Online Schema-migration Tool for MySQL (by github)
              Git                                         gh-ost
Mentions      287                                         32
Stars         50,099                                      12,010
Growth        1.6%                                        0.6%
Activity      10.0                                        7.5
Last commit   1 day ago                                   4 days ago
Language      C                                           Go
License       GNU General Public License v3.0 or later    MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

Git

Posts with mentions or reviews of Git. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-02-13.
  • Git tracks itself. See its first commit of itself
    1 project | news.ycombinator.com | 3 May 2024
  • Resistance against London tube map commit history (a.k.a. git merge hell) (2015)
    1 project | news.ycombinator.com | 2 May 2024
    Look at any PR/patch series that got merged into the Git project. https://github.com/git/git/

    Any random one. Because those that did not meet the minimum criteria for a well-crafted history would not have passed review.

  • GitHub Git Mirror Down
    1 project | news.ycombinator.com | 11 Apr 2024
  • Four ways to solve the "Remote Origin Already Exists" error.
    1 project | dev.to | 28 Mar 2024
  • So You Think You Know Git – Git Tips and Tricks by Scott Chacon
    6 projects | news.ycombinator.com | 13 Feb 2024
    Boy, I can't find this either (but also, the kernel mailing list is _really_ difficult to search). I really remember Linus saying something like "it's not a real SCM, but maybe someone could build one on top of it someday" or something like that, but I cannot figure out how to find that.

    You _can_ see, though, that in his first README, he refers to what he's building as not a "real SCM":

    https://github.com/git/git/commit/e83c5163316f89bfbde7d9ab23...

  • Maintain-Git.txt
    1 project | news.ycombinator.com | 6 Feb 2024
  • Git Commit Messages by Jeff King
    2 projects | news.ycombinator.com | 1 Feb 2024
    Here is the direct link, as HN somehow removes the query string: https://github.com/git/git/commits?author=peff&since=2023-10...
  • Git commit messages by Jeff King
    1 project | news.ycombinator.com | 1 Feb 2024
  • My favourite Git commit (2019)
    8 projects | news.ycombinator.com | 1 Feb 2024
  • Do we think of Git commits as diffs, snapshots, and/or histories?
    1 project | news.ycombinator.com | 6 Jan 2024
    I understand all that.

    I'm saying, if you write a survey and one of the possible answers is "diff", but you don't clearly define what you mean by "diff", then don't be surprised if respondents use any reasonable definition that makes sense to them. Ask an ambiguous question, get a mishmash of answers.

    The thing that Git uses for packfiles is called a "delta" by Git, but it's also reasonable to call it a "diff". After all, Git's delta algorithm is "greatly inspired by parts of LibXDiff from Davide Libenzi"[1]. Not LibXDelta but LibXDiff.

    Yes, how Git stores blobs (using deltas) is orthogonal to how Git uses blobs. But while that orthogonality is useful for reasoning about Git, it's not wrong to think of a commit as the totality of what Git does, including that optimization. (Some people, when learning Git, stumble over the way it's described as storing full copies and think it's wasteful. For them to wrap their heads around Git, they have to understand that the optimization exists. Which makes sense because Git probably wouldn't be practical if it lacked that optimization.)

    The reason I'm bringing all this up is, if you're trying to explain Git, which is what the original article is about, then it's very important to keep in mind that someone who is learning Git needs to know what you mean when you say "diff". Most people who already know Git would tend to gravitate toward the definition of "diff" that you're assuming (the thing that Git computes on the fly and never stores), but people who already know Git aren't the target audience when you're teaching Git.

    ---

    [1] https://github.com/git/git/blob/master/diff-delta.c
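
The last post above separates three things: the snapshot a commit records, the "diff" computed on the fly, and the "delta" used inside packfiles. A minimal Python sketch of the first two follows. It assumes the SHA-1 object format, where a blob's ID is the hash of a short header plus the content; `ToyStore` and its methods are hypothetical names for illustration, not Git's API, and packfile delta compression is deliberately not modeled.

```python
import difflib
import hashlib


def blob_id(data: bytes) -> str:
    """ID of a blob under Git's SHA-1 object format: sha1("blob <size>\\0" + content)."""
    header = b"blob %d\0" % len(data)
    return hashlib.sha1(header + data).hexdigest()


class ToyStore:
    """Hypothetical content-addressed store: every version is kept as a full snapshot."""

    def __init__(self):
        self.objects = {}

    def put(self, data: bytes) -> str:
        # Identical content always maps to the same ID, so it is stored once.
        oid = blob_id(data)
        self.objects[oid] = data
        return oid

    def diff(self, old_oid: str, new_oid: str):
        # The diff is derived from two snapshots on demand and never stored.
        old = self.objects[old_oid].decode().splitlines()
        new = self.objects[new_oid].decode().splitlines()
        return list(difflib.unified_diff(old, new, lineterm=""))


if __name__ == "__main__":
    store = ToyStore()
    v1 = store.put(b"a\nb\n")
    v2 = store.put(b"a\nc\n")
    print(v1, v2)
    print("\n".join(store.diff(v1, v2)))
```

Here the store only ever holds whole files; the line-level changes appear only when `diff` is called, which is the sense of "diff" the post says experienced users tend to assume.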

gh-ost

Posts with mentions or reviews of gh-ost. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-08.
  • "At GitHub we do not use foreign keys, ever, anywhere"
    1 project | news.ycombinator.com | 22 Jan 2024
  • How Modern SQL Databases Are Changing Web Development - #3 Better Developer Experience
    4 projects | dev.to | 8 Dec 2023
    I’ve been through multiple incidents where everything worked fine in the testing environment but ended up locking the production database for minutes when deployed. A category of open-source tools called OSC (Online Schema Change) exists to mitigate such pain, like gh-ost used by GitHub and OSC used by Meta. They work by creating a set of "ghost tables" to apply the migrations, copy over old data from the original tables, and catch up with new writes simultaneously. When all old data is migrated, you can trigger a cutover to make the "ghost tables" production. Check the post below for a great introduction and comparison:
  • We migrated to SQL. Our biggest learning? Don't use Prisma
    11 projects | news.ycombinator.com | 9 Oct 2023
    Sounds like it's basically explained in the gh-ost readme https://github.com/github/gh-ost#how

    I think it amounts to "use views to decouple access to the table with a fixed interface" and "use triggers for migrating data between tables"

  • Ask HN: Is PostgreSQL better than MySQL?
    2 projects | news.ycombinator.com | 17 Apr 2023
    Gh-ost is the new hotness. Simple to use and lots of great features: https://github.com/github/gh-ost
  • My Green/blue AWS db deployment strategy for avoiding data loss due to table locks
    1 project | /r/devops | 21 Mar 2023
    If the performance of the db is a concern during migrations (locking, high CPU consumption for large writes), there are tools that can help and do something similar to what you're describing, with the benefit that they are battle-tested. gh-ost springs to mind (https://github.com/github/gh-ost); there are other options as well, and it's worth reading the trade-off docs.
  • Changing column from longtext to mediumtext taking over 2 hours
    3 projects | /r/mysql | 4 Nov 2022
    Not sure which version of MySQL you're using, but one approach would be to use a tool like pt-online-schema-change (from Percona) or gh-ost, which will create a duplicate table and then swap it in place of the original table. It's a safer approach when operating in production environments. Here's a good comparison of the tools many people use: https://planetscale.com/docs/learn/online-schema-change-tools-comparison
  • Ask HN: Do you use foreign Keys in Relational Databases
    1 project | news.ycombinator.com | 6 Sep 2022
    No, especially on large tables with billions of records. They make online schema changes impossible. More details: https://github.com/github/gh-ost/issues/331#issuecomment-266...
  • Migrating a production database without any downtime
    1 project | dev.to | 13 Aug 2022
    Tip #4: Consider slow-running migrations. Some tables can be so large that the traditional migration way is simply not a viable option for them. In such cases, you can consider embedding the data migration code right into your application, or use a special utility like GitHub's online schema migration for MySQL. A slow-running migration can work in production for days or even weeks. It gradually converts the data by small chunks, so you can carefully balance the load on the database while making sure that it doesn't cause slowness or downtime.
  • How do you handle RDS schema migrations?
    1 project | /r/aws | 27 May 2022
    GitHub gh-ost
  • Changing Tires at 100mph: A Guide to Zero Downtime Migrations
    9 projects | news.ycombinator.com | 4 May 2022
    Actually, I never tried it, but I was scared by the small print about GH not using RDS themselves [1] and gh-ost relying on lower-level features that might not be easily available in RDS. Also, I had the impression you have to set up a normal non-RDS replica attached to your RDS master?

    [1] https://github.com/github/gh-ost/blob/master/doc/rds.md
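
Several posts above describe the same pattern: build a "ghost" table with the new schema, copy old rows over in small chunks, keep up with writes that arrive in the meantime, then cut over. The sketch below is a toy version of that flow in Python. It is not how gh-ost is implemented: it assumes SQLite instead of MySQL, a hypothetical `users` table with positive integer IDs, illustrative table names (`_users_gho`, `_users_del`), and a plain list standing in for live write traffic.

```python
import sqlite3


def migrate_with_ghost_table(conn, chunk_size=2, writes_during_copy=()):
    """Toy online schema change on a hypothetical `users` table.

    Adds an `email` column by building a ghost table, copying rows in
    chunks, replaying simulated concurrent writes, then swapping names.
    """
    cur = conn.cursor()
    # 1. Create the ghost table with the new schema.
    cur.execute(
        "CREATE TABLE _users_gho (id INTEGER PRIMARY KEY, name TEXT, email TEXT)"
    )
    upsert_users = "INSERT OR REPLACE INTO users (id, name) VALUES (?, ?)"
    upsert_ghost = "INSERT OR REPLACE INTO _users_gho (id, name) VALUES (?, ?)"
    pending = list(writes_during_copy)  # stand-in for live traffic
    last_id = 0
    while True:
        # 2. Copy the next chunk of old rows, walking the primary key.
        rows = cur.execute(
            "SELECT id, name FROM users WHERE id > ? ORDER BY id LIMIT ?",
            (last_id, chunk_size),
        ).fetchall()
        if not rows:
            break
        cur.executemany(upsert_ghost, rows)
        last_id = rows[-1][0]
        # 3. Between chunks, one simulated write lands on the original
        #    table and is replayed onto the ghost table.
        if pending:
            write = pending.pop(0)
            cur.execute(upsert_users, write)
            cur.execute(upsert_ghost, write)
    # Replay any simulated writes left over after the last chunk.
    for write in pending:
        cur.execute(upsert_users, write)
        cur.execute(upsert_ghost, write)
    # 4. Cutover: the ghost table takes over the original name.
    cur.execute("ALTER TABLE users RENAME TO _users_del")
    cur.execute("ALTER TABLE _users_gho RENAME TO users")
    conn.commit()


def demo():
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")
    conn.executemany(
        "INSERT INTO users (id, name) VALUES (?, ?)",
        [(1, "ann"), (2, "bob"), (3, "cy"), (4, "dee"), (5, "eve")],
    )
    # One update to an already-copied row and one brand-new row arrive mid-copy.
    migrate_with_ghost_table(conn, writes_during_copy=[(2, "bob2"), (6, "frank")])
    return conn.execute("SELECT id, name, email FROM users ORDER BY id").fetchall()


if __name__ == "__main__":
    for row in demo():
        print(row)
```

The small `chunk_size` is what lets a real tool balance load on a busy database; a production tool also has to capture concurrent writes reliably and make the final swap atomic, which this sketch only gestures at.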

What are some alternatives?

When comparing Git and gh-ost you can also consider the following projects:

scalar - Scalar: A set of tools and extensions for Git to allow very large monorepos to run on Git without a virtualization layer

pg-online-schema-change - Easy CLI tool for making zero downtime schema changes and backfills in PostgreSQL [Moved to: https://github.com/shayonj/pg-osc]

PineappleCAS - A generic computer algebra system targeted for the TI-84+ CE calculators

doctrine-test-bundle - Symfony bundle to isolate your app's doctrine database tests and improve the test performance

Subversion - Mirror of Apache Subversion

squawk - 🐘 linter for PostgreSQL, focused on migrations

vscode-gitlens - Supercharge Git inside VS Code and unlock untapped knowledge within each repository — Visualize code authorship at a glance via Git blame annotations and CodeLens, seamlessly navigate and explore Git repositories, gain valuable insights via rich visualizations and powerful comparison commands, and so much more

pg_squeeze - A PostgreSQL extension for automatic bloat cleanup

linux - Linux kernel source tree

hub - A command-line tool that makes git easier to use with GitHub.

chromebrew - Package manager for Chrome OS [Moved to: https://github.com/chromebrew/chromebrew]

Jenkins - Jenkins automation server