-
I experimented earlier to push the limits of SQLite inserts and wrote a blog post[0] about it. We can apply some of the learnings here.
I reviewed the OP's code and did some benchmarks; SQLite is not the bottleneck here. The code first generates the commit info from the git log, prints that to stdin [1] and the python script reads from it one by one in a loop [2]. Each of the commit info is written to SQLite. So, with or without WAL, the time is almost the same.
I believe the git log will be fast anyway, so other ways to make it faster would be to read a bunch of commits at once and then do batch inserts. We can also make it run in parallel since each commit info is independent, and we don't need to care about ordering while inserting.
[0] - https://avi.im/blag/2021/fast-sqlite-inserts/
[1] - https://github.com/jmforsythe/Git-Heat-Map/blob/bd9bc22/git-...
[2] - https://github.com/jmforsythe/Git-Heat-Map/blob/bd9bc22/git-...
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
If you think this is useful, you might also like codeatlas.dev and its Github Action (https://github.com/codeatlasHQ/codebase-visualizer-action). It currently does not support per-contributor activity, but we put a lot of effort into making the diagrams beautiful to look at and the basic approach of using treemaps for visualisation seems very similar. In fact, could be cool to collaborate on this, DM me if interested!
https://codeatlas.dev
-
Don't have time to install but I would pay $10 in NEAR coin if you can email or post the results of my repos to me ([email protected]):
https://github.com/breck7/jtree and https://github.com/breck7/pldb
-
Don't have time to install but I would pay $10 in NEAR coin if you can email or post the results of my repos to me ([email protected]):
https://github.com/breck7/jtree and https://github.com/breck7/pldb
-
This is cool. The visualization reminds me of the visualization of QDirStat/WinDirStat.
Also shout out to the (mostly useless but cool looking) git history visualizer, Gource[0].
https://gource.io/
-
Looks useful on big repositories!
https://github.com/nixos/nixpkgs/ would be a great benchmark for a tool like this :) One of the larger repos on github, close to half a million commits by a large set of contributors to thousands of files.