refex
checkr
refex | checkr | |
---|---|---|
3 | 3 | |
14 | 8 | |
- | - | |
5.0 | 0.0 | |
7 months ago | over 2 years ago | |
Python | JavaScript | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
refex
-
Writing and Linting Python at Scale
As someone who worked on a similar tool (https://github.com/ssbr/refex/tree/main/refex/fix/fixers, I did a bunch of the work to prep this for open-sourcing, though I think all my contributions are hidden behind the "Google-internal" anonymization), having auto-applied or auto-appliable fixers like this is super useful.
They can be auto-applied by post-commit (e.g. a generic `git fixcommit` style command that runs all the relevant lint tools and fixes them in the working copy, letting you review before push), or applied during code review (automatic comments with a "click here to apply fix" interface), both of which are nice.
Plus the same underlying tooling can be used to write more complex one-off fixes that may be used for migrations or cleanups.
-
I learnt to use ASTs to patch 100,000s lines of python code
You might like Refex, which automates AST transformations like this:
-
Semgrep: Like Grep but for Code
There's lots of confusion about what semgrep does here, which is kind of unfortunate. I haven't touched it much, but I have built a very similar tool (I'm one of the contributors to refex[1], which is a very similar project).
The starting point of semantic grep is very useful. When you have a big codebase, you often want to detect antipatterns, or not even antipatterns, but just uses of a thing, say you're renaming a method and want to track down the callers.
Being able to act on the AST, instead of hoping you searched up all of the variants of whitespace and line breaks and, depending on the specific example, different uses of argument passing, is really useful.
But often when you're semantically grepping, your goal is to replace something with something else (this is what refex was initially built for: to aide in large scale changes in python, as a sort of equivalent to the C++ tools that Google uses).
But then you want to shift left even further: once you have a pattern that you want to replace once, you can just enforce that a linter yell at you when anyone does it again. So it's very natural to develop a linter-style thing on top of one of these[2].
This is, as I understand it sort of the same thing that happens in C++: clang-tidy and clang-format are written on top of AST libraries that can be used for ad-hoc analysis and transformations, but you can also just plug them into a linter.
The thing is, for most organizations, enforcing code style and best practices is more valuable than apply a refactoring to 10M lines of code, because most organizations don't have 10M lines of code to refactor. That doesn't mean that these tools aren't also useful for ad-hoc transforms and exploratory analysis. They absolutely are!
[1]: https://github.com/ssbr/refex
[2]: https://github.com/ssbr/refex/tree/main/refex/fix
checkr
- Study: Developers spend almost 2 days a week just waiting for other developers to review their code
-
Semgrep: Like Grep but for Code
I wrote a small VS code extension and pre-commit hook that might meet 80% of your needs:
https://github.com/elanning/checkr
It is just simple regex at this time, but hopefully I can add something like CCGrep syntax in the future:
https://github.com/yuy-m/CCGrep
-
I wrote a tool that makes writing regex easier
It looks like the checkr extension could greatly benefit from something like this. Very nice work! Excited to try it out when I have some free time.
What are some alternatives?
pre-commit - A framework for managing and maintaining multi-language pre-commit hooks.
tree-sitter-swift - A tree-sitter grammar for the Swift programming language.
tree-sitter-swift - Swift grammar for tree-sitter
ComposableRegex - Build out composable regular expressions from simple sub blocks in a BNF type syntax. Check http://composableregex.apphb.com/ for a demo
ocaml-tree-sitter-semgrep - Generate parsers from tree-sitter grammars extended to support Semgrep patterns
human2regex - Humanized Regex
CCGrep - Code Clone Detector like grep
semgrep - Lightweight static analysis for many languages. Find bug variants with patterns that look like source code.
Bear - Bear is a tool that generates a compilation database for clang tooling.
hyperlight - Automatically exported from code.google.com/p/hyperlight
actions-preview.nvim - Fully customizable previewer for LSP code actions.