rure-go
hyperscan
rure-go | hyperscan | |
---|---|---|
2 | 25 | |
247 | 4,650 | |
- | 0.7% | |
3.0 | 0.0 | |
5 months ago | 5 days ago | |
Go | C++ | |
The Unlicense | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
rure-go
-
Best regexp alternative for Go. Benchmarks. Plots.
rure-go (regex 1.9.3) - uses the Rust regex engine with CGo bindings. The downside is a Rust library dependency that needs to be compiled;
-
Go performance from version 1.2 to 1.18
Totally agreed: almost all users (me/GoAWK included) want performance and don't care nearly as much about simplicity under the hood. Simplicity of implementation is of value for educational purposes, but we could easily have a small, simple 3rd party package for that. Go's regexp package is kinda too complex for a simple educational demonstration and too simple to be fast. :-)
I actually tried BurntSushi's https://github.com/BurntSushi/rure-go (bindings to Rust's regex engine) with GoAWK and it made regex handling 4-5x as fast for many regexes, despite the CGo overhead. However, rure-go (and CGo in general) is a bit painful to build, so I'm not going to use that. Maybe I'll create a branch for speed freaks who want it.
I've also thought of using https://gitlab.com/cznic/ccgo to convert Mawk's fast regex engine to Go source and see how that performs. Maybe on the next rainy day...
hyperscan
-
Ask HN: Regex on a File or Stream
Maybe some other PCRE-compatible implementation offers streaming. For instance, https://www.intel.com/content/www/us/en/developer/articles/t... says it has this feature, but of course given who it's from it may be tied to a single brand of CPU.
github seems to be https://github.com/intel/hyperscan
- Aho-Corasick Algorithm
-
Stop deploying web application firewalls
I think of WAFs as an extra safety net. Defense in depth.
The author complained about the performance cost of WAFs in general, but not all WAFs have be structured like ModSecurity. They could for example be based on something like https://github.com/intel/hyperscan and perf is at a very different level.
-
Be careful of the examples you use. They stick
Another example of old timey Unix code just breaking things in minor point releases. See https://abi-laboratory.pro/index.php?view=changelog&l=glibc&... and https://github.com/intel/hyperscan/issues/359.
- hypergrep: A new "fastest grep" to search directories recursively for a regex pattern
- Accelerating Regular Expressions with AVX-512 at 1.5 GB/s/core
-
GitHub push protection is free for all public repositories
It’s a bespoke scanning setup designed to deal with GitHub’s scale. Under the hood it’s using Intel’s hyperscan as the regex engine.
https://github.com/intel/hyperscan
-
RE2 VS hyperscan - a user suggested alternative
2 projects | 17 Mar 2023
Hyperscan is an Intel regular expression library.
-
hyperscan VS RE2 - a user suggested alternative
2 projects | 17 Mar 2023
-
Show HN: Unblob – extraction suite for 30+ file formats
We are using hyperscan [3] instead of grepping byte sequences with Python, which is orders of magnitudes faster. It can also handle 4Gb+ files because of this which binwalk cannot.
It's used for a year now in production and it's way more precise and faster than binwalk. We are getting less false-positives too, and even if unblob fails to extract everything, we still get meaningful information out of firmwares, where binwalk just failed with no output previously.
[1]: https://github.com/onekey-sec/unblob/blob/main/unblob/handle...
[2]: https://github.com/onekey-sec/unblob/blob/main/unblob/proces...
[3]: https://github.com/intel/hyperscan
What are some alternatives?
ccgo
regex-benchmark - It's just a simple regex benchmark of different programming languages.
go - The Go programming language
gohs - GoLang Binding of HyperScan https://www.hyperscan.io/
lunatic - Lunatic is an Erlang-inspired runtime for WebAssembly
Rusoto - AWS SDK for Rust
esqueleto - Bare bones, type-safe EDSL for SQL queries on persistent backends.
sqlx - 🧰 The Rust SQL Toolkit. An async, pure Rust SQL crate featuring compile-time checked queries without a DSL. Supports PostgreSQL, MySQL, and SQLite.
Sqinn-Go - Golang SQLite without cgo
RE2 - RE2 is a fast, safe, thread-friendly alternative to backtracking regular expression engines like those used in PCRE, Perl, and Python. It is a C++ library.