python-hyperscan
hyperscan
Metric | python-hyperscan | hyperscan
---|---|---
Mentions | 1 | 25
Stars | 155 | 4,637
Growth | - | 0.7%
Activity | 8.8 | 2.0
Last commit | about 2 months ago | 5 months ago
Language | C | C++
License | MIT License | GNU General Public License v3.0 or later
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
python-hyperscan
-
Intel Hyperscan is a high-performance multiple regex matching library
PSA that the python bindings for hyperscan need maintainers: https://github.com/darvid/python-hyperscan/issues/44
hyperscan
-
Ask HN: Regex on a File or Stream
Maybe some other PCRE-compatible implementation offers streaming. For instance, https://www.intel.com/content/www/us/en/developer/articles/t... says it has this feature, but of course given who it's from it may be tied to a single brand of CPU.
The GitHub repository seems to be https://github.com/intel/hyperscan
- Aho-Corasick Algorithm
-
Stop deploying web application firewalls
I think of WAFs as an extra safety net. Defense in depth.
The author complained about the performance cost of WAFs in general, but not all WAFs have to be structured like ModSecurity. They could, for example, be based on something like https://github.com/intel/hyperscan, where performance is at a very different level.
-
Be careful of the examples you use. They stick
Another example of old timey Unix code just breaking things in minor point releases. See https://abi-laboratory.pro/index.php?view=changelog&l=glibc&... and https://github.com/intel/hyperscan/issues/359.
- hypergrep: A new "fastest grep" to search directories recursively for a regex pattern
- Accelerating Regular Expressions with AVX-512 at 1.5 GB/s/core
-
GitHub push protection is free for all public repositories
It’s a bespoke scanning setup designed to deal with GitHub’s scale. Under the hood it’s using Intel’s hyperscan as the regex engine.
https://github.com/intel/hyperscan
-
RE2 VS hyperscan - a user suggested alternative
2 projects | 17 Mar 2023
Hyperscan is an Intel regular expression library.
-
hyperscan VS RE2 - a user suggested alternative
2 projects | 17 Mar 2023
-
Show HN: Unblob – extraction suite for 30+ file formats
We are using hyperscan [3] instead of grepping byte sequences with Python, which is orders of magnitude faster. Because of this, it can also handle 4GB+ files, which binwalk cannot.
It has been used in production for a year now, and it's far more precise and faster than binwalk. We are getting fewer false positives too, and even if unblob fails to extract everything, we still get meaningful information out of firmware images where binwalk previously just failed with no output.
[1]: https://github.com/onekey-sec/unblob/blob/main/unblob/handle...
[2]: https://github.com/onekey-sec/unblob/blob/main/unblob/proces...
[3]: https://github.com/intel/hyperscan
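Several of the posts above touch on the same idea: matching many patterns in a single pass, over data that arrives as a stream. The sketch below illustrates that idea in pure Python with an Aho-Corasick automaton (the algorithm named in one of the posts). It is not Hyperscan and does not use the python-hyperscan API. It handles literal byte strings only, not regular expressions, and the class name `StreamMatcher` and its `feed` method are hypothetical names chosen for this example.

```python
from collections import deque


class StreamMatcher:
    """Illustrative Aho-Corasick automaton over bytes (hypothetical helper).

    Matcher state is kept between calls to feed(), so a pattern that
    straddles two chunks is still reported.
    """

    def __init__(self, patterns):
        self.patterns = list(patterns)
        # Trie stored as parallel lists: transitions, failure links, outputs.
        self.goto = [{}]
        self.fail = [0]
        self.out = [[]]
        for idx, pat in enumerate(self.patterns):
            node = 0
            for byte in pat:
                nxt = self.goto[node].get(byte)
                if nxt is None:
                    nxt = len(self.goto)
                    self.goto[node][byte] = nxt
                    self.goto.append({})
                    self.fail.append(0)
                    self.out.append([])
                node = nxt
            self.out[node].append(idx)
        # Breadth-first pass: compute failure links and merge outputs.
        queue = deque(self.goto[0].values())
        while queue:
            node = queue.popleft()
            for byte, child in self.goto[node].items():
                f = self.fail[node]
                while f and byte not in self.goto[f]:
                    f = self.fail[f]
                self.fail[child] = self.goto[f].get(byte, 0)
                self.out[child] = self.out[child] + self.out[self.fail[child]]
                queue.append(child)
        self.state = 0
        self.offset = 0  # total bytes consumed across all chunks

    def feed(self, chunk):
        """Yield (pattern_index, end_offset) for each match ending in chunk.

        This is a generator: the chunk is only consumed as it is iterated.
        """
        for byte in chunk:
            while self.state and byte not in self.goto[self.state]:
                self.state = self.fail[self.state]
            self.state = self.goto[self.state].get(byte, 0)
            self.offset += 1
            for idx in self.out[self.state]:
                yield idx, self.offset


if __name__ == "__main__":
    matcher = StreamMatcher([b"he", b"she", b"hers"])
    # "ushers" split across two chunks; "she" and "hers" span the boundary.
    for chunk in (b"ush", b"ers"):
        for idx, end in matcher.feed(chunk):
            print(matcher.patterns[idx], "ends at byte", end)
```

Because only the automaton state and a byte counter persist between chunks, memory use does not grow with input size, which is the property that makes this style of matcher suitable for very large files or network streams. A production engine such as Hyperscan adds regular-expression support and SIMD acceleration on top of this general approach.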
What are some alternatives?
vectorscan - A portable fork of the high-performance regular expression matching library
regex-benchmark - It's just a simple regex benchmark of different programming languages.
libfsm - DFA regular expression library & friends
go - The Go programming language
MRegex-Engine - MBU Regex engine is an open source regex engine made in C
lunatic - Lunatic is an Erlang-inspired runtime for WebAssembly
Onigmo - Onigmo is a regular expressions library forked from Oniguruma.
Rusoto - AWS SDK for Rust
linux-regex-module - High-performance regular expression matching library for Linux kernel
sqlx - 🧰 The Rust SQL Toolkit. An async, pure Rust SQL crate featuring compile-time checked queries without a DSL. Supports PostgreSQL, MySQL, and SQLite.
multiregex - Quickly match many regexes against a string
RE2 - RE2 is a fast, safe, thread-friendly alternative to backtracking regular expression engines like those used in PCRE, Perl, and Python. It is a C++ library.