codesearch
roaring-rs
codesearch | roaring-rs | |
---|---|---|
10 | 12 | |
3,422 | 682 | |
- | 0.6% | |
0.0 | 7.2 | |
almost 2 years ago | 19 days ago | |
Go | Rust | |
BSD 3-clause "New" or "Revised" License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
codesearch
- Regular Expression Matching with a Trigram Index
- How Google Code Search Worked
-
Ask HN: How do you search large code-base before adding a feature or fixing bug?
Whenever I work on huge codebase (think 1M+), I always reach for Russ Cox's codesearch https://github.com/google/codesearch. It requires indexing the codebase first, which takes 15 minutes or so, but after that searches are instant.
-
Improving GitHub Code Search
There is some older version that's open source, I haven't tried it and I don't know how much of today's code search is based on it.
https://github.com/google/codesearch
-
Facebook open sources Glean, its scalable code search and query engine
There's https://github.com/google/codesearch
-
Is there a reliable way to force google to include a search term as plain text on the result or is there a decent search platform with that feature?
If you want to find exact pieces of syntax then you should use a code search engine like GitHub's, Sourcegraph, or Google's codesearch. On the flip side, these won't be as good as Google for finding general ideas because they're focused more toward the precision of answers rather than giving you things that are related.
-
Postgres regex search over 10,000 GitHub repositories (using only a Macbook)
Check https://github.com/google/codesearch or https://swtch.com/~rsc/regexp/regexp4.html, this is actually possible.
roaring-rs
-
We’re the Meilisearch team! To celebrate v1.0 of our open-source search engine, Ask us Anything!
There are issues and pull requests but I advise you to look at the milli folder in the Meilisearch repository, it’s where all the logic is done. We extensively use RoaringBitmaps, heed the LMDB wrapper and grenad when indexing.
- Roaring-rs, better-compressed bitsets, is introducing faster multiple-bitmaps operations
-
Roaring-rs, better-compressed bitsets, is seeing the most important performance speed-up to date
On some benchmarks, we are faster but most of the time we aren't. We absolutely need to introduce benchmarks with the croaring-rs library and most of those performances gain could also be achieved with other methods to do multi-ops for example.
-
What’s everyone working on this week (9/2022)?
We tried to release the new version of roaring-rs, better compressed bitset in Rust, but found out that the core library simd module was blocking us. We now have to work on std::simd to release the blocking features.
-
Meilisearch, the Rust search engine, just raised $5M
Yeah, it can be attributed to using the roaring-rs library, but not just that, we have done so much to improve the search performances by reducing the number of set-operations we do.
-
Improving GitHub Code Search
Given the shoutouts to Burntsushi and Lemire this is almost certainly a bitmap trigram index based engine similar to https://github.com/google/zoekt
The index is likely based on Roaring bitmaps, presumably https://github.com/RoaringBitmap/roaring-rs in this case.
Nice architecture, exactly how I would have done it also.
- roaring-rs - What do you think about deprecating the set operation functions (intersect_with...) for the benefit of the std ops traits?
-
What's everyone working on this week (17/2021)?
I have worked on roaring-rs, a very fast library to do set operations like unions and intersections, and improved the four operations by using the standard ops traits.
-
What’s everyone working on this week (13/2021)?
I have implemented a better way of specifying ranges to be inserted or removed from a RoaringBitmap by using the RangeBounds trait. The roaring-rs library exposes fast data-structures to do set operations, like intersections and unions.
What are some alternatives?
zoekt - Fast trigram based code search
generic-array - Generic array types in Rust
hound - Lightning fast code searching made easy
array_tool - Array helpers for Rust's Vector and String types
Glean - System for collecting, deriving and working with facts about source code.
croaring-rs - Rust FFI wrapper for CRoaring
mozsearch - Mozilla code search website. (Please file bugs in bugzilla at https://mzl.la/2YtXmoN)
nym - Manipulate files en masse using patterns.
opengrok - OpenGrok is a fast and usable source code search and cross reference engine, written in Java
milli - Search engine library for Meilisearch ⚡️
postgres-operator - Production PostgreSQL for Kubernetes, from high availability Postgres clusters to full-scale database-as-a-service.
base_custom - Rust implementation of custom numeric base conversion.