roaring-rs
opengrok
 | roaring-rs | opengrok
---|---|---
Mentions | 12 | 11
Stars | 682 | 4,232
Growth | 1.5% | 3.1%
Activity | 7.2 | 9.0
Last commit | 14 days ago | 6 days ago
Language | Rust | Java
License | Apache License 2.0 | GNU General Public License v3.0 or later
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
roaring-rs
- We’re the Meilisearch team! To celebrate v1.0 of our open-source search engine, Ask us Anything!
There are issues and pull requests, but I advise you to look at the milli folder in the Meilisearch repository; it’s where all the logic is done. When indexing, we make extensive use of RoaringBitmaps, heed (the LMDB wrapper), and grenad.
- Roaring-rs, better-compressed bitsets, is introducing faster multiple-bitmaps operations
- Roaring-rs, better-compressed bitsets, is seeing the most important performance speed-up to date
On some benchmarks we are faster, but most of the time we aren't. We absolutely need to introduce benchmarks against the croaring-rs library, and most of those performance gains could also be achieved with other methods, to do multi-ops for example.
- What’s everyone working on this week (9/2022)?
We tried to release the new version of roaring-rs, the better-compressed bitset in Rust, but found that the core library's simd module was blocking us. We now have to work on std::simd to release the blocked features.
- Meilisearch, the Rust search engine, just raised $5M
Yeah, it can be attributed to using the roaring-rs library, but not just that: we have done a lot to improve search performance by reducing the number of set operations we do.
- Improving GitHub Code Search
Given the shoutouts to BurntSushi and Lemire, this is almost certainly a bitmap-based trigram index engine similar to https://github.com/google/zoekt
The index is likely based on Roaring bitmaps, presumably https://github.com/RoaringBitmap/roaring-rs in this case.
Nice architecture, exactly how I would have done it also.
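The comment above describes a trigram index whose posting lists are bitmaps. As a rough illustration of that idea, and not GitHub's or zoekt's actual implementation, the sketch below uses only the Rust standard library. `TrigramIndex` and its methods are hypothetical names, and `BTreeSet<u32>` stands in for a compressed bitmap such as a Roaring bitmap.

```rust
use std::collections::{BTreeSet, HashMap};

/// Toy trigram index: maps each 3-byte window to the set of document IDs
/// that contain it. A real engine would likely use compressed bitmaps here.
struct TrigramIndex {
    postings: HashMap<[u8; 3], BTreeSet<u32>>,
    docs: Vec<String>,
}

impl TrigramIndex {
    fn new() -> Self {
        TrigramIndex { postings: HashMap::new(), docs: Vec::new() }
    }

    /// Adds a document and returns its ID.
    fn add(&mut self, text: &str) -> u32 {
        let id = self.docs.len() as u32;
        for w in text.as_bytes().windows(3) {
            self.postings.entry([w[0], w[1], w[2]]).or_default().insert(id);
        }
        self.docs.push(text.to_string());
        id
    }

    /// Returns the IDs of documents containing `query` as a substring.
    fn search(&self, query: &str) -> Vec<u32> {
        let bytes = query.as_bytes();
        if bytes.len() < 3 {
            // Too short to use the index: fall back to scanning every document.
            return (0..self.docs.len() as u32)
                .filter(|&id| self.docs[id as usize].contains(query))
                .collect();
        }
        // Intersect the posting sets of every trigram in the query.
        let mut candidates: Option<BTreeSet<u32>> = None;
        for w in bytes.windows(3) {
            let posting = match self.postings.get(&[w[0], w[1], w[2]]) {
                Some(p) => p,
                None => return Vec::new(), // unseen trigram: no document can match
            };
            candidates = Some(match candidates {
                None => posting.clone(),
                Some(c) => c.intersection(posting).copied().collect(),
            });
        }
        // Trigram hits are only candidates; confirm each against the real text.
        candidates
            .unwrap_or_default()
            .into_iter()
            .filter(|&id| self.docs[id as usize].contains(query))
            .collect()
    }
}
```

A document can contain every trigram of a query without containing the query itself, so the index only narrows the candidate set; the final `contains` check is what makes the result exact.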
- roaring-rs - What do you think about deprecating the set operation functions (intersect_with...) for the benefit of the std ops traits?
- What's everyone working on this week (17/2021)?
I have worked on roaring-rs, a very fast library to do set operations like unions and intersections, and improved the four operations by using the standard ops traits.
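To illustrate what exposing set operations through the standard ops traits can look like, here is a minimal sketch on a hypothetical `ToySet` type. It wraps a plain `BTreeSet<u32>` and is not roaring-rs code. The mapping of `&`, `|`, `-`, and `^` to intersection, union, difference, and symmetric difference is the conventional one; the in-place `&=` variant is included as an assumed analogue of a method like `intersect_with`.

```rust
use std::collections::BTreeSet;
use std::ops::{BitAnd, BitAndAssign, BitOr, BitXor, Sub};

/// Hypothetical stand-in for a bitmap type: a plain ordered set, not a Roaring bitmap.
#[derive(Debug, Clone, PartialEq, Eq, Default)]
struct ToySet(BTreeSet<u32>);

impl ToySet {
    fn from_slice(values: &[u32]) -> Self {
        ToySet(values.iter().copied().collect())
    }
}

// Intersection: `&a & &b`
impl BitAnd for &ToySet {
    type Output = ToySet;
    fn bitand(self, rhs: Self) -> ToySet {
        ToySet(self.0.intersection(&rhs.0).copied().collect())
    }
}

// Union: `&a | &b`
impl BitOr for &ToySet {
    type Output = ToySet;
    fn bitor(self, rhs: Self) -> ToySet {
        ToySet(self.0.union(&rhs.0).copied().collect())
    }
}

// Difference: `&a - &b`
impl Sub for &ToySet {
    type Output = ToySet;
    fn sub(self, rhs: Self) -> ToySet {
        ToySet(self.0.difference(&rhs.0).copied().collect())
    }
}

// Symmetric difference: `&a ^ &b`
impl BitXor for &ToySet {
    type Output = ToySet;
    fn bitxor(self, rhs: Self) -> ToySet {
        ToySet(self.0.symmetric_difference(&rhs.0).copied().collect())
    }
}

// In-place intersection: `a &= &b`
impl BitAndAssign<&ToySet> for ToySet {
    fn bitand_assign(&mut self, rhs: &ToySet) {
        self.0.retain(|v| rhs.0.contains(v));
    }
}
```

Implementing the traits on references keeps both operands usable after the operation, while the `*Assign` form avoids allocating a new set when the left-hand side can be overwritten.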
- What’s everyone working on this week (13/2021)?
I have implemented a better way of specifying ranges to be inserted or removed from a RoaringBitmap by using the RangeBounds trait. The roaring-rs library exposes fast data-structures to do set operations, like intersections and unions.
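The `RangeBounds` approach mentioned above lets one function accept `a..b`, `a..=b`, `..b`, and the other range forms. The sketch below shows the general technique with a hypothetical `insert_range` helper over a standard `BTreeSet<u32>`; the name, signature, and return value are assumptions for illustration, not the roaring-rs API.

```rust
use std::collections::BTreeSet;
use std::ops::{Bound, RangeBounds};

/// Hypothetical helper: inserts every value in `range` into `set` and
/// returns how many values were newly added.
fn insert_range<R: RangeBounds<u32>>(set: &mut BTreeSet<u32>, range: R) -> u64 {
    // Normalize any range syntax to an inclusive [start, end] pair.
    let start = match range.start_bound() {
        Bound::Included(&s) => s,
        Bound::Excluded(&s) => match s.checked_add(1) {
            Some(v) => v,
            None => return 0, // nothing lies above u32::MAX
        },
        Bound::Unbounded => 0,
    };
    let end = match range.end_bound() {
        Bound::Included(&e) => e,
        Bound::Excluded(&e) => match e.checked_sub(1) {
            Some(v) => v,
            None => return 0, // `..0` is empty
        },
        Bound::Unbounded => u32::MAX,
    };
    if start > end {
        return 0; // empty range such as `5..5`
    }
    let mut added = 0;
    for v in start..=end {
        if set.insert(v) {
            added += 1;
        }
    }
    added
}
```

Converting both bounds to an inclusive pair up front means the insertion loop handles every range form the same way. A bitmap implementation would presumably fill whole containers at once rather than insert value by value as this toy version does.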
opengrok
- OpenGrok: Fast and usable source code search and cross reference engine
- Sourcegraph is no longer Open Source
[4] is not really a usable 'product'. Livegrep (https://github.com/livegrep/livegrep) was inspired by it and is very usable.
[3] used to be a Google open source project as well, but it fell out of maintenance, and Sourcegraph took it over. It powers most of the basic regex/literal search in Sourcegraph.
Mozilla's code is searchable in Searchfox (https://searchfox.org/) which uses the indexer from Livegrep, combined with their own Git indexer and language-specific cross reference databases.
OpenGrok (https://github.com/oracle/opengrok) is also rather well known, but I have found it to have a slightly worse UI than alternatives.
- Ask HN: What services/apps are you self-hosting?
- Searching a large code base.
- Improving GitHub Code Search
My job uses https://oracle.github.io/opengrok/ and I'm generally happy with it. It sometimes has problems with searches containing special characters, but it generally does what I want. It's certainly better than code search in our on-prem GitHub instance.
- Is there a tool that would allow me to query (structured search) a codebase?
I used it a long time ago, but I see this is still around: https://oracle.github.io/opengrok/
- This one made its way into my English textbook
You've never come across https://github.com/oracle/opengrok for example?
- Ask HN: What are you using to introspect your code base
[2] https://about.sourcegraph.com/
[3] https://oracle.github.io/opengrok/
[4] https://github.com/hound-search/hound
- On Navigating a Large Codebase
What are some alternatives?
generic-array - Generic array types in Rust
hound - Lightning fast code searching made easy
array_tool - Array helpers for Rust's Vector and String types
sourcegraph - Code AI platform with Code Search & Cody
croaring-rs - Rust FFI wrapper for CRoaring
Glean - System for collecting, deriving and working with facts about source code.
nym - Manipulate files en masse using patterns.
the_silver_searcher - A code-searching tool similar to ack, but faster.
milli - Search engine library for Meilisearch ⚡️
Javet - Javet is Java + V8 (JAVa + V + EighT). It is an awesome way of embedding Node.js and V8 in Java.
base_custom - Rust implementation of custom numeric base conversion.
zoekt - Fast trigram based code search