Metric | paradedb | livegrep
---|---|---
Mentions | 16 | 11
Stars | 3,962 | 1,896
Growth | 11.0% | 3.2%
Activity | 9.8 | 5.5
Last commit | 4 days ago | 15 days ago
Language | Rust | C++
License | GNU Affero General Public License v3.0 | GNU General Public License v3.0 or later
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
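As a rough illustration of the Growth figure, month-over-month growth is commonly computed as the change in stars divided by the previous month's count. The exact calculation behind the numbers above isn't stated here, so treat this as a sketch: the function name and the star counts are hypothetical.

```python
def month_over_month_growth(previous: int, current: int) -> float:
    """Return the percentage change from the previous month's star count.

    Assumption: growth = (current - previous) / previous * 100.
    The site's actual formula is not given in the text.
    """
    if previous <= 0:
        raise ValueError("previous star count must be positive")
    return (current - previous) / previous * 100.0


# Hypothetical star counts, not taken from either project.
growth = month_over_month_growth(previous=1000, current=1110)
print(f"{growth:.1f}%")
```

A negative result means the project lost stars over the month.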
paradedb
- Using ClickHouse to scale an events engine
- Code Search Is Hard
Elasticsearch is good, and it does scale, but it is much more cumbersome and expensive to scale and operate than Postgres. If you use the managed service, you'll pay for the operational pain in the form of higher pricing.
The Postgres movement is strong and extensions like ParadeDB https://github.com/paradedb/paradedb are designed specifically to solve this pain point (Disclaimer: I work for ParadeDB)
- Ask HN: Best way to mirror a Postgres database to parquet?
No timeline yet, but we know it's a high-priority feature and are working hard on it. Best way would be to join our Slack (link here: https://github.com/paradedb/paradedb/blob/dev/README.md) to follow along. It will be in the coming weeks/months, though.
- Transforming Postgres into a Fast OLAP Database
You're right. We're working on this currently. You can track the issue here: https://github.com/paradedb/paradedb/issues/717
- We built our customer data warehouse all on Postgres
There are definitely ways to cleanly make Postgres scale for analytics. We didn't discuss them in this blog post, but we will be writing about them in the future. For example, check out what the folks at ParadeDB are doing: https://github.com/paradedb/paradedb. Neon is doing an awesome job separating compute from storage. Foreign data wrappers contributed by Supabase make it super easy to read from S3 into Postgres. Lots of great work going on out there :)
- Show HN: Pg_analytics – Speed Up Postgres Analytical Queries by 94x
- Multi-Database Support in DuckDB
Check out https://github.com/paradedb/paradedb/tree/dev/pg_analytics, we're shipping this week
- ParadeDB – PostgreSQL for Search
- Postgresql index
Shameless plug, but I'm one of the makers of `pg_bm25` (https://github.com/paradedb/paradedb). We're making a faster tsvector/tsrank as a Postgres extension. Maybe it can help: our benchmarks show much faster performance, especially as row count increases.
- Building an open source vector database. Looking for advice.
livegrep
- Livegrep: Interactively Grep Source Code
- Code Search Is Hard
If you ever leave you can use Livegrep, which was based on code-search work done at Google. I personally don't use it right now but it's great and will probably meet all your needs.
[0] https://github.com/livegrep/livegrep
- FLaNK Stack Weekly for 13 November 2023
- Sourcegraph is no longer Open Source
[4] is not really a usable 'product'. Livegrep (https://github.com/livegrep/livegrep) was inspired by it and is very usable.
[3] used to be a Google open source project as well, but it fell out of maintenance, and Sourcegraph took it over. It powers most of the basic regex/literal search in Sourcegraph.
Mozilla's code is searchable in Searchfox (https://searchfox.org/) which uses the indexer from Livegrep, combined with their own Git indexer and language-specific cross reference databases.
OpenGrok (https://github.com/oracle/opengrok) is also rather well known, but I have found it to have a slightly worse UI than alternatives.
- What code search tools do you use at your job?
- Ack is a grep-like source code search tool
- Are there any good full text searching tools? I need to search against a huge amount of source code. I'm using ripgrep. The problem is that every time I search, it has to read every file again, which is kind of slow. Is there a FT searching tool that is designed with source code searching in mind?
Yes, you want https://github.com/livegrep/livegrep
- Facebook open sources Glean: a scalable code search and query engine
If you've not had to deal with a codebase that takes VSCode longer than a few minutes to index, then you're probably outside their initial target market. If you've not had to setup a hosted code search tool (eg livegrep https://github.com/livegrep/livegrep ) because there's just too much code,
- Sourcegraph: Why we're indexing the OSS universe
What are some alternatives?
MeiliSearch - A lightning-fast search API that fits effortlessly into your apps, websites, and workflow
Glean - System for collecting, deriving and working with facts about source code.
tantivy - Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust
sourcegraph - Code AI platform with Code Search & Cody
prism - Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python.
zoekt - Fast trigram based code search
retake - PostgreSQL for Search [Moved to: https://github.com/paradedb/paradedb]
linguist - Language Savant. If your repository's language is being reported incorrectly, send us a pull request!
bionicgpt - BionicGPT is an on-premise replacement for ChatGPT, offering the advantages of Generative AI while maintaining strict data confidentiality [Moved to: https://github.com/bionic-gpt/bionic-gpt]
codesearch - Fork of Google codesearch with more options
qdrant - Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
git-peek - git repo to local editor instantly