Apache Solr
Toshi
Metric | Apache Solr | Toshi |
---|---|---|
Mentions | 31 | 12 |
Stars | 4,365 | 4,117 |
Growth | 0.0% | 0.8% |
Activity | 0.0 | 6.1 |
Last commit | 2 months ago | 3 months ago |
Language | Java | Rust |
License | Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Apache Solr
- Getting started with Elasticsearch: Basic concepts
-
YaCy, a distributed Web Search Engine, based on a peer-to-peer network
There are already many projects about search:
- https://www.marginalia.nu/
- https://searchmysite.net/
- https://lucene.apache.org/
- Elasticsearch
- https://presearch.com/
- https://stract.com/
- https://wiby.me/
I think all of these projects are fun. I would like to see one succeed in reaching mainstream attention.
I have also been gathering link metadata for some time. Maybe I will use it to feed a self-hosted search engine or a language model, if I decide to experiment with that.
- domains for seed https://github.com/rumca-js/Internet-Places-Database
- bookmarks seed https://github.com/rumca-js/RSS-Link-Database
- links for year https://github.com/rumca-js/RSS-Link-Database-2024
-
Getting started with Elasticsearch + Python
Elasticsearch is based on Lucene and is used by various companies and developers across the world to build custom search solutions.
-
Tools to use to query and index data?
Elasticsearch is kinda heavyweight infra for a small project. It's built on top of Apache Lucene (https://lucene.apache.org), which you can use directly.
-
Top metrics for Elasticsearch monitoring with Prometheus
Elasticsearch is based on Lucene, which is built in Java. This means that monitoring the Java Virtual Machine (JVM) memory is crucial to understand the current usage of the whole system.
-
Cross data type search that wasn’t supported well using Elasticsearch
Apache Lucene which seems to have a lot more features than Elasticsearch
-
How to find closest keyphrase match in text?
Generally with term vectors and a tf-idf index. Lucene is a good starting place to help.
-
Java Library to perform string search
Try Elasticsearch or Solr. Behind the scenes they both use https://lucene.apache.org/, which you can use directly if you don't want what is basically a full NoSQL database service, but I'd just slap Solr up and call it a day.
-
Top 8 Open-Source Observability & Testing Tools
OpenSearch is an open-source database to ingest, search, visualize, and analyze data. It’s built on top of Apache Lucene, a FOSS library for indexing and search, which OpenSearch leverages for more advanced analytics capabilities, like anomaly detection, machine learning, full-text search, and more.
-
grep like search with preprocessing
Lucene is the thing you think you need. Elasticsearch is a nice wrapper for it. But these are Java, so maybe you want Sphinx Search (C++) or MeiliSearch (Rust).
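One of the excerpts above suggests term vectors with a tf-idf index for finding the closest keyphrase match in a text. The following is a minimal sketch of that idea in plain Python. It is not Lucene's scoring formula or API: the function names (`tokenize`, `build_idf`, `closest_keyphrase`), the smoothed idf weighting, and the sample phrases are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_idf(token_lists):
    """Smoothed inverse document frequency over the candidate keyphrases."""
    n = len(token_lists)
    df = Counter()
    for tokens in token_lists:
        df.update(set(tokens))  # count each term once per keyphrase
    return {term: math.log((1 + n) / (1 + count)) + 1.0
            for term, count in df.items()}


def tfidf_vector(tokens, idf):
    """Sparse term vector: raw term frequency times idf, known terms only."""
    counts = Counter(tokens)
    return {term: freq * idf[term] for term, freq in counts.items() if term in idf}


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def closest_keyphrase(text, keyphrases):
    """Return (best_phrase, score) for the keyphrase most similar to text."""
    token_lists = [tokenize(phrase) for phrase in keyphrases]
    idf = build_idf(token_lists)
    text_vec = tfidf_vector(tokenize(text), idf)
    scored = [(phrase, cosine(text_vec, tfidf_vector(tokens, idf)))
              for phrase, tokens in zip(keyphrases, token_lists)]
    return max(scored, key=lambda pair: pair[1])


# Hypothetical sample data, not taken from any of the quoted posts.
phrases = ["full text search", "jvm memory monitoring", "peer to peer network"]
print(closest_keyphrase("Monitoring the JVM memory is crucial", phrases))
```

Words in the text that appear in no keyphrase are ignored, so the score reflects only overlap with the candidate vocabulary. A library such as Lucene would additionally handle analysis, stemming, and an on-disk inverted index.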
Toshi
-
Tantivy 0.20 is released: Schemaless column store, Schemaless aggregations, Phrase prefix queries, Percentiles, and more...
I don't think you have an active project that addresses all those use cases. There was an attempt in Rust with Toshi that is built on top of tantivy, but the project seems to have stalled.
- An alternative to Elasticsearch that runs on a few MBs of RAM
-
Postgres Full Text Search vs. the Rest
I wish we had an extension like ZomboDB but using a lighter search engine like https://github.com/quickwit-oss/quickwit, https://github.com/toshi-search/Toshi and https://github.com/mosuka/bayard
Here I'm listing engines based on https://github.com/quickwit-oss/tantivy - tantivy is comparable to Lucene in its scope - but I'm sure there are other engines that could tackle ElasticSearch.
Another thing that could happen is maybe directly embed tantivy in Postgres using an extension, perhaps this could be an option too.
-
Ask HN: Does anybody still use bookmarking services?
I do something similar, though I index the page myself via a little browser extension I wrote. I click a button, the content gets POSTed to a server that throws it in Toshi[1]. I hacked it together on a Saturday, and it's been pretty handy; as you describe, much more useful than any bookmarking approach I've tried before.
[1] https://github.com/toshi-search/Toshi
-
*set Edge as default browser*
There is some incredible work being done in the web department, frameworks like rocket.rs and actix.rs are amazing. To get the latest info on web development in Rust, check arewewebyet.org. It doesn't list Toshi though, which is weird.
- Zinc Search engine. A lightweight alternative to elasticsearch that requires minimal resources, written in Go.
- Zinc Search engine. A lightweight alternative to Elasticsearch written in Go
- AWS releases forked Elasticsearch code. Announces new name: OpenSearch
What are some alternatives?
OpenSearch - 🔎 Open source distributed and RESTful search engine.
elasticsearch-rs - Official Elasticsearch Rust Client
Typesense - Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
MeiliSearch - A lightning-fast search API that fits effortlessly into your apps, websites, and workflow
narg - A tool to generate LC/AP formulas for a given seed in Noita.
Elasticsearch - Free and Open, Distributed, RESTful Search Engine
sonic - 🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
loki - Like Prometheus, but for logs.
lnx - ⚡ Insanely fast, 🌟 Feature-rich searching. lnx is the adaptable, typo tolerant deployment of the tantivy search engine.
Apache Lucene - Apache Lucene.NET