|4 days ago||3 days ago|
|Apache License 2.0||GNU General Public License v3.0 or later|
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
We haven't tracked posts mentioning Infinispan yet.
Tracking mentions began in Dec 2020.
[D] Are you seeing any compelling use cases of semantic search being leveraged at scale?
3 projects | reddit.com/r/MachineLearning | 29 Nov 2021
Currently Elasticsearch does not support vector search; rather, it retrieves many records using its usual query approach and then reranks them with cosine similarity. So it uses sparse retrieval followed by dense vector reranking.
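The retrieve-then-rerank pattern described in this post can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not how Elasticsearch implements it: the term-overlap scorer stands in for a real sparse ranking function, and `keyword_score`, `cosine`, `retrieve_then_rerank`, and the toy two-dimensional vectors are all hypothetical names and data invented for the example.

```python
import math


def keyword_score(query, text):
    """Sparse stage (toy): number of distinct query terms found in the text."""
    return len(set(query.lower().split()) & set(text.lower().split()))


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve_then_rerank(query, query_vec, docs, k=2):
    """docs is a list of (text, vector) pairs; returns reranked texts.

    Step 1: keep the top-k documents by keyword overlap (sparse retrieval).
    Step 2: reorder only those candidates by cosine similarity (dense rerank).
    """
    scored = [(keyword_score(query, text), text, vec) for text, vec in docs]
    matched = [item for item in scored if item[0] > 0]
    candidates = sorted(matched, key=lambda item: item[0], reverse=True)[:k]
    reranked = sorted(candidates, key=lambda item: cosine(query_vec, item[2]), reverse=True)
    return [text for _, text, _ in reranked]


# Hypothetical toy corpus with made-up embeddings.
docs = [
    ("fast search engine", [1.0, 0.0]),
    ("search engine tutorial", [0.6, 0.8]),
    ("cooking pasta", [0.0, 1.0]),
]
print(retrieve_then_rerank("search engine", [0.0, 1.0], docs))
```

Note that "cooking pasta" has the vector closest to the query but never reaches the rerank step, because it shares no keywords with the query. That is the practical limit of reranking compared with searching the vectors directly.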
Best way to store BERT embeddings on AWS?
1 project | reddit.com/r/LanguageTechnology | 1 Nov 2021
The other option to consider would be Elasticsearch (and by extension OpenSearch, as mentioned), which is better for more keyword-based retrieval. It seems that OpenSearch does support full semantic search, although I haven't had the chance to use it. With Elasticsearch you are (for now) restricted to performing keyword-based retrieval followed by semantic reranking.
What is ClickHouse, and how does it compare to PostgreSQL and TimescaleDB for time series?
11 projects | news.ycombinator.com | 21 Oct 2021
One thing I was surprised to see is that ClickHouse and ElasticSearch have the same number of contributors. That's pretty astounding given how much older and more prominent ElasticSearch has been.
⚡ 🔍 Typesense search engine: an easier-to-use alternative to ElasticSearch
3 projects | dev.to | 15 Oct 2021
In day-to-day development, it's common to need to search for a specific term in a large amount of data. Search engines exist to solve this kind of problem, and one of the best known is ElasticSearch. If you have already worked with ElasticSearch, you probably know that it's a powerful tool, but it's also complex and has a steep learning curve. For example, with an in-house deployment of ElasticSearch you will face a high production ops overhead, dealing with over 3000 configuration parameters.
Everything you need to know about Opensource Jamstack
16 projects | dev.to | 6 Oct 2021
Elasticsearch, for example, is an open-source search and analytics engine that can be self-hosted. It has over 1600 contributors on GitHub. It provides a REST API for implementing search, which can be used on static sites. For new contributors, it has a contribution guide and a significant number of issues tagged as good-first-issue.
Extremely slow aggregations
1 project | reddit.com/r/elasticsearch | 18 Sep 2021
Marxism gets a burger
2 projects | reddit.com/r/Polcompball | 14 Sep 2021
You can listen to Ghostler's ramblings for ideas. I won't bother covering that until I have a Vosk to Elasticsearch pipeline so I don't have to listen to it.
Amazon Elasticsearch Service Is Now Amazon OpenSearch Service
4 projects | news.ycombinator.com | 9 Sep 2021
Good point about MySQL/MariaDB. I think this is different, though, because search engines are at a big pivot point: adding approximate-nearest-neighbor dense vector search (search has always been sparse vector search on Lucene-based platforms).
Specifically, this feature https://github.com/elastic/elasticsearch/issues/42326#issuec... will be a big change for Elasticsearch. OpenSearch might try to mimic the API, but implementation details here will matter a lot, since this type of search is really picky when it comes to performance/recall balance. OpenDistro has already been working on their own version: https://opendistro.github.io/for-elasticsearch/features/knn.... ...so will they switch their API? Perhaps - but the results are going to be very different.
Elasticsearch adding code to reject connections to OpenSearch clusters or to clusters running open source distributions of ES7
2 projects | reddit.com/r/programming | 8 Aug 2021
Source on this rejecting open source ES7? From the little bit of digging I did it appears to include the header.
The Elasticsearch Saga Continues
2 projects | news.ycombinator.com | 8 Aug 2021
What are some alternatives?
OpenSearch - 🔎 Open source distributed and RESTful search engine.
Metabase - The simplest, fastest way to get business intelligence and analytics to everyone in your company 😋
elasticsearch-dsl-py - High level Python client for Elasticsearch
GoAccess - GoAccess is a real-time web log analyzer and interactive viewer that runs in a terminal in *nix systems or through your browser.
bleve - A modern text indexing library for go
django-haystack - Modular search for Django
cube.js - 📊 Cube — Open-Source Analytics API for Building Data Apps
Apache Superset - Apache Superset is a Data Visualization and Data Exploration Platform [Moved to: https://github.com/apache/superset]
AWStats - AWStats Log Analyzer project (official sources)
kafka-connect-elasticsearch - Kafka Connect Elasticsearch connector
PostHog - 🦔 PostHog provides open-source product analytics that you can self-host.