Java search-engine

Open-source Java projects categorized as search-engine | Edit details

Top 9 Java search-engine Projects

  • GitHub repo Elasticsearch

    Free and Open, Distributed, RESTful Search Engine

    Project mention: Best way to store BERT embeddings on AWS? | reddit.com/r/LanguageTechnology | 2021-11-01

    The other option to consider would be Elasticsearch (and by extension Opensearch as mentioned) which is better for more keyword-based retrieval - although it seems that Opensearch do support full semantic search, I haven't had the chance to use it before. With ElasticSearch you are (for now) restricted to performing a keyword-based retrieval followed by semantic-based reranking.

  • GitHub repo Apache Solr

    Apache Lucene and Solr open-source search software

    Project mention: Unified tagging | reddit.com/r/productivity | 2021-11-12

    This is kind of a personal implementation of enterprise search, so Apache Lucene or elasticsearch could do some of this, though that's a ton of development work and infrastructure just for me. I'm happy to do a bit of hacking/syncing, but building something from scratch is beyond my means at the moment.

  • Scout APM

    Scout APM: A developer's best friend. Try free for 14-days. Scout APM uses tracing logic that ties bottlenecks to source code so you know the exact line of code causing performance issues and can get back to building a great product faster.

  • GitHub repo OpenSearch

    🔎 Open source distributed and RESTful search engine.

    Project mention: Amazon Elasticsearch Service is so flexible it wants to be called by a new name | reddit.com/r/programming | 2021-09-12

    This conversation on the opensearch GitHub is very worrying:

  • GitHub repo vespa

    The open big data serving engine. https://vespa.ai

    Project mention: MeiliSearch: A Minimalist Full-Text Search Engine | news.ycombinator.com | 2021-08-15

    After looking at various alternatives, I'm thinking of trying out https://vespa.ai/ [0]

    [0] https://github.com/vespa-engine/vespa

  • GitHub repo Infinispan

    Infinispan is an open source data grid platform and highly scalable NoSQL cloud data store.

  • GitHub repo lucene

    Apache Lucene open-source search software

    Project mention: Amazon Elasticsearch Service Is Now Amazon OpenSearch Service | news.ycombinator.com | 2021-09-09

    It is pretty clear to me that Elastic is planning to build their ANN features differently than OpenDistro's k-NN implementation, or other plugins modules that extend Easticsearch in similar ways. They now will build on the Apache Lucene capabilities that were collaboratively built "upstream" by a number of individuals, some that work for Amazon and some that work for Elastic.

    From the linked issue, it seemed that they were originally planning to develop this as a proprietary feature of Elasticsearch, without contributing the functionality to Apache Lucene, but then changed direction when the Apache Lucene developers (some of which are currently employed to do such work by Amazon) started to build its approximate nearest neighbor (ANN) vector search capabilities. [1]

    It's great to see folks that work for Elastic collaborating and building on what is in Apache Lucene to extend the utility of ANN with Hierarchical Navigable Small World Graphs (HNSW) [2]! From this, I think it should be possible to implement an Open Source version of the functionality with a compatible API, if that is something that OpenSearch users seek.

    [1] https://issues.apache.org/jira/browse/LUCENE-9004

    [2] https://github.com/apache/lucene/pull/250

  • GitHub repo Para

    Multitenant backend server for building web and mobile apps rapidly. The backend for busy developers. (self-hosted or hosted) (by Erudika)

  • Nanos

    Run Linux Software Faster and Safer than Linux with Unikernels.

  • GitHub repo Sparkler

    Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.

  • GitHub repo LuceneBench

    Lucene Benchmark : benchmarking Lucene vs. SeekStorm

    Project mention: Show HN: Deephn.org full-text search 30M Hacker News posts AND linked web pages | news.ycombinator.com | 2021-04-13

    https://github.com/wolfgarbe/LuceneBench/blob/master/LuceneB...

    So, you may be hitting SimpleFSDirectory instead, which does have issues with too many searches.

    Could you share the reasons, MMapDirectory did not work for you?

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2021-11-12.

Java search-engine related posts

Index

What are some of the best open-source search-engine projects in Java? This list will help you:

Project Stars
1 Elasticsearch 57,374
2 Apache Solr 4,204
3 OpenSearch 4,124
4 vespa 3,727
5 Infinispan 910
6 lucene 569
7 Para 414
8 Sparkler 377
9 LuceneBench 5
Find remote jobs at our new job board 99remotejobs.com. There are 34 new remote jobs listed recently.
Are you hiring? Post a new remote job listing for free.
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com