Java information-retrieval

Open-source Java projects categorized as information-retrieval

Top 7 Java information-retrieval Projects

information-retrieval
  1. Apache Solr

    Apache Lucene and Solr open-source search software

    Project mention: Testing MongoDB Atlas Search Java Apps Using TestContainers | dev.to | 2025-04-24

    MongoDB Atlas Search is an extension to the built-in indexing capabilities that are part of MongoDB itself, using the awesome open source indexing and query library Lucene. MongoDB has built a wrapper around Lucene called mongot. Mongot has two responsibilities: First, it follows the change stream of any collection you choose to index and builds Lucene indexes asynchronously. Second, when you run the $search aggregation stage in a MongoDB query, mongot will be invoked to perform a Lucene query on the index and return a stream of document ids for further processing and data retrieval.

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. lucene

    Apache Lucene open-source search software

    Project mention: Unveiling Apache Lucene: Open Source Innovation, Funding, and Community | dev.to | 2025-03-06

    Apache Lucene began as a research-driven project that quickly established itself as a critical component for implementing high-performance text searching in diverse applications. Its evolutionary journey is marked by continuous improvements, bolstered by contributions from a worldwide community. The project’s development is transparently showcased on its official GitHub repository, where passionate developers, testers, and system architects work together to enhance its robust indexing and search capabilities. What sets Apache Lucene apart is its open source business model—a model driven by community participation and corporate sponsorship. This approach not only fuels rapid innovation but also provides a sustainable framework for long-term project maintenance. The dual benefit of cutting-edge innovation and financial sustainability has led many companies, including major tech players, to adopt and support Apache Lucene as the backbone of their search functionalities.

  4. solr

    Apache Solr open-source search software

    Project mention: Tudo que Estudar, para se tornar uma Engenheira(o) de Software. | dev.to | 2024-09-18

    Link

  5. anserini

    Anserini is a Lucene toolkit for reproducible information retrieval research

    Project mention: Relevance Feedback in Informational Retrieval | dev.to | 2025-03-31

    The most famous one, RM3 – interpolation of expansion terms probability with their probability in a query – is still appearing in papers of the last few years as a (noticeably decent) baseline in term-based retrieval, usually as part of anserini.

  6. Sparkler

    Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.

  7. tededroid

    miniature incremental reader for Android in Java

    Project mention: What projects are you working on? | news.ycombinator.com | 2024-07-13

    I was working on this project https://github.com/lucidl/tededroid before, now I'm going to record a video with usage, I'm looking for suitable article with text and lots of pictures.

  8. llm-integration

    spring-starter, which enables semantic search, backed by OpenAI, by couple of lines

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Java information-retrieval discussion

Log in or Post with

Java information-retrieval related posts

  • No SNAPSHOTs

    8 projects | dev.to | 30 Jul 2024

Index

What are some of the best open-source information-retrieval projects in Java? This list will help you:

# Project Stars
1 Apache Solr 4,380
2 lucene 2,971
3 solr 1,386
4 anserini 1,055
5 Sparkler 412
6 tededroid 5
7 llm-integration 2

Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com

Did you know that Java is
the 8th most popular programming language
based on number of references?