Java Machine Learning

Open-source Java projects categorized as Machine Learning

Top 23 Java Machine Learning Projects

Machine Learning
  1. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  2. Apache Hadoop

    Apache Hadoop

    Project mention: Apache Hadoop: Open Source Business Model, Funding, and Community | dev.to | 2025-05-10

    This post provides an in‐depth look at Apache Hadoop, a transformative distributed computing framework built on an open source business model. We explore its history, innovative open funding strategies, the influence of the Apache License 2.0, and the vibrant community that drives its continuous evolution. Additionally, we examine practical use cases, upcoming challenges in scaling big data processing, and future trends in interoperability and innovative financing methods, including parallels with emerging blockchain funding models. Hyperlinks to pivotal resources such as the Apache Hadoop GitHub repository, the official Apache Hadoop website, and the Apache Software Foundation are seamlessly woven into the narrative.

  3. Deeplearning4j

    Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learn...

  4. mit-deep-learning-book-pdf

    MIT Deep Learning Book in PDF format (complete and parts) by Ian Goodfellow, Yoshua Bengio and Aaron Courville

    Project mention: Top Github repositories for 10+ programming languages | dev.to | 2024-07-16

    MIT deep learning PDF

  5. Smile

    Statistical Machine Intelligence & Learning Engine

    Project mention: Smile 4.0 | news.ycombinator.com | 2024-11-25
  6. vespa

    AI + Data, online. https://vespa.ai

  7. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  8. Deep Java Library (DJL)

    An Engine-Agnostic Deep Learning Framework in Java

  9. serve

    Serve, optimize and scale PyTorch models in production (by pytorch)

    Project mention: TorchServe is no longer actively maintained | news.ycombinator.com | 2025-03-18
  10. grobid

    A machine learning software for extracting information from scholarly documents

    Project mention: Starting July 1, Academic Publishers Can't Paywall NIH-Funded Research | news.ycombinator.com | 2025-05-01

    what do you mean exactly? I was suprised how with grobid many of at least the arXiv papers are easily converted to xml for better processing than PDF.

    Most of the papers are constructed from their latex sources so there's an easy way to undo it i guess.

    https://github.com/kermitt2/grobid

  11. Tablesaw

    Java dataframe and visualization library

  12. modeldb

    Open Source ML Model Versioning, Metadata, and Experiment Management

  13. jvector

    JVector: the most advanced embedded vector search engine

    Project mention: 5 GenAI Things You Didn't Know About Astra DB | dev.to | 2025-03-06

    Astra DB's vector indexing capabilities are a combination of Cassandra's storage-attached indexing (SAI) and JVector, a non-blocking, concurrent, graph-based vector index. What this means is that Astra DB doesn't need to rebuild or block access to its index when you are inserting vectors, they are updated live.

  14. Siddhi

    Stream Processing and Complex Event Processing Engine

  15. elasticsearch-learning-to-rank

    Plugin to integrate Learning to Rank (aka machine learning for better relevance) with Elasticsearch

  16. Tribuo

    Tribuo - A Java machine learning library

    Project mention: Tribuo – A Java machine learning library | news.ycombinator.com | 2024-09-07
  17. hopsworks

    Hopsworks - Data-Intensive AI platform with a Feature Store

  18. DatumBox

    Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.

  19. JSAT

    Java Statistical Analysis Tool, a Java library for Machine Learning

  20. knime-core

    KNIME Analytics Platform

  21. jblas

    Linear Algebra for Java

  22. CERMINE

    Content ExtRactor and MINEr

    Project mention: Show HN: Kreuzberg – Modern async Python library for document text extraction | news.ycombinator.com | 2025-02-15
  23. Faceplugin-FaceRecognition-LivenessDetection-Android-SDK

    Face Recognition, Face Liveness Detection, Face Detection, Face Landmarks, Face Compare, Face Matching, Face Pose, Face Expression, Face Attributes, Face Templates Extraction, Face Landmarks (by Faceplugin-ltd)

  24. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Java Machine Learning discussion

Log in or Post with

Java Machine Learning related posts

  • Apache Hadoop: Open Source Business Model, Funding, and Community

    2 projects | dev.to | 10 May 2025
  • What is Apache Flink? Exploring Its Open Source Business Model, Funding, and Community

    2 projects | dev.to | 10 May 2025
  • What is Apache Kafka? The Open Source Business Model, Funding, and Community

    3 projects | dev.to | 10 May 2025
  • Connecting RDBs and Search Engines — Chapter 1

    5 projects | dev.to | 10 May 2025
  • India Open Source Development: Harnessing Collaborative Innovation for Global Impact

    4 projects | dev.to | 4 May 2025
  • Starting July 1, Academic Publishers Can't Paywall NIH-Funded Research

    1 project | news.ycombinator.com | 1 May 2025
  • Is RisingWave the Next Apache Flink?

    1 project | dev.to | 27 Apr 2025
  • A note from our sponsor - SaaSHub
    www.saashub.com | 13 May 2025
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source Machine Learning projects in Java? This list will help you:

# Project Stars
1 Apache Flink 24,836
2 Apache Hadoop 15,070
3 Deeplearning4j 13,959
4 mit-deep-learning-book-pdf 13,247
5 Smile 6,174
6 vespa 6,162
7 useful-java-links 5,997
8 Deep Java Library (DJL) 4,453
9 serve 4,321
10 grobid 4,010
11 Tablesaw 3,632
12 modeldb 1,721
13 jvector 1,584
14 Siddhi 1,546
15 elasticsearch-learning-to-rank 1,499
16 Tribuo 1,319
17 hopsworks 1,223
18 DatumBox 1,083
19 JSAT 791
20 knime-core 693
21 jblas 593
22 CERMINE 493
23 Faceplugin-FaceRecognition-LivenessDetection-Android-SDK 449

Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com

Did you know that Java is
the 8th most popular programming language
based on number of references?