Java Machine Learning

Open-source Java projects categorized as Machine Learning

Top 23 Java Machine Learning Projects

  • Apache Hadoop

    Apache Hadoop

    Project mention: Python vs. Java: Comparing the Pros, Cons, and Use Cases | 2022-05-21

    Hadoop (a Big Data tool).

  • Deeplearning4j

    Suite of tools for deploying and training deep learning models on the JVM. Highlights include model import for Keras, TensorFlow, and ONNX/PyTorch; a small, modular C++ library for running math code; and a Java-based math library on top of the core C++ library. Also includes SameDiff, a PyTorch/TensorFlow-like library for running deep learning using automatic differentiation.

    Project mention: Data Science Competition | 2022-03-25


  • Smile

    Statistical Machine Intelligence & Learning Engine

    Project mention: What libraries do you use for machine learning and data visualizing in scala? | 2021-11-27

    I use smile with ammonite and it feels pretty easy/good to work with. Of course for pure looking at data, and exploration, you're not going to beat python.

  • vespa

    The open big data serving engine.

    Project mention: MeiliSearch: A Minimalist Full-Text Search Engine | 2021-08-15

    After looking at various alternatives, I'm thinking of trying out [0]


  • Tablesaw

    Java dataframe and visualization library

  • Deep Java Library (DJL)

    An Engine-Agnostic Deep Learning Framework in Java

    Project mention: 2021-09 - Plans & Hopes for Clojure Data Science | 2021-09-03


  • Apache Mahout

    Mirror of Apache Mahout

  • grobid

    A machine learning software for extracting information from scholarly documents

    Project mention: Project to rebuild papers with plaintext markup languages | 2021-09-25

    - I ended up using Grobid, which converts the PDF to a very detailed XML format. The format is not a word processing format though, but a format specifically for representing scientific documents. I don't know, if it would, for example, contain tags about bold or italicized text. The tool is working really well, but since you probably cannot use the output XML format directly, it will need some postprocessing, which would be relatively simple with XML parsing libraries.

  • Siddhi

    Stream Processing and Complex Event Processing Engine

  • DatumBox

    Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.

  • Tribuo

    Tribuo - A Java machine learning library

    Project mention: txtai 3.4 released - Build AI-powered semantic search applications in Java | 2021-10-09

    Tribuo: ONNX export support is there for 2 models at the moment in main, there's a PR for factorization machines which supports ONNX export, and we plan to add another couple of models and maybe ensembles before the upcoming release. Plus I need to write a tutorial on how it all works, but you can check the tests in the meantime.

  • lychee.js

    :seedling: Next-Gen AI-Assisted Isomorphic Application Engine for Embedded, Console, Mobile, Server and Desktop

    Project mention: Abandoning GitHub | 2021-07-03

    Note that fair use as a concept (or prior art for that matter) only exist inside the US, not globally.

    For example, I'm a European citizen and therefore the EU copyright directive of 2003 applies to me. Inside the European trade union, no legal entity and only human entities can own copyright. Legal entities such as companies can only own perpetual licenses, and contracts that give them the sole copyright usage and distribution rights have been nullified both in front of state level supreme courts and EU level courts a lot (Karlsruhe, Strasbourg, etc).

    This also means that technically, if there's no warranty disclosure issued for automated code generation, the authors of the automated program are still responsible for any copyright infringement, legal damages, etc. which is a nightmare if it turns out the code was A/GPL'ed.

    I'm just saying this, because there's a world of intellectual property guidelines outside the US, too.

    Source: was sued for my lychee.js [1] project a couple times in the past, which was successfully generating composite pattern based codes that were trained based on ES/HyperNEAT hypercubes - also in the robotics/SCADA level factory sector.


  • JSAT

    Java Statistical Analysis Tool, a Java library for Machine Learning

  • cheetah

    On-device streaming speech-to-text engine powered by deep learning (by Picovoice)

    Project mention: Transcribe Speech to Text with Python for Free | 2022-03-30

    Cool! Leopard operates on files but Cheetah can do live (streaming)


  • CERMINE

    Content ExtRactor and MINEr

    Project mention: Project to rebuild papers with plaintext markup languages | 2021-09-25

    - Another alternative that's on my list but that I didn't try is Cermine.

  • oj! Algorithms

    oj! Algorithms

  • picovoice

    The end-to-end platform for building voice products at scale

    Project mention: Voice processing in Embedded Systems | 2022-04-28

    Check out Picovoice. I saw it in an article a while back and it seemed easy to get started with.

  • hms-ml-demo

    HMS ML Demo provides an example of integrating the Huawei ML Kit service into applications. It demonstrates how to integrate services provided by ML Kit, such as face detection, text recognition, image segmentation, ASR, and TTS.

    Project mention: Text Classification with HMS ML Kit Custom Model | 2022-04-03

    GitHub - HMS-Core/hms-ml-demo

  • ksql-udf-deep-learning-mqtt-iot

    Deep Learning UDF for KSQL for Streaming Anomaly Detection of MQTT IoT Sensor Data


  • JGAAP

    The Java Graphical Authorship Attribution Program

    Project mention: Critical New 0-day Vulnerability in Popular Log4j Library - List of applications | 2021-12-13


  • LookAtMe

    VideoView that plays video only when :eyes: are open and :boy: is detected with various other features (by Pradyuman7)

  • rumble

    ⛈️ RumbleDB 1.18.0 "Scarlet Ixora" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more (by RumbleDB)

    Project mention: RumbleDB: Query with ease a lot of different nested, heterogeneous data formats | 2021-12-01

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2022-05-21.

What are some of the best open-source Machine Learning projects in Java? This list will help you:

 #  Project                          Stars
 1  Apache Flink                     18,976
 2  Apache Hadoop                    12,591
 3  Deeplearning4j                   12,470
 4  Smile                            5,508
 5  vespa                            3,937
 6  Tablesaw                         2,915
 7  Deep Java Library (DJL)          2,523
 8  Apache Mahout                    1,992
 9  grobid                           1,705
10  Siddhi                           1,335
11  DatumBox                         1,077
12  Tribuo                           1,048
13  lychee.js                        771
14  JSAT                             735
15  cheetah                          443
16  CERMINE                          402
17  oj! Algorithms                   382
18  picovoice                        261
19  hms-ml-demo                      259
20  ksql-udf-deep-learning-mqtt-iot  255
21  JGAAP                            230
22  LookAtMe                         180
23  rumble                           169