Java Machine Learning

Open-source Java projects categorized as Machine Learning

Top 23 Java Machine Learning Projects

  • GitHub repo Deeplearning4j

    Suite of tools for deploying and training deep learning models on the JVM. Highlights include model import for Keras, TensorFlow, and ONNX/PyTorch; a small, modular C++ library for running math code; and a Java-based math library built on top of the core C++ library. Also includes SameDiff, a PyTorch/TensorFlow-like library for running deep learning using automatic differentiation.

    Project mention: Does Java has similar project like this one in C#? (ml, data) | | 2021-05-23

    Also, the website is now redirected to:

  • GitHub repo Apache Hadoop

    Apache Hadoop

    Project mention: Setting up a single-node Hadoop cluster | | 2021-11-14


  • GitHub repo Smile

    Statistical Machine Intelligence & Learning Engine

    Project mention: What libraries do you use for machine learning and data visualizing in scala? | | 2021-11-27

    I use smile with ammonite and it feels pretty easy/good to work with. Of course for pure looking at data, and exploration, you're not going to beat python.

  • GitHub repo vespa

    The open big data serving engine.

    Project mention: MeiliSearch: A Minimalist Full-Text Search Engine | | 2021-08-15

    After looking at various alternatives, I'm thinking of trying out [0]


  • GitHub repo Tablesaw

    Java dataframe and visualization library

    Project mention: Does Java has similar project like this one in C#? (ml, data) | | 2021-05-23

    For data frames, tablesaw or anything with apache arrow interop would be a good way to go:

  • GitHub repo Deep Java Library (DJL)

    An Engine-Agnostic Deep Learning Framework in Java

    Project mention: 2021-09 - Plans & Hopes for Clojure Data Science | | 2021-09-03

    Here is link number 1 - Previous text "DJL"

  • GitHub repo Apache Mahout

    Mirror of Apache Mahout

  • GitHub repo Oryx 2

    Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning

  • GitHub repo grobid

    Machine learning software for extracting information from scholarly documents

    Project mention: Project to rebuild papers with plaintext markup languages | | 2021-09-25

    - I ended up using Grobid, which converts the PDF to a very detailed XML format. The format is not a word processing format though, but a format specifically for representing scientific documents. I don't know, if it would, for example, contain tags about bold or italicized text. The tool is working really well, but since you probably cannot use the output XML format directly, it will need some postprocessing, which would be relatively simple with XML parsing libraries.

  • GitHub repo Siddhi

    Stream Processing and Complex Event Processing Engine

  • GitHub repo DatumBox

    Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.

  • GitHub repo Tribuo

    Tribuo - A Java machine learning library

    Project mention: txtai 3.4 released - Build AI-powered semantic search applications in Java | | 2021-10-09

    Tribuo (, ONNX export support is there for 2 models at the moment in main, there's a PR for factorization machines which supports ONNX export, and we plan to add another couple of models and maybe ensembles before the upcoming release. Plus I need to write a tutorial on how it all works, but you can check the tests in the meantime.

  • GitHub repo lychee.js

    :seedling: Next-Gen AI-Assisted Isomorphic Application Engine for Embedded, Console, Mobile, Server and Desktop

    Project mention: Abandoning GitHub | | 2021-07-03

    Note that fair use as a concept (or prior art for that matter) only exist inside the US, not globally.

    For example, I'm a European citizen and therefore the EU copyright directive of 2003 applies to me. Inside the European trade union, no legal entity and only human entities can own copyright. Legal entities such as companies can only own perpetual licenses, and contracts that give them the sole copyright usage and distribution rights have been nullified both in front of state level supreme courts and EU level courts a lot (Karlsruhe, Strasbourg, etc).

    This also means that technically, if there's no warranty disclosure issued for automated code generation, the authors of the automated program are still responsible for any copyright infringement, legal damages, etc. which is a nightmare if it turns out the code was A/GPL'ed.

    I'm just saying this, because there's a world of intellectual property guidelines outside the US, too.

    Source: was sued for my lychee.js [1] project a couple times in the past, which was successfully generating composite pattern based codes that were trained based on ES/HyperNEAT hypercubes - also in the robotics/SCADA level factory sector.


  • GitHub repo JSAT

    Java Statistical Analysis Tool, a Java library for Machine Learning

  • GitHub repo OpenMetadata

    Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.

    Project mention: OpenMetadata | | 2021-09-23

    Hi, We are a team building OpenMetadata, a single place to discover, collaborate and get your data right. Please check our announcement here Check out our code . If you are interested in learning please do join our slack and ask any questions you may have

  • GitHub repo CERMINE

    Content ExtRactor and MINEr

    Project mention: Project to rebuild papers with plaintext markup languages | | 2021-09-25

    - Another alternative that's on my list but that I didn't try is Cermine.

  • GitHub repo oj! Algorithms

    oj! Algorithms

  • GitHub repo hms-ml-demo

    HMS ML Demo provides an example of integrating the Huawei ML Kit service into applications. It demonstrates how to integrate services provided by ML Kit, such as face detection, text recognition, image segmentation, ASR, and TTS.

    Project mention: How to Recognize 17,000+ Landmarks With Machine Learning | | 2021-11-24

    For more details, you can go to: the ML Kit official website; the ML Kit Development Documentation page, to find the documents you need; Reddit, to join our developer discussion; GitHub, to download ML Kit sample code; Stack Overflow, to solve any integration problems.

  • GitHub repo ksql-udf-deep-learning-mqtt-iot

    Deep Learning UDF for KSQL for Streaming Anomaly Detection of MQTT IoT Sensor Data

    Project mention: Machine learning & Kafka KSQL stream processing — bug me when I’ve left the heater on | | 2021-01-07

    Inspiration for this project comes from Kai Waehner and his project Deep Learning UDF for KSQL. The notification system was inspired by Robin Moffatt and his blog on Event-Driven Alerting with Slack.

  • GitHub repo JGAAP

    The Java Graphical Authorship Attribution Program

    Project mention: The layout of the threat e-mail on Phil's Twitter looks familiar | | 2021-01-15

    Example desktop tool:

  • GitHub repo LookAtMe

    VideoView that plays video only when :eyes: are open and :boy: is detected with various other features (by Pradyuman7)

  • GitHub repo Mutters

    A framework for building bot brains.

    Project mention: Ink 1.0 – open-source scripting language for interactive narrative | | 2021-04-10

    There are two types of chatbots -- unreliable freeform ones (e.g. GPT-3) and reliable script-based ones (e.g. Alexa). I believe that what you're looking to do (dynamic rewriting) is moreso an active research topic!

    The script-based chatbots are very similar to those in the 80s, with one major addition -- the options at each choice point are now "intents" instead of exact text (e.g. "the user intends to order a taxi"). Machine learning is then used to classify whatever the user says/writes as an "intent," then the script is followed to the next choice point.

    See this open-source project for an example of using Ink to drive a script-based chatbot similar to Alexa:
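
The script-based flow described in the Mutters mention (classify what the user says as an intent, then follow the script to the next choice point) can be sketched in a few lines of Java. This is a minimal illustration under stated assumptions: the keyword matcher stands in for a real machine learning classifier, and every class, method, and intent name below is hypothetical and not part of the Mutters or Ink APIs.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

public class IntentScriptSketch {

    // Hypothetical intents mapped to trigger keywords. A real bot would use a
    // trained classifier here instead of keyword matching.
    private static final Map<String, List<String>> INTENT_KEYWORDS = new LinkedHashMap<>();

    // Hypothetical script: each intent leads to the next choice point (a prompt).
    private static final Map<String, String> SCRIPT = new LinkedHashMap<>();

    static {
        INTENT_KEYWORDS.put("OrderTaxi", List.of("taxi", "cab", "ride"));
        INTENT_KEYWORDS.put("CancelOrder", List.of("cancel"));

        SCRIPT.put("OrderTaxi", "Where should the taxi pick you up?");
        SCRIPT.put("CancelOrder", "Which booking do you want to cancel?");
        SCRIPT.put("Unknown", "Sorry, I did not understand that.");
    }

    // Map free text to an intent name; the first intent with a matching keyword wins.
    static String classify(String utterance) {
        String text = utterance.toLowerCase(Locale.ROOT);
        for (Map.Entry<String, List<String>> entry : INTENT_KEYWORDS.entrySet()) {
            for (String keyword : entry.getValue()) {
                if (text.contains(keyword)) {
                    return entry.getKey();
                }
            }
        }
        return "Unknown";
    }

    // Follow the script from the classified intent to the next prompt.
    static String respond(String utterance) {
        return SCRIPT.get(classify(utterance));
    }

    public static void main(String[] args) {
        System.out.println(respond("I need a cab to the airport"));
        System.out.println(respond("Please cancel my booking"));
        System.out.println(respond("What's the weather like?"));
    }
}
```

The point of the design is that the script only ever sees intent names, so the classifier can be swapped for a trained model without touching the dialogue logic.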

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2021-11-27.

What are some of the best open-source Machine Learning projects in Java? This list will help you:

Rank  Project                          Stars
1     Apache Flink                     17,642
2     Deeplearning4j                   12,257
3     Apache Hadoop                    12,124
4     Smile                            5,395
5     vespa                            3,727
6     Tablesaw                         2,758
7     Deep Java Library (DJL)          2,232
8     Apache Mahout                    1,957
9     Oryx 2                           1,785
10    grobid                           1,599
11    Siddhi                           1,281
12    DatumBox                         1,073
13    Tribuo                           988
14    lychee.js                        757
15    JSAT                             716
16    OpenMetadata                     514
17    CERMINE                          386
18    oj! Algorithms                   364
19    hms-ml-demo                      241
20    ksql-udf-deep-learning-mqtt-iot  239
21    JGAAP                            223
22    LookAtMe                         173
23    Mutters                          159