Java Machine Learning

Open-source Java projects categorized as Machine Learning

Top 23 Java Machine Learning Projects

  • Apache Hadoop

    Apache Hadoop

    Project mention: Getting thousands of files of output back from a container | reddit.com/r/docker | 2023-05-02

    Did you check out tools like https://hadoop.apache.org/ ?

  • InfluxDB

    Access the most powerful time series database as a service. Ingest, store, & analyze all types of time series data in a fully-managed, purpose-built database. Keep data forever with low-cost storage and superior data compression.

  • Deeplearning4j

    Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learning using automatic differentiation.

    Project mention: Java for ML? | reddit.com/r/computerscience | 2022-11-13
  • mit-deep-learning-book-pdf

    MIT Deep Learning Book in PDF format (complete and parts) by Ian Goodfellow, Yoshua Bengio and Aaron Courville

    Project mention: Is supervised machine learning the same as linear regression? | reddit.com/r/learnmachinelearning | 2023-03-07
  • Smile

    Statistical Machine Intelligence & Learning Engine

    Project mention: Need statistic test library for Spark Scala | reddit.com/r/scala | 2023-05-05

    Check out Smile too.

  • vespa

    The open big data serving engine. https://vespa.ai

    Project mention: Top 10 Best Vector Databases & Libraries | dev.to | 2023-04-19

    Vespa(4.3k ⭐) → A fully featured search engine and vector database. It supports vector search (ANN), lexical search, and search in structured data, all in the same query. Integrated machine-learned model inference allows you to apply AI to make sense of your data in real time.

  • ONLYOFFICE

    ONLYOFFICE Docs — document collaboration in your environment. Powerful document editing and collaboration in your app or environment. Ultimate security, API and 30+ ready connectors, SaaS or on-premises

  • serve

    Serve, optimize and scale PyTorch models in production (by pytorch)

    Project mention: Is there a course that teaches you how to make an API with a trained model? | reddit.com/r/learnmachinelearning | 2023-05-27
  • Tablesaw

    Java dataframe and visualization library

    Project mention: Tablesaw: Java Dataframe and Visualization Library | news.ycombinator.com | 2023-02-06
  • Deep Java Library (DJL)

    An Engine-Agnostic Deep Learning Framework in Java

    Project mention: Is deeplearning4j a good choice? | reddit.com/r/java | 2023-03-11

    It seems to have been picked up by Eclipse and there is also Oracle Labs' Tribuo and Deep Java Library. All seem active, but I don't know much about any of them. I agree it's probably best to follow the community and use a more popular tool like PyTorch.

  • grobid

    A machine learning software for extracting information from scholarly documents

    Project mention: 🥪 Best Sites For ebooks, articles, research papers etc..🥪 | reddit.com/r/RockMods | 2023-05-17
  • Apache Mahout

    Mirror of Apache Mahout

  • Siddhi

    Stream Processing and Complex Event Processing Engine

    Project mention: Seeking Feedback on Siddhi | reddit.com/r/dataengineering | 2023-03-13

    Hi, I'm building a realtime analysis solution for our domain oriented microservice backend. All domain emit events in kafka. I'm looking for a solution to ingest data in an OLAP database based on processing those events (enrichment, filtering etc.). I found https://siddhi.io/ which looks promising. Since the last release (2019) the product is now part of WSo2 solution. I'm also looking at https://www.benthos.dev/. I'm more interested in a declarative solution than code.

  • elasticsearch-learning-to-rank

    Plugin to integrate Learning to Rank (aka machine learning for better relevance) with Elasticsearch

    Project mention: New free tool that uses fine-tuned BERT model to surface answers from research papers | reddit.com/r/LanguageTechnology | 2022-10-28

    I worked on a learning-to-rank problem at a previous job (which unfortunately never got deployed womp, womp). This was early days, so at the time I was looking at using LambdaMART with solr or elasticsearch for reranking with a Bayesian click model to get pseudo-labels for relevance.

  • Tribuo

    Tribuo - A Java machine learning library

    Project mention: Is deeplearning4j a good choice? | reddit.com/r/java | 2023-03-11

    It seems to have been picked up by Eclipse and there is also Oracle Labs' Tribuo and Deep Java Library. All seem active, but I don't know much about any of them. I agree it's probably best to follow the community and use a more popular tool like PyTorch.

  • DatumBox

    Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.

  • hopsworks

    Hopsworks - Data-Intensive AI platform with a Feature Store

    Project mention: Hopworks: MLOps platform with Python-centric Feature Store | news.ycombinator.com | 2022-12-02
  • lychee.js

    :seedling: Next-Gen AI-Assisted Isomorphic Application Engine for Embedded, Console, Mobile, Server and Desktop

  • JSAT

    Java Statistical Analysis Tool, a Java library for Machine Learning

  • submarine

    Submarine is Cloud Native Machine Learning Platform. (by apache)

    Project mention: Running Apache Submarine with minikube | dev.to | 2022-08-26

    git clone https://github.com/apache/submarine.git cd submarine git checkout rel/release-0.7.0 helm install submarine ./helm-charts/submarine

  • jblas

    Linear Algebra for Java

    Project mention: Can't satisfy libgfortran dependency | reddit.com/r/voidlinux | 2022-06-07

    I'm trying to run a minecraft mod that requires a special dependency to work properly. These are the instructions and this is the mod page. It only shows what to do on debian and redhat. I tried installing both the libgfortran and libgfortran-32bit packages, but it still doesn't work. I also tried installing apt, but it doesn't work and I was unable to find the instructions on how to use it on non-debian based distros.

  • giskard

    Collaborative & Open-Source Quality Assurance for all AI models 🧑‍🔧⚡️

    Project mention: [R] LMFlow Benchmark: An Automatic Evaluation Framework for Open-Source LLMs | reddit.com/r/MachineLearning | 2023-05-09

    This is super interesting! Thanks for sharing. We're also working on this research field from an open-source angle (https://github.com/Giskard-AI/giskard)

  • knime-core

    KNIME Analytics Platform

    Project mention: Has anybody used Orange? | reddit.com/r/datascience | 2023-04-04
  • Sonar

    Write Clean Java Code. Always.. Sonar helps you commit clean code every time. With over 600 unique rules to find Java bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-05-27.

Java Machine Learning related posts

Index

What are some of the best open-source Machine Learning projects in Java? This list will help you:

Project Stars
1 Apache Flink 21,279
2 Apache Hadoop 13,530
3 Deeplearning4j 12,952
4 mit-deep-learning-book-pdf 11,230
5 Smile 5,743
6 useful-java-links 5,516
7 vespa 4,419
8 serve 3,468
9 Tablesaw 3,239
10 Deep Java Library (DJL) 3,220
11 grobid 2,140
12 Apache Mahout 2,062
13 Siddhi 1,439
14 elasticsearch-learning-to-rank 1,427
15 Tribuo 1,151
16 DatumBox 1,084
17 hopsworks 921
18 lychee.js 798
19 JSAT 759
20 submarine 642
21 jblas 576
22 giskard 463
23 knime-core 449
TestGPT | Generating meaningful tests for busy devs
Get non-trivial tests (and trivial, too!) suggested right inside your IDE, so you can code smart, create more value, and stay confident when you push.
codium.ai