Machine learning on JVM

This page summarizes the projects mentioned and recommended in the original post on /r/scala

InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  1. Smile

    Statistical Machine Intelligence & Learning Engine

    I was using Smile for some period - https://haifengl.github.io/ - it's quite small and lightweight Java lib with some very basic algorithms - I was using in particularly cauterization. Along with this it provides Scala API.

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. Breeze

    Breeze is/was a numerical processing library for Scala.

    I haven't checked in on this project in a long time, but Breeze is something akin to NumPy/SciPy.

  4. SynapseML

    Simple and Distributed Machine Learning

    Microsoft ML for Spark gets you a range of powerful ML features on Spark.

  5. BigDL

    Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.

    Intel BigDL for Spark which again is for Spark.

  6. Deep Java Library (DJL)

    An Engine-Agnostic Deep Learning Framework in Java

    AWS Deep Learning more deep learning.

  7. rainier

    Bayesian inference in Scala.

    Bayesian Inference which is more specific.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • How to Get Started with Scikit-Learn: A Beginner-Friendly Guide to Machine Learning in Python

    7 projects | dev.to | 24 Apr 2025
  • Show HN: Krixik – Easily sequence small/specialized AI models (pip installable)

    1 project | news.ycombinator.com | 4 Nov 2024
  • [Python] How do we lazyload a Python module? - analyzing LazyLoader from MLflow

    3 projects | dev.to | 5 Oct 2024
  • Show HN: Terge – an easy-to-use library for merging AI models

    1 project | news.ycombinator.com | 18 Jun 2024
  • Show HN: Cognita – open-source RAG framework for modular applications

    3 projects | news.ycombinator.com | 27 Apr 2024

Did you know that Scala is
the 35th most popular programming language
based on number of references?