Scala Science and Data analysis

Open-source Scala projects categorized as Science and Data analysis

Top 17 Scala Science and Data analysis Projects

  • Breeze

    Breeze is a numerical processing library for Scala.

    Project mention: Arbitrary functions of n dimensions in Scala | /r/scala | 2023-01-23

    Also, you can look at breeze.generic.UFunc for an inspiration.

  • Algebird

    Abstract Algebra for Scala

    Project mention: What do you use when you have to store high cardinality metrics? | /r/golang | 2023-02-13 (production ready, used at Twitter, but for the JVM)

  • Onboard AI

    Learn any GitHub repo in 59 seconds. Onboard AI learns any GitHub repo in minutes and lets you chat with it to locate functionality, understand different parts, and generate new code. Use it for free at

  • Spire

    Powerful new number types and numeric abstractions for Scala.

  • Tensorflow_scala

    TensorFlow API for the Scala Programming Language

  • Squants

    The Scala API for Quantities, Units of Measure and Dimensional Analysis

    Project mention: Show HN: Numbat – A programming language with physical dimensions as types | | 2023-11-16

    FACTORIE is a toolkit for deployable probabilistic modeling, implemented as a software library in Scala. It provides its users with a succinct language for creating relational factor graphs, estimating parameters and performing inference.

  • Compute.scala

    Scientific computing with N-dimensional arrays

  • InfluxDB

    Collect and Analyze Billions of Data Points in Real Time. Manage all types of time series data in a single, purpose-built database. Run at any scale in any environment in the cloud, on-premises, or at the edge.

  • Libra

    A dimensional analysis library based on dependent types (by to-ithaca)

  • Optimus * 96

    Optimus is a mathematical programming library for Scala. (by vagmcs)

  • OpenMOLE

    Workflow engine for exploration of simulation models using high throughput computing

  • Clustering4Ever

    C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.

  • Tyche

    Statistics utilities for the JVM - in Scala!

  • LoMRF

    LoMRF is an open-source implementation of Markov Logic Networks

  • MGO

    Purely functional genetic algorithms for multi-objective optimisation

  • Axle

    Axle Domain Specific Language for Scientific Cloud Computing and Visualization (by axlelang)

  • SwiftLearner

    SwiftLearner: Scala machine learning library

  • Persist-Units

    Scala Units of Measure Types (by nestorpersist)

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-11-16.

Scala Science and Data analysis related posts


What are some of the best open-source Science and Data analysis projects in Scala? This list will help you:

Project Stars
1 Breeze 3,416
2 Algebird 2,269
3 Spire 1,743
4 Tensorflow_scala 928
5 Squants 906
7 Compute.scala 200
8 Libra 200
9 Optimus * 96 142
10 OpenMOLE 141
11 Clustering4Ever 127
12 Tyche 93
13 LoMRF 80
14 MGO 72
15 Axle 67
16 SwiftLearner 39
17 Persist-Units 9
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives