Data science in Scala

This page summarizes the projects mentioned and recommended in the original post on reddit.com/r/scala.

  • Breeze

    Breeze is a numerical processing library for Scala.

    You can use https://github.com/scalanlp/breeze, a Scala library that is roughly a NumPy/plotting equivalent. Unlike Spark, which covers more use cases than just the classic data science workflow, Breeze is built specifically for "Data Science in Scala". The drawback is a classic one in Scala land: some major libraries get abandoned abruptly. Breeze's commits seem to have slowed down significantly, and the website linked from its GitHub page, www.scalanlp.org, is broken.
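
    To give a feel for the NumPy comparison, here is a minimal sketch of basic vector and matrix operations in Breeze. It assumes the Breeze dependency is already on the classpath; the object name `BreezeSketch` and the sample values are made up for illustration and do not come from the original post.

    ```scala
    // Assumption: Breeze is on the classpath, e.g. via sbt:
    //   libraryDependencies += "org.scalanlp" %% "breeze" % "<version>"
    import breeze.linalg.{DenseMatrix, DenseVector, sum}
    import breeze.stats.mean

    // Hypothetical example object; names and values are illustrative only.
    object BreezeSketch {
      // A dense vector of doubles, comparable to a 1-D NumPy array.
      val v: DenseVector[Double] = DenseVector(1.0, 2.0, 3.0)

      // A 2x3 dense matrix, built from row tuples.
      val m: DenseMatrix[Double] = DenseMatrix((1.0, 0.0, 0.0), (0.0, 2.0, 0.0))

      val total: Double = sum(v)                 // sum of all elements
      val average: Double = mean(v)              // arithmetic mean
      val scaled: DenseVector[Double] = v * 2.0  // multiply every element by a scalar
      val product: DenseVector[Double] = m * v   // matrix-vector product (2x3 times 3)
      val dot: Double = v dot v                  // inner product

      def main(args: Array[String]): Unit = {
        println(s"sum = $total, mean = $average, dot = $dot")
        println(s"scaled = $scaled")
        println(s"m * v = $product")
      }
    }
    ```

    The operators are resolved through Breeze's implicit operator instances, so the imports from `breeze.linalg` need to be in scope wherever the expressions are used.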

  • spark-nlp

    State of the Art Natural Language Processing

    I am not aware of common open frameworks like TensorFlow, PyTorch or scikit-learn for Scala. But specifically for natural language processing, there's Spark NLP from John Snow Labs.

  • SynapseML

    Simple and Distributed Machine Learning

    There are libraries around, e.g. Microsoft SynapseML and LinkedIn Photon ML.

  • photon-ml

    A scalable machine learning library on Apache Spark

    There are libraries around, e.g. Microsoft SynapseML and LinkedIn Photon ML.

  • saddle

    SADDLE: Scala Data Library (by pityka)

    You might be interested in the saddle library, a dataframe manipulation library similar to Python's pandas.

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.
