Scala Data Science

Open-source Scala projects categorized as Data Science Edit details

Top 5 Scala Data Science Projects

  • SynapseML

    Simple and Distributed Machine Learning

    Project mention: [N] Microsoft Announces New Integrations with OpenAI and MLFlow | | 2022-08-09
  • metarank

    A low code Machine Learning peersonalized ranking service for articles, listings, search results, recommendations that boosts user engagement. A friendly Learn-to-Rank engine

    Project mention: Ask HN: Is it ethical for open-source projects to have usage analytics tracking? | | 2022-08-29

    We’re building an open-source tool to do search/category/recommendation personalization, eventually planning to create a business out of it. We have a small number of pilot projects with real feedback, but we rarely have a chance to see how new people interact with the service, as it’s self-hosted backend tool with no UI.

    We have an idea to add anonymous analytics reporting to get a glimpse of real usage (and places where people are struggling to improve), but are concerned if it’s ethical or not to do such intrusive things.

    Is it acceptable for an open-source project to have this type of tracking, considering our materialistic plans to transform it into a business?

  • SonarQube

    Static code analysis for 29 languages.. Your projects are multi-language. So is SonarQube analysis. Find Bugs, Vulnerabilities, Security Hotspots, and Code Smells so you can release quality code every time. Get started analyzing your projects today for free.

  • doddle-model

    :cake: doddle-model: machine learning in Scala.

  • LynxKite

    The complete graph data science platform

  • data-validator

    A tool to validate data built around Apache Spark.

  • Scout APM

    Truly a developer’s best friend. Scout APM is great for developers who want to find and fix performance issues in their applications. With Scout, we'll take care of the bugs so you can focus on building great things 🚀.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2022-08-29.

Scala Data Science related posts


What are some of the best open-source Data Science projects in Scala? This list will help you:

Project Stars
1 SynapseML 3,784
2 metarank 1,526
3 doddle-model 142
4 LynxKite 124
5 data-validator 72
Find remote jobs at our new job board There are 8 new remote jobs listed recently.
Are you hiring? Post a new remote job listing for free.
Download’s Tech Salary Report
Median salaries, most in-demand technologies, state of the remote work... all you need to know your worth on the market by tech recruitment platform