What companies/startups are using Scala (open source projects on github)?

This page summarizes the projects mentioned and recommended in the original post on /r/scala

Sevalla - Deploy and host your apps and databases, now with $50 credit!
Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!
sevalla.com
featured
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com
featured
  1. theGardener

    Discontinued theGardener will help you to include the documentation in your development loop so that you will trust again the documentation you provide.

    Not popular TBH but we do have open sourced a tool we use for documentation at KelkooGroup called theGardener (https://github.com/KelkooGroup/theGardener).

  2. Sevalla

    Deploy and host your apps and databases, now with $50 credit! Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!

    Sevalla logo
  3. sttp-oauth2

    OAuth2 client library implemented in Scala using sttp

    In Ocado Technology we maintain sttp-oauth2 - a OAuth2 client library. There's something more from us coming soon.

  4. circe

    Yet another JSON library for Scala

    Circe adopters should be using Scala https://github.com/circe/circe

  5. delta

    An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs (by delta-io)

    There are so many of them in big data, e.g. Kafka, Spark, Flink, Delta, Snowplow, Finagle, Deequ, CMAK, OpenWhisk, Snowflake, TheHive, TVM-VTA, etc.

  6. Snowplow

    The leader in Customer Data Infrastructure

    There are so many of them in big data, e.g. Kafka, Spark, Flink, Delta, Snowplow, Finagle, Deequ, CMAK, OpenWhisk, Snowflake, TheHive, TVM-VTA, etc.

  7. Finagle

    A fault tolerant, protocol-agnostic RPC system

    There are so many of them in big data, e.g. Kafka, Spark, Flink, Delta, Snowplow, Finagle, Deequ, CMAK, OpenWhisk, Snowflake, TheHive, TVM-VTA, etc.

  8. deequ

    Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

    There are so many of them in big data, e.g. Kafka, Spark, Flink, Delta, Snowplow, Finagle, Deequ, CMAK, OpenWhisk, Snowflake, TheHive, TVM-VTA, etc.

  9. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  10. kafka-manager

    CMAK is a tool for managing Apache Kafka clusters

    There are so many of them in big data, e.g. Kafka, Spark, Flink, Delta, Snowplow, Finagle, Deequ, CMAK, OpenWhisk, Snowflake, TheHive, TVM-VTA, etc.

  11. OpenWhisk

    Apache OpenWhisk is an open source serverless cloud platform

    There are so many of them in big data, e.g. Kafka, Spark, Flink, Delta, Snowplow, Finagle, Deequ, CMAK, OpenWhisk, Snowflake, TheHive, TVM-VTA, etc.

  12. snowflake

    Discontinued Snowflake is a network service for generating unique ID numbers at high scale with some simple guarantees.

    There are so many of them in big data, e.g. Kafka, Spark, Flink, Delta, Snowplow, Finagle, Deequ, CMAK, OpenWhisk, Snowflake, TheHive, TVM-VTA, etc.

  13. TheHive

    TheHive: a Scalable, Open Source and Free Security Incident Response Platform

    There are so many of them in big data, e.g. Kafka, Spark, Flink, Delta, Snowplow, Finagle, Deequ, CMAK, OpenWhisk, Snowflake, TheHive, TVM-VTA, etc.

  14. tvm-vta

    Open, Modular, Deep Learning Accelerator

    There are so many of them in big data, e.g. Kafka, Spark, Flink, Delta, Snowplow, Finagle, Deequ, CMAK, OpenWhisk, Snowflake, TheHive, TVM-VTA, etc.

  15. woof

    A pure Scala 3 logging library with no reflection

    Here is a fun one - https://github.com/LEGO/woof

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Apache Iceberg V3 Spec new features for more efficient and flexible data lakes

    2 projects | news.ycombinator.com | 11 Aug 2025
  • Stream Processing Systems in 2025: RisingWave, Flink, Spark Streaming, and What's Ahead

    7 projects | dev.to | 27 Jan 2025
  • Apache Zeppelin

    6 projects | news.ycombinator.com | 2 Sep 2024
  • [D] Is there other better data format for LLM to generate structured data?

    1 project | /r/MachineLearning | 10 Dec 2023
  • Delta vs Iceberg: make love not war

    1 project | /r/MicrosoftFabric | 30 Jun 2023