What companies/startups are using Scala (open source projects on github)?

This page summarizes the projects mentioned and recommended in the original post on /r/scala

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • theGardener

    theGardener will help you to include the documentation in your development loop so that you will trust again the documentation you provide.

    Not popular TBH but we do have open sourced a tool we use for documentation at KelkooGroup called theGardener (https://github.com/KelkooGroup/theGardener).

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • sttp-oauth2

    OAuth2 client library implemented in Scala using sttp

    In Ocado Technology we maintain sttp-oauth2 - a OAuth2 client library. There's something more from us coming soon.

  • circe

    Yet another JSON library for Scala

    Circe adopters should be using Scala https://github.com/circe/circe

  • delta

    An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs (by delta-io)

    There are so many of them in big data, e.g. Kafka, Spark, Flink, Delta, Snowplow, Finagle, Deequ, CMAK, OpenWhisk, Snowflake, TheHive, TVM-VTA, etc.

  • Snowplow

    The leader in Next-Generation Customer Data Infrastructure

    There are so many of them in big data, e.g. Kafka, Spark, Flink, Delta, Snowplow, Finagle, Deequ, CMAK, OpenWhisk, Snowflake, TheHive, TVM-VTA, etc.

  • Finagle

    A fault tolerant, protocol-agnostic RPC system

    There are so many of them in big data, e.g. Kafka, Spark, Flink, Delta, Snowplow, Finagle, Deequ, CMAK, OpenWhisk, Snowflake, TheHive, TVM-VTA, etc.

  • deequ

    Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

    There are so many of them in big data, e.g. Kafka, Spark, Flink, Delta, Snowplow, Finagle, Deequ, CMAK, OpenWhisk, Snowflake, TheHive, TVM-VTA, etc.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • kafka-manager

    CMAK is a tool for managing Apache Kafka clusters

    There are so many of them in big data, e.g. Kafka, Spark, Flink, Delta, Snowplow, Finagle, Deequ, CMAK, OpenWhisk, Snowflake, TheHive, TVM-VTA, etc.

  • OpenWhisk

    Apache OpenWhisk is an open source serverless cloud platform

    There are so many of them in big data, e.g. Kafka, Spark, Flink, Delta, Snowplow, Finagle, Deequ, CMAK, OpenWhisk, Snowflake, TheHive, TVM-VTA, etc.

  • snowflake

    Discontinued Snowflake is a network service for generating unique ID numbers at high scale with some simple guarantees.

    There are so many of them in big data, e.g. Kafka, Spark, Flink, Delta, Snowplow, Finagle, Deequ, CMAK, OpenWhisk, Snowflake, TheHive, TVM-VTA, etc.

  • TheHive

    TheHive: a Scalable, Open Source and Free Security Incident Response Platform

    There are so many of them in big data, e.g. Kafka, Spark, Flink, Delta, Snowplow, Finagle, Deequ, CMAK, OpenWhisk, Snowflake, TheHive, TVM-VTA, etc.

  • tvm-vta

    Open, Modular, Deep Learning Accelerator

    There are so many of them in big data, e.g. Kafka, Spark, Flink, Delta, Snowplow, Finagle, Deequ, CMAK, OpenWhisk, Snowflake, TheHive, TVM-VTA, etc.

  • woof

    A pure Scala 3 logging library with no reflection

    Here is a fun one - https://github.com/LEGO/woof

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • [D] Is there other better data format for LLM to generate structured data?

    1 project | /r/MachineLearning | 10 Dec 2023
  • Delta vs Iceberg: make love not war

    1 project | /r/MicrosoftFabric | 30 Jun 2023
  • Databricks Strikes $1.3B Deal for Generative AI Startup MosaicML

    4 projects | news.ycombinator.com | 26 Jun 2023
  • Medallion/lakehouse architecture data modelling

    1 project | /r/dataengineering | 3 Jun 2023
  • whenNotMatchedBySourceUpdate not existing? Trying to upsert parquet into Delta table

    1 project | /r/apachespark | 10 May 2023

Did you konow that Scala is
the 36th most popular programming language
based on number of metions?