Scala Python

Open-source Scala projects categorized as Python

Top 10 Scala Python Projects

  • Apache Spark

    Apache Spark - A unified analytics engine for large-scale data processing

  • Project mention: Shades of Open Source - Understanding The Many Meanings of "Open" | dev.to | 2024-06-15

    In contrast, Databricks maintains internal forks of Spark, Delta Lake, and Unity Catalog, using the same names for both the open-source versions and the features specific to the Databricks platform. While they do provide separate documentation, online discussions often reflect confusion about how to use features in the open-source versions that only exist on the Databricks platform. This creates a "muddying of the waters" between what is open and what is proprietary. This isn't an issue if you are a Databricks user, but it can be quite confusing for those who want to use these tools outside of the Databricks ecosystem.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • mleap

    MLeap: Deploy ML Pipelines to Production

  • Cortex

    Cortex: a Powerful Observable Analysis and Active Response Engine (by TheHive-Project)

  • adam

    ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.

  • sparkMeasure

    This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination of Spark metrics, making it a practical choice for both developers and data engineers.

  • scalapy

    Use the world of Python from the comfort of Scala!

  • Vyxal

    A code-golfing language experience that has aspects of traditional programming languages - terse, elegant, readable.

  • Project mention: Vyxal: A code-golfing language experience | news.ycombinator.com | 2024-02-28
  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • spark-extension

    A library that provides useful extensions to Apache Spark and PySpark.

  • Project mention: Data diffs: Algorithms for explaining what changed in a dataset (2022) | news.ycombinator.com | 2023-07-26

    We're doing a env migration and I've been using spark diff extension for reconcile data, it's amazing, we've discover bugs in the data logic so quickly,

    here is the extension if anyone is interested https://github.com/G-Research/spark-extension/blob/master/DI...

  • kukulcan

    A REPL for Apache Kafka

  • stasis

    Backup and recovery system with emphasis on security and privacy (by sndnv)

  • Project mention: ⟳ 1 apps added, 3 updated at apt.izzysoft.de | /r/FDroidUpdates | 2023-10-25

    stasis: Backup and recovery system with emphasis on security and privacy

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Scala Python discussion

Log in or Post with

Scala Python related posts

  • "xAI will open source Grok"

    3 projects | news.ycombinator.com | 11 Mar 2024
  • Apache Spark VS quix-streams - a user suggested alternative

    2 projects | 7 Dec 2023
  • Semver violations are common, better tooling is the answer

    7 projects | news.ycombinator.com | 7 Sep 2023
  • Integrate Pyspark Structured Streaming with confluent-kafka

    2 projects | dev.to | 12 Aug 2023
  • Spark – A micro framework for creating web applications in Kotlin and Java

    1 project | news.ycombinator.com | 16 Jun 2023
  • Rest in Peas: The Unrecognized Death of Speech Recognition (2010)

    4 projects | news.ycombinator.com | 4 May 2023
  • PySpark SparkSession Builder with Kubernetes Master

    1 project | /r/codehunter | 20 Apr 2023
  • A note from our sponsor - SaaSHub
    www.saashub.com | 16 Jun 2024
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source Python projects in Scala? This list will help you:

Project Stars
1 Apache Spark 38,703
2 mleap 1,498
3 Cortex 1,268
4 adam 966
5 sparkMeasure 666
6 scalapy 539
7 Vyxal 262
8 spark-extension 174
9 kukulcan 115
10 stasis 30

Sponsored
Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com