pyspark-example-project

Implementing best practices for PySpark ETL jobs and applications. (by AlexIoannides)

Pyspark-example-project Alternatives

Similar projects and alternatives to pyspark-example-project

  1. soda-spark

    Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes

  2. Judoscale

    Save 47% on cloud hosting with autoscaling that just works. Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.

    Judoscale logo
  3. patterns-devkit

    Data pipelines from re-usable components

  4. TypedPyspark

    Type-annotate your spark dataframes and validate them

  5. workshop-realtime-data-pipelines

    You will inspect and run a sample architecture making use of Apache Pulsar™ and Pulsar Functions for real-time, event-streaming-based data ingestion, cleaning and processing.

  6. dados-censup

    Discontinued Automação da ingestão de dados disponibilizados pelo INEP referente ao censo superior da educacão brasileira.

  7. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  8. Mage

    🧙 The modern replacement for Airflow. Mage is an open-source data pipeline tool for transforming and integrating data. https://github.com/mage-ai/mage-ai

  9. hamilton

    24 pyspark-example-project VS hamilton

    Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

  10. etl-markup-toolkit

    Discontinued ETL Markup Toolkit is a spark-native tool for expressing ETL transformations as configuration

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better pyspark-example-project alternative or higher similarity.

pyspark-example-project discussion

Log in or Post with

pyspark-example-project reviews and mentions

Posts with mentions or reviews of pyspark-example-project. We have used some of these posts to build our list of alternatives and similar projects.

Stats

Basic pyspark-example-project repo stats
1
1,860
0.0
over 2 years ago

Sponsored
Save 47% on cloud hosting with autoscaling that just works
Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.
judoscale.com

Did you know that Python is
the 2nd most popular programming language
based on number of references?