Pyspark-example-project Alternatives

Similar projects and alternatives to pyspark-example-project

soda-spark

1 1 64 0.0 Python pyspark-example-project VS soda-spark

Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes
TypedPyspark

2 1 14 2.4 Python pyspark-example-project VS TypedPyspark

Type-annotate your spark dataframes and validate them
Apache-Spark-Guide

3 2 31 1.8 Python pyspark-example-project VS Apache-Spark-Guide

Apache Spark Guide
patterns-devkit

4 5 108 2.9 Python pyspark-example-project VS patterns-devkit

Data pipelines from re-usable components
workshop-realtime-data-pipelines

5 1 3 2.3 Python pyspark-example-project VS workshop-realtime-data-pipelines

You will inspect and run a sample architecture making use of Apache Pulsar™ and Pulsar Functions for real-time, event-streaming-based data ingestion, cleaning and processing.
dados-censup

6 1 6 4.9 Python pyspark-example-project VS dados-censup

Discontinued. Automated ingestion of data published by INEP on the Brazilian higher education census.
Spooq

7 1 9 7.9 Python pyspark-example-project VS Spooq
hamilton

8 25 2,250 9.1 Jupyter Notebook pyspark-example-project VS hamilton

Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage/tracing and metadata. Runs and scales everywhere Python does.
etl-markup-toolkit

9 7 5 0.0 Python pyspark-example-project VS etl-markup-toolkit

Discontinued. ETL Markup Toolkit is a Spark-native tool for expressing ETL transformations as configuration
Mage

10 79 8,454 9.4 Python pyspark-example-project VS Mage

🧙 The modern replacement for Airflow. Mage is an open-source data pipeline tool for transforming and integrating data. https://github.com/mage-ai/mage-ai

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a better pyspark-example-project alternative or higher similarity.

pyspark-example-project discussion

pyspark-example-project reviews and mentions

Posts with mentions or reviews of pyspark-example-project. We have used some of these posts to build our list of alternatives and similar projects.

Learning Pyspark for a new role
1 project | /r/dataengineering | 23 Dec 2022

https://github.com/AlexIoannides/pyspark-example-project You can use this as an example to organize your project. I have referred to this in the past.

Stats

Basic pyspark-example-project repo stats

Mentions

Stars

1,944

Activity

0.0

Last Commit

over 2 years ago

The primary programming language of pyspark-example-project is Python.