anovos
Apache-Spark-Guide
anovos | Apache-Spark-Guide | |
---|---|---|
1 | 2 | |
77 | 28 | |
- | - | |
0.0 | 1.8 | |
about 1 year ago | over 2 years ago | |
Jupyter Notebook | Python | |
GNU General Public License v3.0 or later | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
anovos
Apache-Spark-Guide
What are some alternatives?
Optimus - :truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
pyspark-example-project - Implementing best practices for PySpark ETL jobs and applications.
Hyperactive - An optimization and data collection toolbox for convenient and fast prototyping of computationally expensive models.
Traffic-Data-Analysis-with-Apache-Spark-Based-on-Mobile-Robot-Data - Mobile robot data were analyzed with Apache-Spark to extract five different statistical result such as travel time, waiting time, average speed, occupancy and density were produced.
feast - Feature Store for Machine Learning
livyc - Apache Spark as a Service with Apache Livy Client
project-atlas-sao-paulo - A project for the development of rich geospatial data from the city of São Paulo for use in Machine Learning models.
pyspark-tutorial - PySpark Tutorial for Beginners - Practical Examples in Jupyter Notebook with Spark version 3.4.1. The tutorial covers various topics like Spark Introduction, Spark Installation, Spark RDD Transformations and Actions, Spark DataFrame, Spark SQL, and more. It is completely free on YouTube and is beginner-friendly without any prerequisites.
Patek - A collection of reusable pyspark utility functions that help make development easier!