Jupyter Notebook Pyspark

Open-source Jupyter Notebook projects categorized as Pyspark

Top 14 Jupyter Notebook Pyspark Projects

  • Gather-Deployment

    Gathers Python deployment, infrastructure and practices.

  • WallStreetBets_BigDataAnalysis

    Research project aimed to classify the best stock research posts from r/WallStreetBets for you. ๐Ÿ˜

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • anovos

    Anovos - An Open Source Library for Scalable feature engineering Using Apache-Spark

  • pyspark-tutorial

    PySpark Tutorial for Beginners - Practical Examples in Jupyter Notebook with Spark version 3.4.1. The tutorial covers various topics like Spark Introduction, Spark Installation, Spark RDD Transformations and Actions, Spark DataFrame, Spark SQL, and more. It is completely free on YouTube and is beginner-friendly without any prerequisites. (by coder2j)

  • Project mention: PySpark Tutorial for Beginners: 1-Hour Full Course | /r/apachespark | 2023-10-11

    Watch it now ๐Ÿ‘‰ https://youtu.be/EB8lfdxpirM GitHub Repo ๐Ÿ‘‰ https://github.com/coder2j/pyspark-tutorial

  • lasagna

    A Docker Compose template that builds a interactive development environment for PySpark with Jupyter Lab, MinIO as object storage, Hive Metastore, Trino and Kafka

  • Project mention: FLaNK Stack Weekly for 20 Nov 2023 | dev.to | 2023-11-20
  • ESG-AI-investment-by-streamlit

    ESG-investment AI

  • reddit-streaming

    streaming eight subreddits from reddit api using kafka producer & spark structured streaming.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • pyspark_nlp_workshop

    Instructions and code for the workshop "From Big Data to NLP Insights: Unlocking the Power of PySpark and Spark NLP"

  • Project mention: PySpark for NLP workshop โ€“ Jupyter notebooks and instructions | news.ycombinator.com | 2023-05-14
  • project-atlas-sao-paulo

    A project for the development of rich geospatial data from the city of Sรฃo Paulo for use in Machine Learning models.

  • workshop-introduction-to-machine-learning

    Come ready to discover the goals and approaches of machine learning, and how to build effective algorithms and solutions!

  • project

    Predict how many points an European football team will end the season with, according to the characteristics of its players. Project for the Big Data Computing course at Sapienza University of Rome (2021-22) (by Big-Data-FC)

  • synapse-azure-data-explorer-101

    Getting started with Azure Synapse and Azure Data Explorer

  • file-format-benchmark

    benchmark script of key operations between different file formats

  • Project mention: Different file formats, a benchmark doing basic operations | dev.to | 2024-03-10

    file-format-benchmark: benchmark script of key operations between different file formats

  • dracula

    a brief analysis to the most common words in Dracula, by Bram Stoker (by geazi-anc)

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Jupyter Notebook Pyspark related posts

Index

What are some of the best open-source Pyspark projects in Jupyter Notebook? This list will help you:

Project Stars
1 Gather-Deployment 350
2 WallStreetBets_BigDataAnalysis 165
3 anovos 77
4 pyspark-tutorial 29
5 lasagna 27
6 ESG-AI-investment-by-streamlit 21
7 reddit-streaming 18
8 pyspark_nlp_workshop 12
9 project-atlas-sao-paulo 9
10 workshop-introduction-to-machine-learning 7
11 project 6
12 synapse-azure-data-explorer-101 4
13 file-format-benchmark 2
14 dracula 0

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com