Jupyter Notebook ETL

Open-source Jupyter Notebook projects categorized as ETL

Top 3 Jupyter Notebook ETL Projects

  • hamilton

    Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.

  • Project mention: Building an Email Assistant Application with Burr | dev.to | 2024-04-26

    Note that this uses simple OpenAI calls — you can replace this with Langchain, LlamaIndex, Hamilton (or something else) if you prefer more abstraction, and delegate to whatever LLM you like to use. And, you should probably use something a little more concrete (E.G. instructor) to guarantee output shape.

  • ghcn-d

    Data Pipeline from the Global Historical Climatology Network DataSet

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • udacity_bike_share_datalake_project

    Azure Data Lake

  • Project mention: Unveiling the Azure Data Lake for Bike Share Data Analytics | dev.to | 2023-10-11

    You can find the code related to this project in my GitHub repository.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Index

What are some of the best open-source ETL projects in Jupyter Notebook? This list will help you:

Project Stars
1 hamilton 1,312
2 ghcn-d 21
3 udacity_bike_share_datalake_project 0

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com