Jupyter Notebook Pandas

Open-source Jupyter Notebook projects categorized as Pandas

Top 23 Jupyter Notebook Panda Projects

  1. PythonDataScienceHandbook

    Python Data Science Handbook: full text in Jupyter Notebooks

    Project mention: Python Data Science Handbook | news.ycombinator.com | 2024-05-19
  2. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  3. Data-Science-For-Beginners

    10 Weeks, 20 Lessons, Data Science for All!

  4. pandas_exercises

    Practice your pandas skills!

  5. py

    Repository to store sample python programs for python learning

  6. machine_learning_complete

    A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.

  7. ta

    Technical Analysis Library using Pandas and Numpy

  8. alphalens

    Performance analysis of predictive (alpha) stock factors

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  10. Andrew-NG-Notes

    This is Andrew NG Coursera Handwritten Notes.

  11. jetson-containers

    Machine Learning Containers for NVIDIA Jetson and JetPack-L4T

  12. 100-pandas-puzzles

    100 data puzzles for pandas, ranging from short and simple to super tricky (60% complete)

  13. mito

    Jupyter extensions that help you write code faster: Context aware AI Chat, Autocomplete, and Spreadsheet

    Project mention: Show HN: Excel to Python Compiler | news.ycombinator.com | 2024-05-23

    3. Tables that translate as Pandas dataframes. We support at most one table per sheet, at the tables must be contigious. If the formulas in a column are consistent, then we will try and translate this as a single pandas statement.

    We do not support: pivot tables or complex formulas. When we fail to translate these, we generate TODO statements. We also don’t support graphs or macros - and you won’t see these reflected in the output at all currently.

    *Why we built this:*

    We did YCS20 and built an open source tool called [Mito](https://trymito.io). It’s been a good journey since then - we’ve scaled revenue and to over [2k Github stars](https://github.com/mito-ds/mito). But fundamentally, Mito is a tool that’s useful for Excel users who wanted to start writing Python code more effectively.

    We wanted to take another stab at the Excel -> Python pain point that was more developer focused - that helped developers that have to translate Excel files into Python do this much more quickly. Hence, Pyoneer!

    I’ll be in the comments today if you’ve got feedback, criticism, questions, or comments.

  14. hamilton

    Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

    Project mention: Show HN: I built an open-source data pipeline tool in Go | news.ycombinator.com | 2024-12-17

    I always thought Hamilton [1] does a good job of giving enough visual hooks that draw you in.

    I also noticed this pattern where library authors sometimes do a bit extra in terms of discussing and even promoting their competitors, and it makes me trust them more. A “heres why ours is better and everyone else sucks …” section always comes across as the infomercial character who is having quite a hard time peeling an apple to the point you wonder if this the first time they’ve used hands.

    One thing wish for is a tool that’s essentially just Celery that doesn’t require a message broker (and can just use a database), and which is supported on Windows. There’s always a handful of edge cases where we’re pulling data from an old 32-bit system on Windows. And basically every system has some not-quite-ergonomic workaround that’s as much work as if you’d just built it yourself.

    It seems like it’s just sending a JSON message over a queue or HTTP API and the worker receives it and runs the task. Maybe it’s way harder than I’m envisioning (but I don’t think so because I’ve already written most of it).

    I guess that’s one thing I’m not clear on with Bruin, can I run workers if different physical locations and have them carry out the tasks in the right order? Or is this more of a centralized thing (meaning even if its K8s or Dask or Ray, those are all run in a cluster which happens to be distributed, but they’re all machines sitting in the same subnet, which isn’t the definition of a “distributed task” I’m going for.

    [1] https://github.com/DAGWorks-Inc/hamilton

  15. fecon235

    Notebooks for financial economics. Keywords: Jupyter notebook pandas Federal Reserve FRED Ferbus GDP CPI PCE inflation unemployment wage income debt Case-Shiller housing asset portfolio equities SPX bonds TIPS rates currency FX euro EUR USD JPY yen XAU gold Brent WTI oil Holt-Winters time-series forecasting statistics econometrics

  16. project-walkthroughs

    Data science, machine learning, and web development project code for https://www.youtube.com/c/Dataquestio .

  17. code

    Compilation of R and Python programming codes on the Data Professor YouTube channel. (by dataprofessor)

  18. pdpipe

    Easy pipelines for pandas DataFrames.

  19. 100-days-of-code-python

    100 Days of Code: The Complete Python Pro Bootcamp

  20. kglab

    Graph Data Science: an abstraction layer in Python for building knowledge graphs, integrated with popular graph libraries – atop Pandas, NetworkX, RAPIDS, RDFlib, pySHACL, PyVis, morph-kgc, pslpython, pyarrow, etc.

  21. ydata-quality

    Data Quality assessment with one line of code

  22. Python-Roadmap

    Python Roadmap. Learn Python programming as your first programming language. Python for Absolute Beginners, Non-Tech Professionals, 15+ Projects, 30 Topics, 500+ Practice Questions, with Data Structures & Algorithms

  23. awesome-data-centric-ai

    Open-Source Software, Tutorials, and Research on Data-Centric AI 🤖

  24. tempo

    API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation (by databrickslabs)

  25. feature-engineering-tutorials

    Data Science Feature Engineering and Selection Tutorials

  26. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Jupyter Notebook Pandas discussion

Log in or Post with

Jupyter Notebook Pandas related posts

  • Show HN: Create Data Visualization with Data Formulator from Microsoft Research

    4 projects | news.ycombinator.com | 21 Oct 2024
  • Welcome to 14 days of Data Science!

    1 project | dev.to | 7 Mar 2024
  • Financial Economics: Financial Economics Models. Extended Research - star count:1033.0

    1 project | /r/algoprojects | 10 Dec 2023
  • Performance Analysis: Performance analysis of predictive (alpha) stock factors. Factor and Risk Analysis - star count:2892.0

    1 project | /r/algoprojects | 9 Dec 2023
  • Performance Analysis: Performance analysis of predictive (alpha) stock factors. Factor and Risk Analysis - star count:2892.0

    1 project | /r/algoprojects | 8 Dec 2023
  • Performance Analysis: Performance analysis of predictive (alpha) stock factors. Factor and Risk Analysis - star count:2892.0

    1 project | /r/algoprojects | 7 Dec 2023
  • Performance Analysis: Performance analysis of predictive (alpha) stock factors. Factor and Risk Analysis - star count:2892.0

    1 project | /r/algoprojects | 7 Dec 2023
  • A note from our sponsor - SaaSHub
    www.saashub.com | 19 Mar 2025
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source Panda projects in Jupyter Notebook? This list will help you:

# Project Stars
1 PythonDataScienceHandbook 44,078
2 Data-Science-For-Beginners 29,021
3 pandas_exercises 11,142
4 py 6,988
5 machine_learning_complete 4,671
6 ta 4,541
7 alphalens 3,518
8 Andrew-NG-Notes 2,851
9 jetson-containers 2,798
10 100-pandas-puzzles 2,639
11 mito 2,356
12 hamilton 2,053
13 fecon235 1,148
14 project-walkthroughs 1,001
15 code 959
16 pdpipe 717
17 100-days-of-code-python 773
18 kglab 614
19 ydata-quality 434
20 Python-Roadmap 383
21 awesome-data-centric-ai 331
22 tempo 320
23 feature-engineering-tutorials 283

Sponsored
CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai

Did you know that Jupyter Notebook is
the 13th most popular programming language
based on number of references?