Python Pandas

Open-source Python projects categorized as Pandas

Top 23 Python Panda Projects

  1. 30-Days-Of-Python

    30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than100 days, follow your own pace. These videos may help too: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw

    Project mention: 17 Best GitHub Repositories to Learn Python | dev.to | 2025-02-06

    30-Days-Of-Python

  2. Sevalla

    Deploy and host your apps and databases, now with $50 credit! Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!

    Sevalla logo
  3. Pandas

    Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

    Project mention: Writing memory efficient C structs | news.ycombinator.com | 2025-07-30

    https://github.com/pandas-dev/pandas/issues/58062 :

    > On disk Parquet appears to store the category data as logical type String which is compressed with snappy and encoded

    Arrow Flight RPC handles nested structs with enums over the wire somehow too FWIU

  4. tqdm

    :zap: A Fast, Extensible Progress Bar for Python and CLI

    Project mention: Tqdm (Python Progress Bar) | news.ycombinator.com | 2025-03-30
  5. data-science-ipython-notebooks

    Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  6. pandas-ai

    Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

    Project mention: Pandas AI | news.ycombinator.com | 2025-07-18
  7. datasets

    πŸ€— The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

    Project mention: Training with Big Data on Any Cloud | dev.to | 2025-06-20

    Hugging Face Datasets -- the library that lets you download and manage datasets from the Hugging Face Hub, as well as being a convenient vendor-neutral interface for your own datasets.

  8. yfinance

    Download market data from Yahoo! Finance's API

  9. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  10. pygwalker

    PyGWalker: Turn your dataframe into an interactive UI for visual analysis

    Project mention: The DuckDB Local UI | news.ycombinator.com | 2025-03-12
  11. Dask

    Parallel computing with task scheduling

  12. seaborn

    Statistical data visualization in Python

    Project mention: How I Hacked Uber’s Hidden API to Download 4379 Rides | dev.to | 2025-04-09

    Below are the key insights. If you want to see the Python code I used to do this analysis and generate the charts using Seaborn, you can find my full analysis Jupyter notebook on my Github repo here: Tip Analysis.ipynb

  13. ydata-profiling

    1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

    Project mention: The DuckDB Local UI | news.ycombinator.com | 2025-03-12

    WhatTheDuck does SQL with duckdb-wasm IIRC

    Pygwalker does open-source descriptive statistics and charts from pandas dataframes: https://github.com/Kanaries/pygwalker

    ydata-profiling does Exploratory Data Analysis (EDA) with Pandas and Spark DataFrames and integrates with various apps: https://github.com/ydataai/ydata-profiling

  14. modin

    Modin: Scale your Pandas workflows by changing a single line of code

  15. mlcourse.ai

    Open Machine Learning Course

  16. visidata

    A terminal spreadsheet multitool for discovering and arranging data

  17. ibis

    the portable Python dataframe library

    Project mention: Why Pandas feels clunky when coming from R (2024) | news.ycombinator.com | 2025-06-07

    pandas* per the style guide (nobody follows it)

    also I recommend trying Ibis. created by the creator of pandas originally and solves so many of the issues

    https://ibis-project.org

  18. orange

    🍊 :bar_chart: :bulb: Orange: Interactive data analysis

  19. lux

    Automatically visualize your pandas dataframe via a single print! πŸ“Š πŸ’‘ (by lux-org)

  20. geopandas

    Python tools for geographic data

    Project mention: Rivian GeoLocation Plotting with IRIS Cloud Document and Databricks | dev.to | 2024-12-26

    We are using geopandas and geodatasets for a straight forward approach to plotting.

  21. Mimesis

    Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.

    Project mention: Mimesis: The Fake Data Generator That Will Blow Your Mind! | dev.to | 2025-05-08

    View the Project on GitHub

  22. alpha_vantage

    A python wrapper for Alpha Vantage API for financial data.

  23. pytorch-forecasting

    Time series forecasting with PyTorch

  24. missingno

    Missing data visualization module for Python.

  25. AWS Data Wrangler

    pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

  26. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Pandas discussion

Log in or Post with

Python Pandas related posts

Index

What are some of the best open-source Panda projects in Python? This list will help you:

# Project Stars
1 30-Days-Of-Python 48,905
2 Pandas 46,447
3 tqdm 30,339
4 data-science-ipython-notebooks 28,284
5 pandas-ai 21,924
6 datasets 20,575
7 yfinance 18,847
8 pygwalker 15,122
9 Dask 13,444
10 seaborn 13,404
11 ydata-profiling 13,117
12 modin 10,263
13 mlcourse.ai 10,180
14 visidata 8,429
15 ibis 6,065
16 orange 5,319
17 lux 5,282
18 geopandas 4,861
19 Mimesis 4,612
20 alpha_vantage 4,573
21 pytorch-forecasting 4,455
22 missingno 4,122
23 AWS Data Wrangler 4,054

Sponsored
Deploy and host your apps and databases, now with $50 credit!
Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!
sevalla.com

Did you know that Python is
the 2nd most popular programming language
based on number of references?