Python Dataframe

Open-source Python projects categorized as Dataframe

Top 19 Python Dataframe Projects

  1. pygwalker

    PyGWalker: Turn your dataframe into an interactive UI for visual analysis

    Project mention: The DuckDB Local UI | news.ycombinator.com | 2025-03-12

  2. modin

    Modin: Scale your Pandas workflows by changing a single line of code

  3. vaex

    Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

  4. Mimesis

    Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.

    Project mention: Mimesis: The Fake Data Generator That Will Blow Your Mind! | dev.to | 2025-05-08

  5. koalas

    Koalas: pandas API on Apache Spark

  6. PandasGUI

    A GUI for Pandas DataFrames

  7. mars

    Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.

  8. sketch

    AI code-writing assistant that understands data content

  9. pyjanitor

    Clean APIs for data cleaning. Python implementation of R package Janitor

  10. optopsy

    A nimble options backtesting library for Python

  11. technical

    Various indicators developed or collected for Freqtrade

  12. eland

    Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch

  13. pandastable

    Table analysis in Tkinter using pandas DataFrames.

  14. pystore

    Fast data store for Pandas time-series data

  15. snowpark-python

    Snowflake Snowpark Python API

  16. tablexplore

    Table analysis and plotting application written in PySide2/PyQt5

  17. polars-st

    Spatial extension for Polars DataFrames.

    Project mention: DuckDB is probably the most important geospatial software of the last decade | news.ycombinator.com | 2025-05-03

    Chiming in to promote a similar project, a geospatial extension for Polars [1] I'm working on. It's not stable yet (albeit pretty close), but is already pretty feature complete (it uses GEOS as a backend, so has parity with GeoPandas).

    [1] https://github.com/oreilles/polars-st/

  18. frame-fixtures

    Use compact expressions to create diverse, deterministic DataFrame fixtures with StaticFrame

  19. LLMWorkbook

    Effortlessly harness the power of LLMs on Excel and DataFrames: seamless, smart, and efficient!

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Index

What are some of the best open-source Dataframe projects in Python? This list will help you:

#   Project          Stars
1   pygwalker        14,948
2   modin            10,193
3   vaex              8,393
4   Mimesis           4,589
5   koalas            3,360
6   PandasGUI         3,231
7   mars              2,718
8   sketch            2,257
9   pyjanitor         1,429
10  optopsy           1,129
11  technical           885
12  eland               678
13  pandastable         650
14  pystore             578
15  snowpark-python     301
16  tablexplore         138
17  polars-st            96
18  frame-fixtures        8
19  LLMWorkbook           5
