pandas-profiling vs data_algebra
Metric | pandas-profiling | data_algebra
---|---|---
Mentions | 1 | 5
Stars | 8,962 | 113
Growth | - | 0.0%
Activity | 8.5 | 8.5
Last commit | almost 2 years ago | 7 months ago
Language | Python | Python
License | MIT License | BSD 3-clause "New" or "Revised" License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pandas-profiling
- Mito – Excel-like interface for Pandas dataframes in Jupyter notebook
For those who are going through the thread finding new tools: pandas-profiling[0] is a library for automatic EDA (part of what bamboolib[1] does).
[0]: https://github.com/pandas-profiling/pandas-profiling
data_algebra
- Control Pandas, Polars, or SQL from One DSL
- Modern Pandas (Part 2): Method Chaining
There are a number of packages in Python specializing in variations of piped processing in Pandas. My own is this one: https://github.com/WinVector/data_algebra .
- Plotting Multiple Curves in Python
- Siuba – A Dplyr Port to Python
Neat. I've been working on my own "piped-Codd" style system I call the "data algebra" https://github.com/WinVector/data_algebra
I use method chaining as the composing notation.
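To make "method chaining as the composing notation" concrete, here is a minimal pure-Python sketch. It is not data_algebra's actual API: the `Ops` class and its methods (`extend`, `select_rows`, `select_columns`, `transform`) are hypothetical names chosen for illustration. The idea shown is only that each method returns a new pipeline description, so steps compose left to right and the finished pipeline can be applied to data later.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Row = Dict[str, float]
Step = Callable[[List[Row]], List[Row]]


@dataclass(frozen=True)
class Ops:
    """Hypothetical immutable pipeline: an ordered tuple of table transforms."""

    steps: Tuple[Step, ...] = ()

    def extend(self, name: str, fn: Callable[[Row], float]) -> "Ops":
        """Return a new pipeline that adds column `name`, computed per row."""
        step = lambda rows: [{**r, name: fn(r)} for r in rows]
        return Ops(self.steps + (step,))

    def select_rows(self, pred: Callable[[Row], bool]) -> "Ops":
        """Return a new pipeline that keeps only rows where `pred` is true."""
        step = lambda rows: [r for r in rows if pred(r)]
        return Ops(self.steps + (step,))

    def select_columns(self, cols: List[str]) -> "Ops":
        """Return a new pipeline that keeps only the named columns."""
        step = lambda rows: [{c: r[c] for c in cols} for r in rows]
        return Ops(self.steps + (step,))

    def transform(self, rows: List[Row]) -> List[Row]:
        """Apply every recorded step, in order, to `rows`."""
        for step in self.steps:
            rows = step(rows)
        return rows


# Compose the pipeline first; no data is touched until transform() is called.
ops = (
    Ops()
    .extend("ratio", lambda r: r["x"] / r["y"])
    .select_rows(lambda r: r["ratio"] > 1)
    .select_columns(["x", "ratio"])
)

data = [{"x": 4, "y": 2}, {"x": 1, "y": 2}]
result = ops.transform(data)
print(result)
```

Because the pipeline is a plain value that is separate from the data, the same `ops` object can be reused on other tables. A real library could, in principle, translate such a description to different backends, but that part is not sketched here.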
What are some alternatives?
lux - Automatically visualize your pandas dataframe via a single print! 📊 💡
dataiter - Python classes for data manipulation
barfi - Python Flow Based Programming environment that provides a graphical programming environment.
siuba - Python library for using dplyr like syntax with pandas and SQL
qgrid - An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks
mito - The mitosheet package, trymito.io, and other public Mito code.
datasette - An open source multi-tool for exploring and publishing data
chain-ops-python - Simple chaining of operations (a.k.a. pipe operator) in python
python - 🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts using Libraries and Logic. These things everyone should know in their journey with programming.
polars - Dataframes powered by a multithreaded, vectorized query engine, written in Rust
kangas - 🦘 Explore multimedia datasets at scale
ydata-profiling - 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.