siuba
data_algebra
Our great sponsors
siuba | data_algebra | |
---|---|---|
25 | 5 | |
1,100 | 113 | |
- | 0.9% | |
7.5 | 8.5 | |
7 months ago | 6 months ago | |
Python | Python | |
MIT License | BSD 3-clause "New" or "Revised" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
siuba
- The Design Philosophy of Great Tables (Software Package)
-
Best alternative to Pandas 2023?
I don't know what's best for you, but I can recommend Siuba, a tidy interface for Python to send queries to pandas and SQL-db.
- Method Chaining in Pandas: Bad Form or a Recipe for Success?
-
Happy Halloween, Pandas! 🎃🤓
You mean siuba?
-
Explorer (Elixir and Polars)
For further inspiration, this is a pretty good-looking "dplyr for Python": https://github.com/machow/siuba
- Unpopular opinion: Matplotlib is a bad library
- A trick to have arbitrary infix operators in Python
-
Going from R to Pandas: dplython vs dfply vs plydata
You should follow /u/the75th's advice. However, if you decide to buck that take, I'd look into siuba. I've never heard of those packages you've listed, and have doubts they'd be maintained.
- Tidyverse equivalent in Python?
-
R / Tidyverse User -> Python | How to Make it Hurt Less
Check out siuba
data_algebra
- Control Pandas, Polars, or SQL from One DSL
-
Modern Pandas (Part 2): Method Chaining
There are a number of packages in Python specializing in variations of piped processing in Pandas. My own is this one: https://github.com/WinVector/data_algebra .
- Plotting Multiple Curves in Python
-
Siuba – A Dplyr Port to Python
Neat. I've been working on my own "piped-Codd" style system I call the "data algebra" https://github.com/WinVector/data_algebra
I use method chaining as the composing notation.
What are some alternatives?
Pandas - Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
dataiter - Python classes for data manipulation
dtale - Visualizer for pandas data structures
mito - The mitosheet package, trymito.io, and other public Mito code.
Altair - Declarative statistical visualization library for Python
pandas-profiling - Create HTML profiling reports from pandas DataFrame objects [Moved to: https://github.com/ydataai/pandas-profiling]
q - q - Run SQL directly on delimited files and multi-file sqlite databases
chain-ops-python - Simple chaining of operations (a.k.a. pipe operator) in python
vinum - Vinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics.
polars - Dataframes powered by a multithreaded, vectorized query engine, written in Rust
DataFramesMeta.jl - Metaprogramming tools for DataFrames
ydata-profiling - 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.