plydata
A grammar for data manipulation in Python (by has2k1)
pipda
A framework for data piping in python (by pwwang)
plydata | pipda | |
---|---|---|
2 | 1 | |
274 | 35 | |
- | - | |
0.0 | 6.2 | |
8 months ago | 7 months ago | |
Python | Python | |
BSD 3-clause "New" or "Revised" License | MIT License |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
plydata
Posts with mentions or reviews of plydata.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-06-25.
-
datar: the dplyr in python
from datar import f from datar.dplyr import mutate, filter, if_else from datar.tibble import tibble # or # from datar.all import f, mutate, filter, if_else, tibble df = tibble( x=range(4), y=['zero', 'one', 'two', 'three'] ) df >> mutate(z=f.x) """# output x y z 0 0 zero 0 1 1 one 1 2 2 two 2 3 3 three 3 """ df >> mutate(z=if_else(f.x>1, 1, 0)) """# output: x y z 0 0 zero 0 1 1 one 0 2 2 two 1 3 3 three 1 """ df >> filter(f.x>1) """# output: x y 0 2 two 1 3 three """ df >> mutate(z=if_else(f.x>1, 1, 0)) >> filter(f.z==1) """# output: x y z 0 2 two 1 1 3 three 1 """ Works with plotnine # example grabbed from https://github.com/has2k1/plydata import numpy from datar.base import sin, pi from plotnine import ggplot, aes, geom_line, theme_classic df = tibble(x=numpy.linspace(0, 2*pi, 500)) (df >> mutate(y=sin(f.x), sign=if_else(f.y>=0, "positive", "negative")) >> ggplot(aes(x='x', y='y')) + theme_classic() + geom_line(aes(color='sign'), size=1.2)) https://preview.redd.it/w0hs4m8fyf771.png?width=697&format=png&auto=webp&s=eadd7473a9e3393c2d58531c0b2b12f849c27e5e Easy to integrate with other libraries import klib from pipda import register_verb from datar.datasets import iris from datar.dplyr import pull dist_plot = register_verb(func=klib.dist_plot) iris >> pull(f.Sepal_Length) >> dist_plot() https://preview.redd.it/w8b8ouagyf771.png?width=892&format=png&auto=webp&s=3cc8f04e63be710f593b2b6128073f65cf7ffaa4 For more detailed and advanced usage, see https://pwwang.github.io/datar/
df = tibble( x=range(4), y=['zero', 'one', 'two', 'three'] ) df >> mutate(z=f.x) """# output x y z 0 0 zero 0 1 1 one 1 2 2 two 2 3 3 three 3 """ df >> mutate(z=if_else(f.x>1, 1, 0)) """# output: x y z 0 0 zero 0 1 1 one 0 2 2 two 1 3 3 three 1 """ df >> filter(f.x>1) """# output: x y 0 2 two 1 3 three """ df >> mutate(z=if_else(f.x>1, 1, 0)) >> filter(f.z==1) """# output: x y z 0 2 two 1 1 3 three 1 """ ``` Works with plotnine ```python example grabbed from https://github.com/has2k1/plydata import numpy from datar.base import sin, pi from plotnine import ggplot, aes, geom_line, theme_classic df = tibble(x=numpy.linspace(0, 2*pi, 500)) (df >> mutate(y=sin(f.x), sign=if_else(f.y>=0, "positive", "negative")) >> ggplot(aes(x='x', y='y')) + theme_classic() + geom_line(aes(color='sign'), size=1.2)) ``` ![plotnine](https://github.com/pwwang/datar/raw/master/example.png) Easy to integrate with other libraries ```python import klib from pipda import register_verb from datar.datasets import iris from datar.dplyr import pull dist_plot = register_verb(func=klib.dist_plot) iris >> pull(f.Sepal_Length) >> dist_plot() ``` ![klib](https://github.com/pwwang/datar/raw/master/example2.png) For more detailed and advanced usage, see https://pwwang.github.io/datar/
pipda
Posts with mentions or reviews of pipda.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-06-25.
-
datar: the dplyr in python
I wrote a framework (https://github.com/pwwang/pipda) that fits this situation, and it makes me easier to port those APIs as they are in python. I am not only following the documentation of the original APIs but looking into the R source code of them so that I can recover most parts of them. I wouldn't say it's perfect, due to the difference between the languages, but I would say it the closest and most covered port of dplyr/tidyr and related packages in python.
What are some alternatives?
When comparing plydata and pipda you can also consider the following projects:
Pandas - Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
datar - A Grammar of Data Manipulation in python
Dask - Parallel computing with task scheduling
brain-brew - Automated Anki flashcard creation and extraction to/from Csv