datar
sspipe
datar | sspipe | |
---|---|---|
4 | 1 | |
255 | 145 | |
- | 0.0% | |
7.4 | 0.0 | |
about 2 months ago | almost 2 years ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
datar
- Difficulty transitioning between R and Python?
- What would it take to recreate dplyr in Python?
-
datar: the dplyr in python
datar does not only mimic the piping syntax, but follows the API design from dplyr as much as possible, and is tested with its test cases.
df = tibble( x=range(4), y=['zero', 'one', 'two', 'three'] ) df >> mutate(z=f.x) """# output x y z 0 0 zero 0 1 1 one 1 2 2 two 2 3 3 three 3 """ df >> mutate(z=if_else(f.x>1, 1, 0)) """# output: x y z 0 0 zero 0 1 1 one 0 2 2 two 1 3 3 three 1 """ df >> filter(f.x>1) """# output: x y 0 2 two 1 3 three """ df >> mutate(z=if_else(f.x>1, 1, 0)) >> filter(f.z==1) """# output: x y z 0 2 two 1 1 3 three 1 """ ``` Works with plotnine ```python example grabbed from https://github.com/has2k1/plydata import numpy from datar.base import sin, pi from plotnine import ggplot, aes, geom_line, theme_classic df = tibble(x=numpy.linspace(0, 2*pi, 500)) (df >> mutate(y=sin(f.x), sign=if_else(f.y>=0, "positive", "negative")) >> ggplot(aes(x='x', y='y')) + theme_classic() + geom_line(aes(color='sign'), size=1.2)) ``` ![plotnine](https://github.com/pwwang/datar/raw/master/example.png) Easy to integrate with other libraries ```python import klib from pipda import register_verb from datar.datasets import iris from datar.dplyr import pull dist_plot = register_verb(func=klib.dist_plot) iris >> pull(f.Sepal_Length) >> dist_plot() ``` ![klib](https://github.com/pwwang/datar/raw/master/example2.png) For more detailed and advanced usage, see https://pwwang.github.io/datar/
sspipe
What are some alternatives?
pipda - A framework for data piping in python
siuba - Python library for using dplyr like syntax with pandas and SQL
plydata - A grammar for data manipulation in Python
tsflex - Flexible time series feature extraction & processing
kindleServer - This project serve HTML files (and a few more) saved in your computer with a UI suitable for Kindle web browser. On top of that, it include a Read Mode (thanks to ReadabiliPy) to display the text in a comfortable size without have to use the 'Article Mode' in Kindle web browser.
concrete-numpy - Concrete-Numpy: A library to turn programs into their homomorphic equivalent.
seaborn - Statistical data visualization in Python
data-science-ipython-notebooks - Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Dask - Parallel computing with task scheduling
textstat - :memo: python package to calculate readability statistics of a text object - paragraphs, sentences, articles.
orange - 🍊 :bar_chart: :bulb: Orange: Interactive data analysis
mars - Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.