pyjanitor
100-pandas-puzzles
pyjanitor | 100-pandas-puzzles | |
---|---|---|
4 | 6 | |
1,287 | 2,209 | |
1.6% | - | |
8.3 | 0.0 | |
2 days ago | 6 days ago | |
Python | Jupyter Notebook | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pyjanitor
- Sub library with useful code
-
This Week In Python
pyjanitor – Clean APIs for data cleaning. Python implementation of R package Janitor
- Cleaning up panda dataframe calls
- how important are learning the data manipulation libraries?
100-pandas-puzzles
-
What are the best Python libraries to learn for beginners?
#1: Welcome to df[pandas]! #2: 100 data puzzles for pandas, ranging from short and simple to super tricky | 3 comments #3: Happy Halloween, Pandas! 🎃🤓 | 0 comments
- 100 data puzzles for pandas, ranging from short and simple to super tricky
-
pandas practice resources?
I remember someone sharing this with me earlier: https://github.com/ajcr/100-pandas-puzzles Let me know if you think it's comprehensive and a good resource.
-
how important are learning the data manipulation libraries?
If you want to get better with pandas specifically you could work through the 100 pandas puzzles repo in your spare time, https://github.com/ajcr/100-pandas-puzzles
- Can anyone recommend resources to prepare for Pandas and Numpy interview questions?
- Is there anything AoC-like for Machine Learning or Data Science?
What are some alternatives?
modin - Modin: Scale your Pandas workflows by changing a single line of code
numpy-100 - 100 numpy exercises (with solutions)
pandas-datareader - Extract data from a wide range of Internet sources into a pandas DataFrame.
tempo - API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation
pdpipe - Easy pipelines for pandas DataFrames.
pandas_exercises - Practice your pandas skills!
Dask - Parallel computing with task scheduling
idx2numpy_array - Convert data in IDX format in MNIST Dataset to Numpy Array using Python
QuickSQLConnector - SQL in one line
RasgoQL - Write python locally, execute SQL in your data warehouse
cookiecutter-python-library - A Cookiecutter Template for Modern Python Libraries
tempo - Grafana Tempo is a high volume, minimal dependency distributed tracing backend.