piperider
Code review for data in dbt (by InfuseAI)
pandas-profiling
Create HTML profiling reports from pandas DataFrame objects [Moved to: https://github.com/ydataai/pandas-profiling] (by pandas-profiling)
piperider | pandas-profiling | |
---|---|---|
6 | 1 | |
469 | 8,962 | |
0.2% | - | |
9.5 | 8.5 | |
about 2 months ago | almost 2 years ago | |
Python | Python | |
Apache License 2.0 | MIT License |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
piperider
Posts with mentions or reviews of piperider.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-09-06.
- Show HN: PipeRider – open-source Data Impact Analysis for dbt changes
-
Open source data observability tools with UI?
If you post a GitHub issue to request these connectors is might help persuade the product team to add these sooner than later.
-
Data profiling as part of a data reliability strategy?
PS. I'm a bit biased -> I'm working for PipeRider; we're building an open-source data reliability toolkit with profiling at the core: https://github.com/InfuseAI/piperider
-
Show HN: PipeRider, data reliability automated tool
I was rush to Show HN, and now I want to tell a bit more.
PipeRider, it’s our take on a data reliability and quality tool for data pipelines. It’s based on data profiling and assertions that test against the data profile.
It’s open-source and ready to use on Github here: https://github.com/infuseai/piperider
Here is a quick start to get you up and running easily:
pandas-profiling
Posts with mentions or reviews of pandas-profiling.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2022-05-20.
-
Mito – Excel-like interface for Pandas dataframes in Jupyter notebook
For those who are going through the thread finding new tools: pandas-profiling[0] is a library for automatic EDA (part of what bamboolib[1] does).
[0]: https://github.com/pandas-profiling/pandas-profiling
What are some alternatives?
When comparing piperider and pandas-profiling you can also consider the following projects:
soda-sql - Data profiling, testing, and monitoring for SQL accessible data.
lux - Automatically visualize your pandas dataframe via a single print! 📊 💡