cuallee
Possibly the fastest DataFrame-agnostic quality check library in town. (by canimus)
polars-xdt
Polars plugin offering eXtra stuff for DateTimes (by pola-rs)
Metric | cuallee | polars-xdt |
---|---|---|
Mentions | 5 | 1 |
Stars | 107 | 152 |
Growth | - | 10.5% |
Activity | 9.0 | 9.7 |
Last commit | 6 days ago | about 1 month ago |
Language | Python | Python |
License | Apache License 2.0 | MIT License |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
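To make the two derived numbers concrete, here is a minimal sketch of how they could be read. It assumes the activity score sits on a linear 0 to 10 scale (so 9.0 maps to the top 10%, as in the example above) and that growth is the plain percentage change in stars from one month to the next. The function names and the star counts in the growth example are hypothetical; the site's actual formulas are not published in this page.

```python
def top_percent(activity: float) -> float:
    """Approximate 'top X%' bracket for an activity score.

    Assumption: the score is linear on a 0-10 scale, so 9.0 -> top 10%.
    """
    if not 0.0 <= activity <= 10.0:
        raise ValueError("activity is expected to be between 0 and 10")
    return round((10.0 - activity) * 10.0, 1)


def month_over_month_growth(previous_stars: int, current_stars: int) -> float:
    """Percentage change in stars between two consecutive months."""
    if previous_stars <= 0:
        raise ValueError("previous_stars must be positive")
    return round((current_stars - previous_stars) / previous_stars * 100.0, 1)


# Activity scores taken from the comparison table.
cuallee_bracket = top_percent(9.0)
polars_xdt_bracket = top_percent(9.7)

# Hypothetical star counts, chosen only to illustrate the calculation.
example_growth = month_over_month_growth(200, 221)

print(cuallee_bracket, polars_xdt_bracket, example_growth)
```

Under these assumptions, a higher activity score always means a narrower top bracket, which is why polars-xdt (9.7) ranks as more actively developed than cuallee (9.0) in the table.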
cuallee
Posts with mentions or reviews of cuallee.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2022-11-30.
- Show HN: Snowflake Data Quality Checks in Python
- data-diff VS cuallee - a user suggested alternative
  2 projects | 30 Nov 2022
  Declarative data quality rules at scale
- deequ VS cuallee - a user suggested alternative
  2 projects | 30 Nov 2022
  Cuallee offers a faster, optimized version of pydeequ's Check API by using the new Observation API in PySpark. It also supports Snowpark, Pandas, Polars and DuckDB dataframe abstractions.
- Show HN: Pyspark and Snowpark and Pandas data quality
- Show HN: Cuallee – pyspark data quality framework for v3.3.0
polars-xdt
Posts with mentions or reviews of polars-xdt.
We have used some of these posts to build our list of alternatives
and similar projects.
What are some alternatives?
When comparing cuallee and polars-xdt you can also consider the following projects:
data-diff - Compare tables within or across databases
functime - Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data.
soda-core - Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
awesome-polars - A curated list of Polars talks, tools, examples & articles. Contributions welcome!
great_expectations - Always know what to expect from your data.
ibis - the portable Python dataframe library
fastexcel - A Python wrapper around calamine
deequ - Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
qsv - CSVs sliced, diced & analyzed.