Our great sponsors
-
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Cuallee offers a faster and optimized version of pydeequ, on the Check API through the use of the new Observation API in pyspark. As well as support to Snowpark, Pandas, Polars and DuckDB dataframe abstractions.
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.