Similar projects and alternatives to deequ based on common topics and language
Scout APM: A developer's best friend. Try free for 14-days. Scout APM uses tracing logic that ties bottlenecks to source code so you know the exact line of code causing performance issues and can get back to building a great product faster.
Reviews and mentions
PySpark - How to get Corrupted Records after Casting
reddit.com/r/dataengineering | 2021-09-28
Deequ (this is the Scala version but they have PyDeequ also)
High level overviews of how to properly publish Spark open source libraries (Scala and PySpark)
reddit.com/r/apachespark | 2021-04-15
I am working with the Deequ maintainers and gave them some detailed suggestions on how to maintain a Scala open source lib. TL;DR:
Considering forking Deequ
reddit.com/r/apachespark | 2021-03-21
Deequ is a popular library to unit test big data with Spark.
How would you QA data before/after a migration?
reddit.com/r/dataengineering | 2021-03-16
check out https://github.com/awslabs/deequ
Using Deequ 1.1 with Spark 3
dev.to | 2021-02-25
If you try to upgrade AWS Deequ to latest version (1.1.0) atm and use with Spark 3.0.1 you will get following error:
awslabs/deequ is an open source project licensed under Apache License 2.0 which is an OSI approved license.
Are you hiring? Post a new remote job listing for free.