deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets. (by awslabs)
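
To make the "unit tests for data" idea concrete, here is a minimal plain-Python sketch of declarative data checks. It is an illustration of the concept only and does not use Deequ or Spark; the names `Constraint`, `is_complete`, `is_unique`, and `run_checks` are hypothetical and are not Deequ's API.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

# A row is a simple mapping from column name to value (None means missing).
Row = Dict[str, Optional[object]]


@dataclass
class Constraint:
    """A named data-quality rule evaluated over a whole dataset (hypothetical helper)."""
    name: str
    predicate: Callable[[List[Row]], bool]


def is_complete(column: str) -> Constraint:
    # Passes when no row has a missing (None) value in the column.
    return Constraint(
        f"is_complete({column})",
        lambda rows: all(r.get(column) is not None for r in rows),
    )


def is_unique(column: str) -> Constraint:
    # Passes when every value in the column occurs exactly once.
    def check(rows: List[Row]) -> bool:
        values = [r.get(column) for r in rows]
        return len(values) == len(set(values))

    return Constraint(f"is_unique({column})", check)


def run_checks(rows: List[Row], constraints: List[Constraint]) -> Dict[str, bool]:
    # Evaluate each constraint and report pass/fail by constraint name.
    return {c.name: c.predicate(rows) for c in constraints}


rows = [
    {"id": 1, "name": "a"},
    {"id": 2, "name": None},   # missing name
    {"id": 2, "name": "c"},    # duplicate id
]

report = run_checks(rows, [is_complete("id"), is_complete("name"), is_unique("id")])
for check_name, passed in report.items():
    print(check_name, "passed" if passed else "failed")
```

In this sketch the dataset is an in-memory list; a library such as Deequ applies the same kind of declared constraints to large datasets on Spark.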

Deequ Alternatives

Similar projects and alternatives to deequ based on common topics and language

  • GitHub repo soda-sql

    deequ VS soda-sql

    Data profiling, testing, and monitoring for SQL accessible data.

  • GitHub repo Apache Spark

    deequ VS Apache Spark

    Apache Spark - A unified analytics engine for large-scale data processing

  • GitHub repo BigDL

    deequ VS BigDL

    BigDL: Distributed Deep Learning Framework for Apache Spark

  • GitHub repo Quill

    deequ VS Quill

    Compile-time Language Integrated Queries for Scala (by getquill)

  • GitHub repo SynapseML

    deequ VS SynapseML

    Microsoft Machine Learning for Apache Spark

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. A higher number therefore indicates a better deequ alternative or a closer match.

Reviews and mentions

Posts with mentions or reviews of deequ. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-03-16.
  • PySpark - How to get Corrupted Records after Casting
    Deequ (this is the Scala version but they have PyDeequ also)
  • High level overviews of how to properly publish Spark open source libraries (Scala and PySpark)
    I am working with the Deequ maintainers and gave them some detailed suggestions on how to maintain a Scala open source lib. TL;DR:
  • Considering forking Deequ
    Deequ is a popular library to unit test big data with Spark.
  • How would you QA data before/after a migration?
    check out https://github.com/awslabs/deequ
  • Using Deequ 1.1 with Spark 3
    dev.to | 2021-02-25
    If you try to upgrade AWS Deequ to the latest version (1.1.0 at the moment) and use it with Spark 3.0.1, you will get the following error:

Stats

Basic deequ repo stats
Mentions: 5
Stars: 1,923
Activity: 5.4
Last commit: 16 days ago

awslabs/deequ is an open source project licensed under the Apache License 2.0, which is an OSI-approved license.
