deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets. (by awslabs)

Deequ Alternatives

Similar projects and alternatives to deequ

  1. snowflake

    523 deequ VS snowflake

    Discontinued Snowflake is a network service for generating unique ID numbers at high scale with some simple guarantees.

  2. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  3. MonitorControl

    🖥 Control your display's brightness & volume on your Mac as if it was a native Apple Display. Use Apple Keyboard keys or custom shortcuts. Shows the native macOS OSDs.

  4. devenv

    102 deequ VS devenv

    Fast, Declarative, Reproducible, and Composable Developer Environments

  5. Tabby

    94 deequ VS Tabby

    A terminal for a more modern age

  6. enso

    85 deequ VS enso

    Enso Analytics is a self-service data prep and analysis platform designed for data teams.

  7. delta

    73 deequ VS delta

    An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs (by delta-io)

  8. rembg

    55 deequ VS rembg

    Rembg is a tool to remove images background

  9. Nutrient

    Nutrient – The #1 PDF SDK Library, trusted by 10K+ developers. Other PDF SDKs promise a lot - then break. Laggy scrolling, poor mobile UX, tons of bugs, and lack of support cost you endless frustrations. Nutrient’s SDK handles billion-page workloads - so you don’t have to debug PDFs. Used by ~1 billion end users in more than 150 different countries.

    Nutrient logo
  10. soda-sql

    Discontinued Data profiling, testing, and monitoring for SQL accessible data.

  11. elementary

    The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

  12. TheHive

    25 deequ VS TheHive

    TheHive: a Scalable, Open Source and Free Security Incident Response Platform

  13. Snowplow

    21 deequ VS Snowplow

    The leader in Next-Generation Customer Data Infrastructure

  14. re_data

    15 deequ VS re_data

    re_data - fix data issues before your users & CEO would discover them 😊

  15. OpenWhisk

    17 deequ VS OpenWhisk

    Apache OpenWhisk is an open source serverless cloud platform

  16. great_expectations

    Always know what to expect from your data.

  17. kafka-manager

    13 deequ VS kafka-manager

    CMAK is a tool for managing Apache Kafka clusters

  18. circe

    12 deequ VS circe

    Yet another JSON library for Scala

  19. dbt-data-reliability

    dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

  20. BigDL

    9 deequ VS BigDL

    Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.

  21. SynapseML

    18 deequ VS SynapseML

    Simple and Distributed Machine Learning

  22. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better deequ alternative or higher similarity.

deequ discussion

Log in or Post with

deequ reviews and mentions

Posts with mentions or reviews of deequ. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-08-23.

Stats

Basic deequ repo stats
18
3,363
5.8
3 days ago

awslabs/deequ is an open source project licensed under Apache License 2.0 which is an OSI approved license.

The primary programming language of deequ is Scala.


Sponsored
CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai

Did you know that Scala is
the 38th most popular programming language
based on number of references?