awslabs/deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

  • Stars: 3,618
  • Forks: 584
  • Commits: 352
  • Language: Scala
  • Awesome lists: 1

Similar repositories

MigoXLab/dingo

Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool

702 stars
Python · 2 awesome lists

Desbordante/desbordante-core

Desbordante is a high-performance data profiler capable of discovering many different patterns in data using various algorithms. It can also run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

482 stars
C++ · 1 awesome list

sfu-db/dataprep

Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.

2242 stars
Python · 1 awesome list

deeplearning4j/deeplearning4j

Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for Keras, TensorFlow, and ONNX/PyTorch, a modular and tiny C++ library for running math code, and a Java-based math library on top of the core C++ library. Also includes SameDiff: a PyTorch/TensorFlow-like library for running deep learn...

14230 stars
Java · 2 awesome lists

sdv-dev/SDV

Synthetic data generation for tabular data

3494 stars
Python · 1 awesome list

dssg/aequitas

Bias Auditing & Fair ML Toolkit

760 stars
Python · 1 awesome list

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 20:52

[Charts: stars history (total stars); commits history (default branch commits)]

Metadata

  • Created: 2018-08-07
  • First commit: —
  • Last pushed: 2026-05-22
  • Archived: no
  • Stack detected: —
  • License: Apache-2.0

AI development signals

No AI development config files detected.