Sign in
Awesome

Awesome-list intelligence for GitHub

Search every repository hiding inside awesome lists.

Discover projects curated by awesome-list maintainers, then narrow them by stars, age, freshness, archive status, language, topics, generated tags, detected stacks, package managers, and source list.

Repos indexed
9,926
Awesome lists tracked
76
Current results
31
31 repos shown
Topic: data-engineering
Highlighted

Open highlighted repo slot

Put your repository first

Promote a GitHub repo at the top of Awesome repository list views for 7 days.

mlrun/mlrun

MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integrates into your development and CI/CD environment and automates the delivery of production data, ML pipelines, and online applications.

Python #data-engineering#data-science#experiment-tracking#kubernetes#machine-learning AI dev signals 1 awesome list 7689 commits 1 history point updated 2026-05-25
Desbordante/desbordante-core

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

C++ #anomaly-detection#correlations#data-analytics#data-cleaning#data-cleansing 1 awesome list 2011 commits first commit 2019-07-08 4 history points updated 2026-05-28