Sign in
Awesome

Awesome-list intelligence for GitHub

Search every repository hiding inside awesome lists.

Discover projects curated by awesome-list maintainers, then narrow them by stars, age, freshness, archive status, language, topics, generated tags, detected stacks, package managers, and source list.

Repos indexed
9,926
Awesome lists tracked
76
Current results
136
136 repos shown
Topic: data-science
Highlighted

Open highlighted repo slot

Put your repository first

Promote a GitHub repo at the top of Awesome repository list views for 7 days.

microsoft/responsible-ai-toolbox

Responsible AI Toolbox is a suite of tools providing model and data exploration and assessment user interfaces and libraries that enable a better understanding of AI systems. These interfaces and libraries empower developers and stakeholders of AI systems to develop and monitor AI more responsibly, and take better data-driven actions.

TypeScript #data-analysis#data-science#data-visualization#error-analysis#explainability 1 awesome list 1985 commits 1 history point updated 2026-04-29
mlrun/mlrun

MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integrates into your development and CI/CD environment and automates the delivery of production data, ML pipelines, and online applications.

Python #data-engineering#data-science#experiment-tracking#kubernetes#machine-learning AI dev signals 1 awesome list 7689 commits 1 history point updated 2026-05-25
Bessouat40/RAGLight

RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connect external tools and data sources.

Python #agentic-ai#agentic-rag#agentic-workflow#artificial-intelligence#data-science AI dev signals 1 awesome list 512 commits first commit 2024-12-12 1 history point updated 2026-03-24
Desbordante/desbordante-core

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

C++ #anomaly-detection#correlations#data-analytics#data-cleaning#data-cleansing 1 awesome list 2011 commits first commit 2019-07-08 4 history points updated 2026-05-28