

Awesome Machine Learning

A curated list of awesome Machine Learning frameworks, libraries and software.

josephmisiti/awesome-machine-learning
List stars: 72,664
README repos: 861
Indexed repos: 131
List commits: -
Forks: 15,476
Open issues: 1

Tracked list growth

GitHub stars and default-branch commits for josephmisiti/awesome-machine-learning.

Latest scan 2026-06-02 10:49

[Charts: Likes history (GitHub stars); Commits history (default-branch commits)]

Indexed repositories

131 repos currently saved from this list.

Latest repo push 2026-06-02

Age filters use known first-commit dates and exclude repositories that have not synced that data yet.


clips/pattern

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

Python | #machine-learning #natural-language-processing #network-analysis #python | pushed 2024-06-10 | 1,434 commits | first commit 2011-02-24 | 2 list mentions
vaexio/vaex

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

Python | #bigdata #data-science #dataframe #hdf5 | pushed 2026-04-01 | 3,727 commits | first commit 2014-01-27 | 3 list mentions
MervinPraison/PraisonAI

PraisonAI 🦞 — Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous self-improving agents that research, plan, code, and execute tasks. Deployed in 5 lines of code with built-in memory, RAG, and support for 100+ LLMs.

Python | #agents #ai #ai-agent-framework #ai-agent-sdk | pushed 2026-05-25 | 3,833 commits | 6 list mentions | AI dev signals
h2oai/h2o-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

Jupyter Notebook | #automl #big-data #data-science #deep-learning | pushed 2026-05-30 | 32,791 commits | first commit 2014-03-03 | 3 list mentions | AI dev signals
clearml/clearml

ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution

Python | #ai #clearml #control #deep-learning | pushed 2026-05-25 | 2,751 commits | 2 list mentions
guofei9987/scikit-opt

Genetic Algorithm, Particle Swarm Optimization, Simulated Annealing, Ant Colony Optimization Algorithm, Immune Algorithm, Artificial Fish Swarm Algorithm, Differential Evolution and TSP (Traveling Salesman)

Python | #ant-colony-algorithm #artificial-intelligence #fish-swarms #genetic-algorithm | pushed 2026-03-25 | 345 commits | first commit 2017-12-06 | 2 list mentions
IDSIA/sacred

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Python | #infrastructure #machine-learning #mongodb #python | pushed 2025-10-22 | 1,354 commits | first commit 2014-03-31 | 2 list mentions
deepchecks/deepchecks

Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling you to thoroughly test your data and models from research to production.

Python | #data-drift #data-science #data-validation #deep-learning | pushed 2025-12-28 | 1,504 commits | 3 list mentions
MAIF/shapash

🔅 Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models

Jupyter Notebook | #ethical-artificial-intelligence #explainability #explainable-ml #interpretability | pushed 2026-05-23 | 1,903 commits | 2 list mentions
aksnzhy/xlearn

High-performance, easy-to-use, and scalable machine learning (ML) package, including linear models (LR), factorization machines (FM), and field-aware factorization machines (FFM), with Python and CLI interfaces.

C++ | #data-analysis #data-science #factorization-machines #ffm | pushed 2023-08-28 | 1,342 commits | first commit 2017-06-10 | 2 list mentions
BayesWitnesses/m2cgen

Transform ML models into native code (Java, C, Python, Go, JavaScript, Visual Basic, C#, R, PowerShell, PHP, Dart, Haskell, Ruby, F#, Rust) with zero dependencies

Python | pytest pip | #c #csharp #dartlang #go | pushed 2024-08-03 | 376 commits | first commit 2019-01-13 | 4 list mentions
deepnote/deepnote

Deepnote is a drop-in replacement for Jupyter with an AI-first design, sleek UI, new blocks, and native data integrations. Use Python, R, and SQL locally in your favorite IDE, then scale to Deepnote cloud for real-time collaboration, Deepnote agent, and deployable data apps. https://deepnote.com/

TypeScript | #artificial-intelligence #data #data-analysis #data-science | pushed 2026-05-21 | 269 commits | 3 list mentions | AI dev signals
AutoViML/AutoViz

Automatically Visualize any dataset, any size with a single line of code. Created by Ram Seshadri. Collaborators Welcome. Permission Granted upon Request.

Python | #auto-sklearn #automated-machine-learning #automl #automl-algorithms | pushed 2024-06-10 | 223 commits | first commit 2019-07-17 | 2 list mentions