Open highlighted repo slot
Put your repository first
Promote a GitHub repo at the top of Awesome repository list views for 7 days.
Awesome List
Probably the best curated list of data science software in Python.
GitHub stars and default-branch commits for krzjoa/awesome-python-data-science.
351 repos currently saved from this list.
Open highlighted repo slot
Promote a GitHub repo at the top of Awesome repository list views for 7 days.
The Python ensemble sampling toolkit for affine-invariant MCMC
Keras community contributions
Distributed Deep learning with Keras & Spark
open-source feature selection repository in python
Spearmint Bayesian optimization codebase
Simple text to phones converter for multiple languages
MLBox is a powerful Automated Machine Learning python library.
TensorFlow GNN is a library to build Graph Neural Networks on the TensorFlow platform.
PySAL: Python Spatial Analysis Library Meta-Package
Python library for time series forecasting using scikit-learn compatible models, statistical methods, and foundation models
Clean APIs for data cleaning. Python implementation of R package Janitor
OpenMMLab Foundational Library for Training Deep Learning Models
A Graph Neural Network Library in Jax
moDel Agnostic Language for Exploration and eXplanation
Supply a wrapper ``StockDataFrame`` based on the ``pandas.DataFrame`` with inline stock statistics/indicators support.
Python interface for igraph
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
No-code in the front, Python in the back. An open-source framework for creating data apps.
Metric learning algorithms in Python
A research toolkit for particle swarm optimization in Python
sqldf for pandas
Python implementation of CMA-ES
Transpile trained scikit-learn estimators to C, Java, JavaScript and others.
Model analysis tools for TensorFlow
Anomaly Detection and Correlation library
SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization
Scalable machine 🤖 learning for time series forecasting.
:chart_with_upwards_trend: Adaptive: parallel active learning of mathematical functions
Fast NumPy array functions written in C
PySpark + Scikit-learn = Sparkit-learn
An autoML framework & toolkit for machine learning on graphs.
fastFM: A Library for Factorization Machines
Industry-strength Computer Vision workflows with Keras
Modular Reinforcement Learning (RL) library (implemented in PyTorch, JAX, and NVIDIA Warp) with support for Gymnasium/Gym, NVIDIA Isaac Lab, MuJoCo Playground and other environments
allRank is a framework for training learning-to-rank neural models based on PyTorch.
[CONTRIBUTORS WELCOME] Generalized Additive Models in Python
Pretrained model hub for Keras 3.
A scikit-learn based module for multi-label et. al. classification
Scalable Machine Learning with Dask
Factorization machines in python
Distributed machine learning platform
The Classical Language Toolkit
OpenFE: automated feature generation with expert-level performance
python partial dependence plot toolbox
Universal model exchange and serialization format for decision tree forests
Code for "High-Precision Model-Agnostic Explanations" paper
:small_red_triangle: Ternary plotting library for python with matplotlib
A PyTorch and TorchDrug based deep learning library for drug pair scoring. (KDD 2022)
Library for exploring and validating machine learning data
TensorFlow implementation of an arbitrary order Factorization Machine