Sign in

Awesome List

Awesome Python Data Science

Probably the best curated list of data science software in Python.

krzjoa/awesome-python-data-science #awesome#awesome-list#awesome-python#data-analysis#data-science#data-visualization#deep-learning#machine-learning#python#scikit-learn#statistics
List stars
3,450
README repos
357
Indexed repos
351
List commits
500
Forks
447
Open issues
13

Tracked list growth

GitHub stars and default-branch commits for krzjoa/awesome-python-data-science.

Latest scan 2026-06-02 10:49

Likes history

GitHub stars

Commits history

Default branch commits

Indexed repositories

351 repos currently saved from this list.

No filters applied
Latest repo push 2026-06-02

Age filters use known first-commit dates and exclude repositories that have not synced that data yet.

Reset
Highlighted

Open highlighted repo slot

Put your repository first

Promote a GitHub repo at the top of Awesome repository list views for 7 days.

sktime/skpro

A unified framework for tabular probabilistic regression, time-to-event prediction, and probability distributions in python

Python #ai#data-science#distributional-regression#distributions pushed 2026-05-19 680 commits first commit 2017-09-13 1 list mention
jaswinder9051998/zoofs

zoofs is a python library for performing feature selection using a variety of nature-inspired wrapper algorithms. The algorithms range from swarm-intelligence to physics-based to Evolutionary. It's easy to use , flexible and powerful tool to reduce your feature size.

Python #evolutionary-algorithms#feature-selection#genetic-algorithm#grey-wolf pushed 2026-01-09 264 commits first commit 2020-07-11 1 list mention
janpipek/physt

Python histogram library - histograms as updateable, fully semantic objects with visualization tools. [P]ython [HYST]ograms.

Python #2d-histograms#heatmap#histogram#plotting pushed 2026-03-19 1,154 commits first commit 2016-03-25 1 list mention
NITRO-AI/NitroFE

NitroFE is a Python feature engineering engine which provides a variety of modules designed to internally save past dependent values for providing continuous calculation.

Python #feature#feature-engineering#features#indicator pushed 2022-05-04 128 commits first commit 2021-08-26 1 list mention
liquidSVM/liquidSVM

Support vector machines (SVMs) and related kernel-based learning algorithms are a well-known class of machine learning algorithms, for non-parametric classification and regression. liquidSVM is an implementation of SVMs whose key features are: fully integrated hyper-parameter selection, extreme speed on both small and large data sets, full flexibility for experts, and inclusion of a variety of different learning scenarios: multi-class classification, ROC, and Neyman-Pearson learning, and least-squares, quantile, and expectile regression.

C++ #apache-spark#c-plus-plus#classification#expectile-regression pushed 2020-02-20 48 commits first commit 2017-04-22 1 list mention