Sign in

Awesome List

Awesome Public Datasets

A topic-centric list of HQ open datasets.

awesomedata/awesome-public-datasets #aaron-swartz#awesome-public-datasets#datasets#opendata
List stars
75,789
README repos
81
Indexed repos
75
List commits
844
Forks
11,517
Open issues
153

Tracked list growth

GitHub stars and default-branch commits for awesomedata/awesome-public-datasets.

Latest scan 2026-06-03 10:49

Likes history

GitHub stars

Commits history

Default branch commits

Indexed repositories

75 repos currently saved from this list.

No filters applied
Latest repo push 2026-05-30

Filter this list

Search within Awesome Public Datasets or narrow by ecosystem and project health.

Search mode
Tune results
More filters Topics, generated tags, stack, age, archive status, and growth.
Ecosystem
Health

Uses known first-commit dates.

Momentum
Reset filters
Highlighted

Open highlighted repo slot

Put your repository first

Promote a GitHub repo at the top of Awesome repository list views for 7 days.

BetaNYC/Bike-Share-Data-Best-Practices

This repo is designed to gather bike share data best practices AND socialize a list of open and free tools to hack on bike share data. This grows from Council Member Brad Lander introducing Int. No 1117-2013 on 24 July 2013. This is a local law to amend the administrative code of the City of New York, in relation to requiring the complication of Citi Bike usage data.

pushed 2014-01-19 20 commits first commit 2013-07-25 1 list mention
Visillect/CubePlusPlus

Cube++ is a novel dataset collected for illumination estimation problem. It has 4890 raw 18-megapixel images, each containing a SpyderCube color target in their scenes, manually labelled categories, and ground truth illumination chromaticities.

Python #color-constancy#dataset#illumination-estimation#mixed-illumination pushed 2021-03-12 55 commits first commit 2020-07-21 1 list mention