Open highlighted repo slot
Put your repository first
Promote a GitHub repo at the top of Awesome repository list views for 7 days.
GitHub projects from awesome lists
Search names, descriptions, topics, tags, and stacks, then tune results by ecosystem, freshness, health, and cross-list signal.
Open highlighted repo slot
Promote a GitHub repo at the top of Awesome repository list views for 7 days.
Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE
A repository of data on coronavirus cases and deaths in the U.S.
World countries in JSON, YAML, CSV and XML. Any help is welcome!
:globe_with_meridians: List of all countries with names and ISO 3166-1 codes in all languages and data formats.
List of Dirty, Naughty, Obscene, and Otherwise Bad Words
FMA: A Dataset For Music Analysis
Bruteforce database
ATP Tennis Rankings, Results, and Stats
🗺 High Quality GeoJSON maps programmatically generated.
World’s single largest Internet domains dataset
Twitter NLP Tools
Uber trip data from a freedom of information request to NYC's Taxi & Limousine Commission
The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.
Tate Collection metadata
Collection of open data resources for traffic information
MOVED - The project is still under development but this page is deprecated.
⚽️ Extract, prepare and publish Transfermarkt datasets.
Core meta for awesome-public-datasets. Contribute new data here!
source{d} datasets ("big code") for source code analysis and machine learning on source code
Data for Automatic Keyphrase Extraction Task
WTA Tennis Rankings, Results, and Stats
Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain
Ultra-deep search for novel viruses
This repo is designed to gather bike share data best practices AND socialize a list of open and free tools to hack on bike share data. This grows from Council Member Brad Lander introducing Int. No 1117-2013 on 24 July 2013. This is a local law to amend the administrative code of the City of New York, in relation to requiring the complication of Citi Bike usage data.
Collection Data for Cooper Hewitt, Smithsonian Design Museum
All-Age-Faces (AAF) Database.
No description.
This data set includes Landsat 8 images and their manually extracted pixel-level ground truths for cloud detection.
The Turing Change Point Dataset - A collection of time series for the evaluation and development of change point detection algorithms
Global Biotic Interactions provides access to existing species interaction datasets