Awesome Public Datasets on GitHub

A long, categorized list of large datasets (available for public use) to try your analytics skills on. Which one would you pick?



Data Challenges


 
Economics


 
Energy


 
Finance


 
You can also find various datasets for the following categories:

GeoSpace/GIS

Government

Healthcare

Image Processing

Machine Learning

Museums

Natural Language

Physics

Public Domains

Search Engines

Social Sciences

Sports

Time Series

Transportation

Complementary Collections

GitHub Link: https://github.com/caesar0301/awesome-public-datasets

xia-mingXia Ming is a Ph.D. candidate at Shanghai Jiao Tong Univ. He received B.S. in Optical Information and Science Technology in 2010 at Xidian University, Xi'an, China. His research area is the measurement and analysis of mobile network traffic, especially on the renewed models and characteristics of networks traffic, employing statistical and machine learning techniques on distributed processing platforms such as Apache Spark.

So, which dataset would you pick today? Would you like to add anything to this list?

Let us know your thoughts in the comments below.

Related: