Awesome Public Datasets on GitHub
A long, categorized list of large datasets (available for public use) to try your analytics skills on. Which one would you pick?
Pages: 1 2
- Challenges in Machine Learning
- D4D Challenge of Orange
- DrivenData Competitions for Social Good
- ICWSM Data Challenge (since 2009)
- Kaggle Competition Data
- KDD Cup by Tencent 2012
- Localytics Data Visualization Challenge
- Netflix Prize
- Yelp Dataset Challenge
Economics
Energy
Finance
- CBOE Futures Exchange
- Google Finance
- Google Trends
- NASDAQ
- OANDA
- OSU Financial data
- Quandl
- St Louis Federal
- Yahoo Finance
You can also find various datasets for the following categories:
GeoSpace/GIS
Government
Healthcare
Image Processing
Machine Learning
Museums
Natural Language
Physics
Public Domains
Search Engines
Social Sciences
Sports
Time Series
Transportation
Complementary Collections
GitHub Link: https://github.com/caesar0301/awesome-public-datasets

So, which dataset would you pick today? Would you like to add anything to this list?
Let us know your thoughts in the comments below.
Related:
- KDnuggets Datasets for Data Mining and Data Science
- Interesting Social Media Datasets
- Free Urban Data – What’s It Good For?
Pages: 1 2
Top Stories Past 30 Days
|
|