Google BigQuery Public Datasets
Google BigQuery is not only a fantastic tool for analyzing data; it also hosts a repository of public data, including the GDELT world events database, NYC taxi rides, the GitHub archive, Reddit top posts, and more.
on Feb 20, 2015 in BigQuery, GDELT, Google, New York City, Reddit
Fun and Top! US States in 2 Words using twitteR
Combining the twitteR package with text mining techniques and visualization tools can produce interesting outputs. Find out which US state is fun and top, and which is good and crazy, according to Twitter.
on Feb 19, 2015 in R, Text Mining, Twitter, USA
Automatic Statistician and the Profoundly Desired Automation for Data Science
The Automatic Statistician project from the University of Cambridge and MIT is pushing the frontiers of automation in selecting and evaluating machine learning models. More broadly, what does automation mean for Data Science?
on Feb 17, 2015 in Automation, Cambridge, Data Cleaning, Data Science, Machine Learning, MIT, Modeling, Statistician
Tinderbox: Automating Romance with Tinder and Eigenfaces
Tinderbox is software that uses machine learning and image recognition to automate Tinder, a popular app for meeting singles. The author describes his experience and feedback, up to the point where it started to work too well.
on Feb 15, 2015 in Bots, Eigenface, Image Recognition, Romance, Tinder
Facebook Open Sources deep-learning modules for Torch
We review Facebook's recently released Torch modules for Deep Learning, which help researchers train large-scale convolutional neural networks for image recognition, natural language processing, and other AI applications.
on Feb 9, 2015 in Artificial Intelligence, Deep Learning, Facebook, GPU, Neural Networks, NYU, Ran Bi, Torch, Yann LeCun