Prismatic Interest Graph [API]: Organize and Recommend Content
Prismatic Interest Graph API provides a set of tools for automatically analyzing unstructured text and annotating it with a variety of tags that are useful for organizing and recommending content.
on Feb 20, 2015 in Machine Learning, Prismatic, Recommendations, Text Analytics, Text Mining
Google BigQuery Public Datasets
Google BigQuery is not only a fantastic tool to analyze data, but it also has a repository of public data, including GDELT world events database, NYC Taxi rides, GitHub archive, Reddit top posts, and more.
on Feb 20, 2015 in BigQuery, GDELT, Google, New York City, Reddit
Fun and Top! US States in 2 Words using twitteR
Combining twitteR package with text mining techniques and visualization tools can produce interesting outputs. Find out which US state is fun and top, and which is good and crazy, according to Twitter.
on Feb 19, 2015 in R, Text Mining, Twitter, USA
Automatic Statistician and the Profoundly Desired Automation for Data Science
The Automatic Statistician project by Univ. of Cambridge and MIT is pushing ahead the frontiers of automation for the selection and evaluation of machine learning models. In general, what does automation mean to Data Science?
on Feb 17, 2015 in Automation, Cambridge, Data Cleaning, Data Science, Machine Learning, MIT, Modeling, Statistician
Tamr Enterprise Platform for Scalable, End-to-End Data Unification
The new Tamr Platform radically simplifies and speeds the availability of unified data for analytics and downstream application, with key new features: catalog, connect, and consume. Tamr also announced solutions for Pharma and Procurement.
on Feb 17, 2015 in Cambridge, Data Preparation, MA, Pharma, Procurement, Tamr
Tinderbox: Automating Romance with Tinder and Eigenfaces
Tinderbox is a software uses machine learning and image recognition to automate Tinder, a popular app for single meetings. The author describes his experience and feedback until it started to work too well.
on Feb 15, 2015 in Bots, Eigenface, Image Recognition, Romance, Tinder
Ontotext: Integrated Text Mining and Triplestores, a form of graph database
Learn about 2 hot trends: RDF triplestores, a form of graph database, and the use of text mining to extract meaning from Big Data, and how Ontotext enables both. Free eval, Feb 26 webinar, and more.
on Feb 12, 2015 in Graph Databases, Ontotext, RDF, Text Mining, Triplestore
Facebook Open Sources deep-learning modules for Torch
We review Facebook recently released Torch module for Deep Learning, which helps researchers train large scale convolutional neural networks for image recognition, natural language processing and other AI applications.
on Feb 9, 2015 in Artificial Intelligence, Deep Learning, Facebook, GPU, Neural Networks, NYU, Ran Bi, Torch, Yann LeCun
How Big Data Pieces, Technology, and Animals fit together
How Big Data Pieces and animals fit together: MapReduce, HDFS, Apache Spark,, Pregel, Zookeeper, Flume, Hive, Pig, and more, explained by a Quora (and past Facebook) Data Scientist.
on Feb 5, 2015 in Apache Hive, Apache Spark, Google, Hadoop, MLlib
Comics Recommendations: “Tinder for Comics” built with Tapastic and PredictionIO
Here is how we built a cool demo of recommending comics, using PredictionIO new Similar Product Template and dataset provided by Tapastic.com.
on Feb 2, 2015 in Cartoon, PredictionIO, Recommendations, Tapastic, Tinder