- Bad Data + Good Models = Bad Results - Jan 26, 2017.
No matter how advanced is your Machine Learning algorithm, the results will be bad if the input data
is bad. We examine one popular IMDB dataset and discuss how an analyst can deal with such data.
- Data Science of Visiting Famous Movie Locations in San Francisco - Jul 30, 2016.
Using the Google Places API and IMDb API, we selected movie locations in The Golden City which every movie fan should visit while they are in town, and optimize sightseeing by solving the travelling salesman problem.
- Which Movie Sequels Are Really Better? A Data Science Answer - Oct 19, 2015.
The internet is filled with polls and lists of sequels that are better or worse movie in the series. Yet such rankings are often based on personal judgement and rarely on data and statistics. Here is our solution to analyze and visualize the movie series.
- In Machine Learning, What is Better: More Data or better Algorithms - Jun 17, 2015.
Gross over-generalization of “more data gives better results” is misguiding. Here we explain, in which scenario more data or more features are helpful and which are not. Also, how the choice of the algorithm affects the end result.