- Save Sarah Connor with Data Science - Oct 25, 2021.
Data science and data privacy are deeply interwoven, and must be carefully considered by practitioners. In comparing the Safe Harbour and Expert Determination data obfuscation approaches, Safe Harbour has been very popular among data engineers but has fundamental limitations, where Expert Determination offers important advantages.
Data Analytics, Data Science, GDPR, Movies, Privacy
- Content-based Recommender Using Natural Language Processing (NLP) - Nov 26, 2019.
A guide to build a content-based movie recommender model based on NLP.
Movies, Netflix, NLP, Python, Recommender Systems
- GitHub Repo Raider and the Automation of Machine Learning - Nov 18, 2019.
Since X never, ever marks the spot, this article raids the GitHub repos in search of quality automated machine learning resources. Read on for projects and papers to help understand and implement AutoML.
Automated Machine Learning, GitHub, Machine Learning, Movies, Python
- How Data Science Is Used Within the Film Industry - Jul 5, 2019.
As Data Science is becoming pervasive across so many industries, Hollywood is certainly not being left behind. Learn about how Big Data, analytics, and AI are now core drivers of the movies we watch and how we watch them.
Data Science, Industry, Marketing, Movies, Predictive Analytics, Recommender Systems
- Building a Recommender System, Part 2 - Jul 3, 2019.
This post explores an technique for collaborative filtering which uses latent factor models, a which naturally generalizes to deep learning approaches. Our approach will be implemented using Tensorflow and Keras.
Movies, Python, Recommendation Engine, Recommender Systems
- Building a Recommender System - Apr 4, 2019.
A beginners guide to building a recommendation system, with a step-by-step guide on how to create a content-based filtering system to recommend movies for a user to watch.
Movies, Python, Recommendation Engine, Recommender Systems
- Bad Data + Good Models = Bad Results - Jan 26, 2017.
No matter how advanced is your Machine Learning algorithm, the results will be bad if the input data
is bad. We examine one popular IMDB dataset and discuss how an analyst can deal with such data.
Data Quality, Face Recognition, IMDb, Kaggle, Movies
- Which Movie Sequels Are Really Better? A Data Science Answer - Oct 19, 2015.
The internet is filled with polls and lists of sequels that are better or worse movie in the series. Yet such rankings are often based on personal judgement and rarely on data and statistics. Here is our solution to analyze and visualize the movie series.
Data Analysis, Data Visualization, IMDb, James Bond, Movies, Silk.co