-
Learning from Imbalanced Classes
Imbalanced classes can cause trouble for classification. Not all hope is lost, however. Check out this article for methods in which to deal with such a situation.
-
Data Mining Tip: How to Use High-cardinality Attributes in a Predictive Model
High-cardinality nominal attributes can pose an issue for inclusion in predictive models. There exist a few ways to accomplish this, however, which are put forward here.
-
MDL Clustering: Unsupervised Attribute Ranking, Discretization, and Clustering
MDL Clustering is a free software suite for unsupervised attribute ranking, discretization, and clustering based on the Minimum Description Length principle and built on the Weka Data Mining platform.
-
A Tutorial on the Expectation Maximization (EM) Algorithm
This is a short tutorial on the Expectation Maximization algorithm and how it can be used on estimating parameters for multi-variate data.
-
Introduction to Local Interpretable Model-Agnostic Explanations (LIME)
Learn about LIME, a technique to explain the predictions of any machine learning classifier.
-
A Gentle Introduction to Bloom Filter
The Bloom Filter is a probabilistic data structure which can make a tradeoff between space and false positive rate. Read more, and see an implementation from scratch, in this post.
-
Data Science of Reviews: ReviewMeta tool Automatically Detects Unnatural Reviews on Amazon
ReviewMeta is a tool that analyzes millions of reviews and helps customers decide which ones to trust. As the dataset grows, so do the insights on unbiased reviews.
-
A Neat Trick to Increase Robustness of Regression Models
Read this take on the validity of choosing a different approach to regression modeling. Why isn't L1 norm used more often?
-
How to Become a Data Scientist – Part 1
Check out this excellent (and exhaustive) article on becoming a data scientist, written by someone who spends their day recruiting data scientists. Do yourself a favor and read the whole way through. You won't regret it!
-
Misinformation Key Terms, Explained
Misinformation has emerged as a key issue for social media platforms. This post will introduce the concept of misinformation and the 8 Key Terms, which provides insights into mining misinformation in social media.
|