Matt Mayo

How to Make Your Machine Learning Models Robust to Outliers

By Matt Mayo on August 28, 2018 in Machine Learning, Modeling, Outliers
In this blog, we’ll try to understand the different interpretations of this “distant” notion. We will also look into the outlier detection and treatment techniques while seeing their impact on different types of machine learning models.
Are Vectorized Random Number Generators Actually Useful?

By Matt Mayo on August 28, 2018 in Parallelism, Programming, Random, Randomization
I reported that you can multiply the speed of common (fast) random number generators such as PCG and xorshift128+ by a factor of three or four by vectorizing them using SIMD instructions. Is this actually useful in practice?
Multi-Class Text Classification with Scikit-Learn

By Matt Mayo on August 27, 2018 in NLP, Python, scikit-learn, Text Classification, Text Mining
The vast majority of text classification articles and tutorials on the internet are binary text classification such as email spam filtering and sentiment analysis. Real world problem are much more complicated than that.
Comparison of the Most Useful Text Processing APIs

By Matt Mayo on August 23, 2018 in NLP, Text Analytics, Text Mining
There is a need to compare different APIs to understand key pros and cons they have and when it is better to use one API instead of the other. Let us proceed with the comparison.
UX Design Guide for Data Scientists and AI Products

By Matt Mayo on August 21, 2018 in AI, Data Science, Data Scientist, UI/UX
Realizing that there is a legitimate knowledge gap between UX Designers and Data Scientists, I have decided to attempt addressing the needs from the Data Scientist’s perspective.
Basic Statistics in Python: Probability

By Matt Mayo on August 21, 2018 in Normal Distribution, Probability, Python, Statistics
At the most basic level, probability seeks to answer the question, "What is the chance of an event happening?" To calculate the chance of an event happening, we also need to consider all the other events that can occur.
Why Automated Feature Engineering Will Change the Way You Do Machine Learning

By Matt Mayo on August 20, 2018 in Automated Machine Learning, Feature Engineering, Machine Learning, Python
Automated feature engineering will save you time, build better predictive models, create meaningful features, and prevent data leakage.
Reinforcement Learning: The Business Use Case, Part 2

By Matt Mayo on August 16, 2018 in Business, Finance, Machine Learning, Reinforcement Learning, Use Cases
In this post, I will explore the implementation of reinforcement learning in trading. The Financial industry has been exploring the applications of Artificial Intelligence and Machine Learning for their use-cases, but the monetary risk has prompted reluctance.
Unveiling Mathematics Behind XGBoost

By Matt Mayo on August 14, 2018 in Gradient Boosting, Mathematics, XGBoost
Follow me till the end, and I assure you will atleast get a sense of what is happening underneath the revolutionary machine learning model.
Building Reliable Machine Learning Models with Cross-validation

By Matt Mayo on August 9, 2018 in Comet.ml, Cross-validation, Machine Learning, Modeling, scikit-learn
Cross-validation is frequently used to train, measure and finally select a machine learning model for a given dataset because it helps assess how the results of a model will generalize to an independent data set in practice.

Matt Mayo

Latest Posts

Top Posts