Matthew Mayo

KDnuggets Managing Editor

Matthew Mayo (@mattmayo13) holds a master's degree in computer science and a graduate diploma in data mining. As managing editor of KDnuggets & Statology, and contributing editor at Machine Learning Mastery, Matthew aims to make complex data science concepts accessible. His professional interests include natural language processing, language models, machine learning algorithms, and exploring emerging AI. He is driven by a mission to democratize knowledge in the data science community. Matthew has been coding since he was 6 years old.

Managing Machine Learning Workflows with Scikit-learn Pipelines Part 3: Multiple Models, Pipelines, and Grid Searches

By Matthew Mayo, KDnuggets Managing Editor on January 24, 2018 in Data Preprocessing, Hyperparameter, Optimization, Pipeline, Python, scikit-learn, Workflow
In this post, we will be using grid search to optimize models built from a number of different types estimators, which we will then compare and properly evaluate the best hyperparameters that each model has to offer.
Managing Machine Learning Workflows with Scikit-learn Pipelines Part 2: Integrating Grid Search

By Matthew Mayo, KDnuggets Managing Editor on January 19, 2018 in Data Preprocessing, Hyperparameter, Optimization, Pipeline, Python, scikit-learn, Workflow
Another simple yet powerful technique we can pair with pipelines to improve performance is grid search, which attempts to optimize model hyperparameter combinations.
NIPS 2017 Key Points & Summary Notes

By Matthew Mayo, KDnuggets Managing Editor on December 18, 2017 in Bias, Conference, Machine Learning, NeurIPS, NIPS, Reinforcement Learning
Third year Ph.D student David Abel, of Brown University, was in attendance at NIP 2017, and he labouriously compiled and formatted a fantastic 43-page set of notes for the rest of us. Get them here.
Machine Learning & Artificial Intelligence: Main Developments in 2017 and Key Trends in 2018

By Matthew Mayo, KDnuggets Managing Editor on December 15, 2017 in 2018 Predictions, AI, Ajit Jaokar, Brandon Rohrer, Daniel Tunkelang, Hugo Larochelle, Machine Learning, Pedro Domingos, Sebastian Raschka, Xavier Amatriain
As we bid farewell to one year and look to ring in another, KDnuggets has solicited opinions from numerous Machine Learning and AI experts as to the most important developments of 2017 and their 2018 key trend predictions.
Managing Machine Learning Workflows with Scikit-learn Pipelines Part 1: A Gentle Introduction

By Matthew Mayo, KDnuggets Managing Editor on December 7, 2017 in Data Preprocessing, Pipeline, Python, scikit-learn, Workflow
Scikit-learn's Pipeline class is designed as a manageable way to apply a series of data transformations followed by the application of an estimator.
Big Data: Main Developments in 2017 and Key Trends in 2018

By Matthew Mayo, KDnuggets Managing Editor on December 5, 2017 in 2018 Predictions, Big Data, Bill Inmon, Bill Schmarzo, Doug Laney, James Kobielus, Matei Zaharia, Meta Brown, Predictions, Ronald van Loon, Trends, Yves Mulkers
As we bid farewell to one year and look to ring in another, KDnuggets has solicited opinions from numerous Big Data experts as to the most important developments of 2017 and their 2018 key trend predictions.
Using Deep Learning to Solve Real World Problems

By Matthew Mayo, KDnuggets Managing Editor on December 4, 2017 in Advice, Career, Deep Learning, Neural Networks
Do you assume that deep learning is only being used for toy problems and in self-learning scenarios? This post includes several firsthand accounts of organizations using deep neural networks to solve real world problems.
A General Approach to Preprocessing Text Data

By Matthew Mayo, KDnuggets Managing Editor on December 1, 2017 in Data Preparation, Data Preprocessing, NLP, Text Analytics, Text Mining, Tokenization
Recently we had a look at a framework for textual data science tasks in their totality. Now we focus on putting together a generalized approach to attacking text data preprocessing, regardless of the specific textual data science task you have in mind.
Building a Wikipedia Text Corpus for Natural Language Processing

By Matthew Mayo, KDnuggets Managing Editor on November 23, 2017 in Datasets, Natural Language Processing, NLP, Text Mining, Wikidata, Wikipedia
Wikipedia is a rich source of well-organized textual data, and a vast collection of knowledge. What we will do here is build a corpus from the set of English Wikipedia articles, which is freely and conveniently available online.
A Framework for Approaching Textual Data Science Tasks

By Matthew Mayo, KDnuggets Managing Editor on November 22, 2017 in Modeling, Natural Language Processing, NLP, Text Analytics, Text Mining
Although NLP and text mining are not the same thing, they are closely related, deal with the same raw data type, and have some crossover in their uses. Let's discuss the steps in approaching these types of tasks.

Matthew Mayo

Latest Posts

Top Posts