PyCaret, a low-code Python machine learning library, offers several ways to tune the hyperparameters of a created model. In this post, I'd like to show how Ray Tune is integrated with PyCaret, and how easy it is to leverage its search algorithms and distributed computing to achieve results superior to the default random search method.
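For context, here is a minimal sketch of what that looks like, assuming PyCaret 2.x: `tune_model` accepts a `search_library` argument, and passing `"tune-sklearn"` (which requires the `tune-sklearn` and `ray[tune]` packages) routes the search through Ray Tune. The DataFrame name and target column below are placeholders.

```python
from pycaret.classification import setup, create_model, tune_model

# assumed: a pandas DataFrame `data` with a "target" column
exp = setup(data, target="target", session_id=42, silent=True)  # silent=True skips the dtype prompt in 2.x
rf = create_model("rf")

# default: random search via scikit-learn
tuned_default = tune_model(rf)

# Ray Tune backend with a Hyperopt search algorithm
tuned_ray = tune_model(
    rf,
    search_library="tune-sklearn",
    search_algorithm="hyperopt",
    n_iter=20,
)
```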
The rising computation time and cost of training natural language processing (NLP) models highlight the importance of designing models that retain top performance while requiring less computation. A single experiment training a top-performing language model on the Billion Word benchmark can take 384 GPU days and cost as much as $36,000 using AWS on-demand instances.
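To put those figures in perspective, a quick back-of-the-envelope check using only the two numbers quoted above:

```python
# Implied price from the quoted figures: 384 GPU-days at $36,000 total
gpu_hours = 384 * 24                           # 9,216 GPU-hours
print(f"${36_000 / gpu_hours:.2f}/GPU-hour")   # ≈ $3.91, roughly on-demand GPU instance territory
```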
Facebook recently introduced and open-sourced Wav2Vec 2.0, its new framework for self-supervised learning of representations from raw audio data. Learn more about it and how to use it here.
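As a taste of what inference looks like, here is a minimal sketch using the Hugging Face port of the model rather than the original fairseq release the article covers; the checkpoint name and audio path are examples.

```python
import torch
import librosa
from transformers import Wav2Vec2Processor, Wav2Vec2ForCTC

# LibriSpeech 960h fine-tuned checkpoint published by Facebook
processor = Wav2Vec2Processor.from_pretrained("facebook/wav2vec2-base-960h")
model = Wav2Vec2ForCTC.from_pretrained("facebook/wav2vec2-base-960h")

# load a mono 16 kHz waveform (path is a placeholder)
speech, _ = librosa.load("sample.wav", sr=16000)

inputs = processor(speech, sampling_rate=16000, return_tensors="pt")
with torch.no_grad():
    logits = model(inputs.input_values).logits

# greedy CTC decoding to a transcription
pred_ids = torch.argmax(logits, dim=-1)
print(processor.batch_decode(pred_ids)[0])
```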
Lessons from network science and the difficulty of graph anonymization. A data scientist's take on the difficulty of striking a balance between privacy and utility when anonymizing connected data.
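The core difficulty is easy to demonstrate: even with all labels stripped, a node's local structure often acts as a fingerprint. A small illustrative sketch (not from the article) using networkx and its built-in karate club graph:

```python
import networkx as nx
from collections import Counter

G = nx.karate_club_graph()

# Fingerprint each node by its own degree plus the sorted degrees of its neighbors
fingerprint = {
    n: (deg, tuple(sorted(G.degree(m) for m in G[n])))
    for n, deg in G.degree()
}
counts = Counter(fingerprint.values())

# Nodes whose 1-hop structure alone is unique remain re-identifiable after anonymization
unique = sum(1 for n in G if counts[fingerprint[n]] == 1)
print(f"{unique} of {G.number_of_nodes()} nodes have a unique 1-hop fingerprint")
```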
By reading papers, we were able to learn what others (e.g., LinkedIn) have found to work (and not work). We can then adapt their approach instead of reinventing the wheel, helping us deliver a working solution in less time and with less effort.