Matt Mayo

Budgeting For Your AI Training Data: Consider These 3 Factors

By Matt Mayo on May 26, 2021 in AI, Data Preparation, Training Data
Before you even plan to procure the data, one of the most important considerations in determining how much you should spend on your AI training data. In this article, we will give you insights to develop an effective budget for AI training data.
Topic Modeling with Streamlit

By Matt Mayo on May 26, 2021 in Deployment, NLP, Python, spaCy, Streamlit, Text Analytics, Topic Modeling
What does it take to create and deploy a topic modeling web application quickly? Read this post to see how the author uses Python NLP packages for topic modeling, Streamlit for the web application framework, and Streamlit Sharing for deployment.
The Rise of Vector Data

By Matt Mayo on May 25, 2021 in Distributed Representation, Pinecone, Representation
Embedding models convert raw data such as text, images, audio, logs, and videos into vector embeddings (“vectors”) to be used for predictions, comparisons, and other cognitive-like functions.
Write and train your own custom machine learning models using PyCaret

By Matt Mayo on May 25, 2021 in Machine Learning, Modeling, PyCaret, Python, Training
A step-by-step, beginner-friendly tutorial on how to write and train custom machine learning models in PyCaret.
Data Validation in Machine Learning is Imperative, Not Optional

By Matt Mayo on May 24, 2021 in Data Quality, Machine Learning, Production, Validation
Before we reach model training in the pipeline, there are various components like data ingestion, data versioning, data validation, and data pre-processing that need to be executed. In this article, we will discuss data validation, why it is important, its challenges, and more.
Building RESTful APIs using Flask

By Matt Mayo on May 21, 2021 in API, Flask, Python, RESTful API
Learn about using the lightweight web framework in Python from this article.
DataOps: 5 things that you need to know

By Matt Mayo on May 20, 2021 in Data Engineer, Data Engineering, DataOps
DataOps (Data Operations) has assumed a critical role in the age of big data to drive definitive impact on business outcomes. This process-oriented and agile methodology synergizes the components of DevOps and the capabilities of data engineers and data scientists to support data-focused workloads in enterprises. Here is a detailed look at DataOps.
How to Determine if Your Machine Learning Model is Overtrained

By Matt Mayo on May 20, 2021 in Learning, Modeling, Python, Training
WeightWatcher is based on theoretical research (done injoint with UC Berkeley) into Why Deep Learning Works, based on our Theory of Heavy Tailed Self-Regularization (HT-SR). It uses ideas from Random Matrix Theory (RMT), Statistical Mechanics, and Strongly Correlated Systems.
Animated Bar Chart Races in Python

By Matt Mayo on May 18, 2021 in COVID-19, Data Science, Data Visualization, Pandas, Python, Visualization
A quick and step-by-step beginners project to create an animation bar graph for an amazing Covid dataset.
Easy MLOps with PyCaret + MLflow

By Matt Mayo on May 18, 2021 in Machine Learning, MLflow, MLOps, PyCaret, Python
A beginner-friendly, step-by-step tutorial on integrating MLOps in your Machine Learning experiments using PyCaret.

Matt Mayo

Latest Posts

Top Posts