2020 Jan Tutorials, Overviews
All (91) | Events (7) | News, Education (5) | Opinions (32) | Top Stories, Tweets (9) | Tutorials, Overviews (38)
- How to Optimize Your Jupyter Notebook - Jan 30, 2020.
This article walks through some simple tricks on improving your Jupyter Notebook experience, and covers useful shortcuts, adding themes, automatically generated table of contents, and more.
- Amazon Gets Into the AutoML Race with AutoGluon: Some AutoML Architectures You Should Know About - Jan 30, 2020.
Amazon, Microsoft, Salesforce, Waymo have produced some of the most innovative AutoML architectures in the market.
- Generating English Pronoun Questions Using Neural Coreference Resolution - Jan 29, 2020.
This post will introduce a practical method for generating English pronoun questions from any story or article. Learn how to take an additional step toward computationally understanding language.
- Google Dataset Search Provides Access to 25 Million Datasets - Jan 29, 2020.
Google's dataset search is out of beta, and provides centralized access to 25 million datasets.
- A bird’s-eye view of modern AI from NeurIPS 2019 - Jan 28, 2020.
With the explosion of the field of AI/ML impacting so many applications and industries, there is great value coming out of recent progress. This review highlights many research areas covered at the NeurIPS 2019 conference recently held in Vancouver, Canada, and features many important areas of progress we expect to see in the coming year.
- Exoplanet Hunting Using Machine Learning - Jan 28, 2020.
Search for exoplanets — those planets beyond our own solar system — using machine learning, and implement these searches in Python.
- 2 Questions for a Junior Data Scientist - Jan 24, 2020.
Academic credentials and experience with previous machine learning projects are important for kicking off a data science career. However, landing your first job out of school will require you to extend your thinking about projects and problems. Learn how one interviewer honed in on desired skills by considering these two questions.
- Semi-supervised learning with Generative Adversarial Networks - Jan 24, 2020.
The paper discussed in this post, Semi-supervised learning with Generative Adversarial Networks, utilizes a GAN architecture for multi-label classification.
- How to Get Started With Algorithmic Finance - Jan 23, 2020.
Algorithmic finance has been around for decades as a money-making tool, and it's not magic. Learn about some practical strategies along with and introduction to code you can use to get started.
- NLP Year in Review — 2019 - Jan 23, 2020.
In this blog post, I want to highlight some of the most important stories related to machine learning and NLP that I came across in 2019.
- The Data Science Interview Study Guide - Jan 22, 2020.
Preparing for a job interview can be a full-time job, and Data Science interviews are no different. Here are 121 resources that can help you study and quiz your way to landing your dream data science job.
- The 5 Most Useful Techniques to Handle Imbalanced Datasets - Jan 22, 2020.
This post is about explaining the various techniques you can use to handle imbalanced datasets.
- Random Forest® — A Powerful Ensemble Learning Algorithm - Jan 22, 2020.
The article explains the Random Forest algorithm and how to build and optimize a Random Forest classifier.
- Explaining Black Box Models: Ensemble and Deep Learning Using LIME and SHAP - Jan 21, 2020.
This article will demonstrate explainability on the decisions made by LightGBM and Keras models in classifying a transaction for fraudulence, using two state of the art open source explainability techniques, LIME and SHAP.
- We Created a Lazy AI - Jan 20, 2020.
This article is an overview of how to design and implement reinforcement learning for the real world.
- Microsoft Introduces Project Petridish to Find the Best Neural Network for Your Problem - Jan 20, 2020.
The new algorithm takes a novel approach to neural architecture search.
- 10 Python String Processing Tips & Tricks - Jan 20, 2020.
Pursuing a text analytics path but don't know where to start? Try this string processing primer to first gain an understanding of using Python to manipulate and process strings at a basic level.
- Handling Trees in Data Science Algorithmic Interview - Jan 16, 2020.
This post is about fast-tracking the study and explanation of tree concepts for the data scientists so that you breeze through the next time you get asked these in an interview.
- Disentangling disentanglement: Ideas from NeurIPS 2019 - Jan 15, 2020.
This year’s NEURIPS-2019 Vancouver conference recently concluded and featured a dozen papers on disentanglement in deep learning. What is this idea and why is it so interesting in machine learning? This summary of these papers will give you initial insight in disentanglement as well as ideas on what you can explore next.
- Classify A Rare Event Using 5 Machine Learning Algorithms - Jan 15, 2020.
Which algorithm works best for unbalanced data? Are there any tradeoffs?
- Geovisualization with Open Data - Jan 15, 2020.
In this post I want to show how to use public available (open) data to create geo visualizations in python. Maps are a great way to communicate and compare information when working with geolocation data. There are many frameworks to plot maps, here I focus on matplotlib and geopandas (and give a glimpse of mplleaflet).
- Survey Segmentation Tutorial - Jan 14, 2020.
Learn the basics of verifying segmentation, analyzing the data, and creating segments in this tutorial. When reviewing survey data, you will typically be handed Likert questions (e.g., on a scale of 1 to 5), and by using a few techniques, you can verify the quality of the survey and start grouping respondents into populations.
- Decision Tree Algorithm, Explained - Jan 14, 2020.
All you need to know about decision trees and how to build and optimize decision tree classifier.
- Idiot’s Guide to Precision, Recall, and Confusion Matrix - Jan 13, 2020.
Building Machine Learning models is fun, but making sure we build the best ones is what makes a difference. Follow this quick guide to appreciate how to effectively evaluate a classification model, especially for projects where accuracy alone is not enough.
- Uber Creates Generative Teaching Networks to Better Train Deep Neural Networks - Jan 13, 2020.
The new technique can really improve how deep learning models are trained at scale.
- An Introductory Guide to NLP for Data Scientists with 7 Common Techniques - Jan 9, 2020.
Data Scientists work with tons of data, and many times that data includes natural language text. This guide reviews 7 common techniques with code examples to introduce you the essentials of NLP, so you can begin performing analysis and building models from textual data.
- The Book to Start You on Machine Learning - Jan 9, 2020.
This book is thought for beginners in Machine Learning, that are looking for a practical approach to learning by building projects and studying the different Machine Learning algorithms within a specific context.
- Stock Market Forecasting Using Time Series Analysis - Jan 9, 2020.
Time series analysis will be the best tool for forecasting the trend or even future. The trend chart will provide adequate guidance for the investor. So let us understand this concept in great detail and use a machine learning technique to forecast stocks.
- Top 5 must-have Data Science skills for 2020 - Jan 8, 2020.
The standard job description for a Data Scientist has long highlighted skills in R, Python, SQL, and Machine Learning. With the field evolving, these core competencies are no longer enough to stay competitive in the job market.
- Learning SQL the Hard Way - Jan 8, 2020.
Simply put: This post is about installing SQL, explaining SQL and running SQL.
- 10 Python Tips and Tricks You Should Learn Today - Jan 8, 2020.
Check out this collection of 10 Python snippets that can be taken as a reference for your daily work.
- A Comprehensive Guide to Natural Language Generation - Jan 7, 2020.
Follow this overview of Natural Language Generation covering its applications in theory and practice. The evolution of NLG architecture is also described from simple gap-filling to dynamic document creation along with a summary of the most popular NLG models.
- Introducing Generalized Integrated Gradients (GIG): A Practical Method for Explaining Diverse Ensemble Machine Learning Models - Jan 7, 2020.
There is a need for a new way to explain complex, ensembled ML models for high-stakes applications such as credit and lending. This is why we invented GIG.
- 3 common data science career transitions, and how to make them happen - Jan 6, 2020.
Breaking into a career in Data Science can depend on where you start. See if you fit into one of these three categories of "newbies," and then find out how to make your professional transition into the field.
- H2O Framework for Machine Learning - Jan 6, 2020.
This article is an overview of H2O, a scalable and fast open-source platform for machine learning. We will apply it to perform classification tasks.
- How to Convert a Picture to Numbers - Jan 6, 2020.
Reducing images to numbers makes them amenable to computation. Let's take a look at the why and the how using Python and Numpy.
- Beginner’s Guide to K-Nearest Neighbors in R: from Zero to Hero - Jan 3, 2020.
This post presents a pipeline of building a KNN model in R with various measurement metrics.
- Predict Electricity Consumption Using Time Series Analysis - Jan 2, 2020.
Time series forecasting is a technique for the prediction of events through a sequence of time. In this post, we will be taking a small forecasting problem and try to solve it till the end learning time series forecasting alongside.