2019 Jan Tutorials, Overviews

All (53) | Courses, Education (1) | News (5) | Opinions (21) | Tutorials, Overviews (26)

ELMo: Contextual Language Embedding

Create a semantic search engine using deep contextualised language representations from ELMo and why context is everything in NLP.

on Jan 31, 2019 in Data Visualization, NLP, Plotly, Python, Word Embeddings
Random forests® explained intuitively

A detailed explanation of random forests, with real life use cases, a discussion into when a random forest is a poor choice relative to other algorithms, and looking at some of the advantages of using random forest.

on Jan 30, 2019 in Decision Trees, Explained, random forests algorithm
Building an image search service from scratch

By the end of this post, you should be able to build a quick semantic search model from scratch, no matter the size of your dataset.

on Jan 30, 2019 in Computer Vision, Image Recognition, NLP, Search, Search Engine, Word Embeddings
7 Steps to Mastering Basic Machine Learning with Python — 2019 Edition

With a new year upon us, I thought it would be a good time to revisit the concept and put together a new learning path for mastering machine learning with Python. With these 7 steps you can master basic machine learning with Python!

on Jan 29, 2019 in 7 Steps, Classification, Clustering, Jupyter, Machine Learning, Python, Regression
Airbnb Rental Listings Dataset Mining

An Exploratory Analysis of Airbnb’s Data to understand the rental landscape in New York City.

on Jan 28, 2019 in AirBnB, Data Exploration, Data Visualization, New York City, R, Real Estate
Machine Learning Security

We take a look at how malicious actors can break machine learning models and what some of the best practices are when it comes to stopping them.

on Jan 25, 2019 in Adversarial, Alexa, Machine Learning, Security
Data Science Project Flow for Startups

The aim of this post, then, is to present the characteristic project flow that I have identified in the working process of both my colleagues and myself in recent years. Hopefully, this can help both data scientists and the people working with them to structure data science projects in a way that reflects their uniqueness.

on Jan 24, 2019 in Data Science, Startups, Workflow
How To Fine Tune Your Machine Learning Models To Improve Forecasting Accuracy

We explain how to retrieve estimates of a model's performance using scoring metrics, before taking a look at finding and diagnosing the potential problems of a machine learning algorithm.

on Jan 23, 2019 in Cross-validation, Forecasting, Machine Learning, Overfitting, Time Series
Building AI to Build AI: The Project That Won the NeurIPS AutoML Challenge

This is an overview of designing a computer program capable of developing predictive models without any manual intervention that are trained & evaluated in a lifelong machine learning setting in NeurIPS 2018 AutoML3 Challenge.

on Jan 23, 2019 in AI, Automated Machine Learning, AutoML, Gradient Boosting, Hyperparameter, NeurIPS
Logistic Regression: A Concise Technical Overview

Logistic Regression is a Regression technique that is used when we have a categorical outcome (2 or more categories). Logistic Regression is one of the most easily interpretable classification techniques in a Data Scientist’s portfolio.

on Jan 23, 2019 in Logistic Regression, Machine Learning
2018’s Top 7 R Packages for Data Science and AI

This is a list of the best packages that changed our lives this year, compiled from my weekly digests.

on Jan 22, 2019 in AI, Data Science, R
Automated Machine Learning in Python

An organization can also reduce the cost of hiring many experts by applying AutoML in their data pipeline. AutoML also reduces the amount of time it would take to develop and test a machine learning model.

on Jan 18, 2019 in Automated Machine Learning, AutoML, H2O, Keras, Machine Learning, Python, scikit-learn
Comparing Machine Learning Models: Statistical vs. Practical Significance

Is model A or B more accurate? Hmm… In this blog post, I’d love to share my recent findings on model comparison.

on Jan 18, 2019 in Machine Learning, Model Performance, P-value, Statistical Modeling, Statistical Significance
How to build an API for a machine learning model in 5 minutes using Flask

Flask is a micro web framework written in Python. It can create a REST API that allows you to send data, and receive a prediction as a response.

By Tim Elfrink on Jan 17, 2019 in API, Flask, Machine Learning, Python
Word Embeddings & Self-Supervised Learning, Explained

There are many algorithms to learn word embeddings. Here, we consider only one of them: word2vec, and only one version of word2vec called skip-gram, which works well in practice.

on Jan 16, 2019 in Andriy Burkov, NLP, Word Embeddings, word2vec
Ontology and Data Science

In simple words, one can say that ontology is the study of what there is. But there is another part to that definition that will help us in the following sections, and that is ontology is usually also taken to encompass problems about the most general features and relations of the entities which do exist.

on Jan 16, 2019 in Data Science, Ontology
How to solve 90% of NLP problems: a step-by-step guide

Read this insightful, step-by-step article on how to use machine learning to understand and leverage text.

By Emmanuel Ameisen on Jan 14, 2019 in LIME, NLP, Text Analytics, Text Classification, Word Embeddings, word2vec
End To End Guide For Machine Learning Projects

Let’s imagine you are attempting to work on a machine learning project. This article will provide you with the step to step guide on the process that you can follow to implement a successful project.

on Jan 14, 2019 in Machine Learning, Workflow
Practical Apache Spark in 10 Minutes

Check out this series of articles on Apache Spark. Each part is a 10 minute tutorial on a particular Apache Spark topic. Read on to get up to speed using Spark.

on Jan 11, 2019 in Apache Spark
Python Patterns: max Instead of if

I often have to loop over a set of objects to find the one with the greatest score. You can use an if statement and a placeholder, but there are more elegant ways!

on Jan 10, 2019 in Programming, Python
Top 10 Books on NLP and Text Analysis

When it comes to choosing the right book, you become immediately overwhelmed with the abundance of possibilities. In this review, we have collected our Top 10 NLP and Text Analysis Books of all time, ranging from beginners to experts.

on Jan 9, 2019 in Books, NLP, Text Analysis
NLP Overview: Modern Deep Learning Techniques Applied to Natural Language Processing

Trying to keep up with advancements at the overlap of neural networks and natural language processing can be troublesome. That's where the today's spotlighted resource comes in.

on Jan 8, 2019 in Deep Learning, Neural Networks, NLP
Comparison of the Text Distance Metrics

There are many different approaches of how to compare two texts (strings of characters). Each has its own advantages and disadvantages and is good only for a range of specific use cases.

on Jan 7, 2019 in Metrics, NLP, Text Analytics
What to do when your training and testing data come from different distributions

However, sometimes only a limited amount of data from the target distribution can be collected. It may not be sufficient to build the needed train/dev/test sets. What to do in such a case? Let us discuss some ideas!

on Jan 4, 2019 in Distribution, Machine Learning, Training Data
The Backpropagation Algorithm Demystified

A crucial aspect of machine learning is its ability to recognize error margins and to interpret data more precisely as rising numbers of datasets are fed through its neural network. Commonly referred to as backpropagation, it is a process that isn’t as complex as you might think.

on Jan 2, 2019 in Backpropagation, Explained, Neural Networks
3 More Google Colab Environment Management Tips

This is a short collection of lessons learned using Colab as my main coding learning environment for the past few months. Some tricks are Colab specific, others as general Jupyter tips, and still more are filesystem related, but all have proven useful for me.

on Jan 2, 2019 in Google, Google Colab, Jupyter, Machine Learning, Python

2019 Jan Tutorials, Overviews

Latest Posts

Top Posts