2017 Nov Tutorials, Overviews

Evolutionary Algorithms for Feature Selection

Feature selection is a very important technique in machine learning. In this post we discuss one of the most common optimization algorithms for multi-modal fitness landscapes - evolutionary algorithms.

on Nov 29, 2017 in Evolutionary Algorithm, Feature Selection, RapidMiner
Why You Should Forget ‘for-loop’ for Data Science Code and Embrace Vectorization

Data science needs fast computation and transformation of data. NumPy objects in Python provides that advantage over regular programming constructs like for-loop. How to demonstrate it in few easy lines of code?

on Nov 29, 2017 in numpy, Python, Scientific Computing
Natural Language Processing Library for Apache Spark – free to use

Introducing the Natural Language Processing Library for Apache Spark - and yes, you can actually use it for free! This post will give you a great overview of John Snow Labs NLP Library for Apache Spark.

on Nov 28, 2017 in Apache Spark, API, GitHub, John Snow Labs, Machine Learning, NLP
How To Unit Test Machine Learning Code

One of the main principles I learned during my time at Google Brain was that unit tests can make or break your algorithm and can save you weeks of debugging and training time.

on Nov 28, 2017 in Machine Learning, Neural Networks, Python, Software Engineering, TensorFlow
How (and Why) to Create a Good Validation Set

The definitions of training, validation, and test sets can be fairly nuanced, and the terms are sometimes inconsistently used. In the deep learning community, “test-time inference” is often used to refer to evaluating on data in production, which is not the technical definition of a test set.

on Nov 24, 2017 in Cross-validation, Datasets, Rachel Thomas, Training Data, Validation
Understanding Objective Functions in Neural Networks

This blog post is targeted towards people who have experience with machine learning, and want to get a better intuition on the different objective functions used to train neural networks.

on Nov 23, 2017 in Cost Function, Deep Learning, Gradient Descent, Neural Networks, Optimization
Building a Wikipedia Text Corpus for Natural Language Processing

Wikipedia is a rich source of well-organized textual data, and a vast collection of knowledge. What we will do here is build a corpus from the set of English Wikipedia articles, which is freely and conveniently available online.

on Nov 23, 2017 in Datasets, Natural Language Processing, NLP, Text Mining, Wikidata, Wikipedia
A Framework for Approaching Textual Data Science Tasks

Although NLP and text mining are not the same thing, they are closely related, deal with the same raw data type, and have some crossover in their uses. Let's discuss the steps in approaching these types of tasks.

on Nov 22, 2017 in Modeling, Natural Language Processing, NLP, Text Analytics, Text Mining
Best Masters in Data Science and Analytics in US/Canada

Second comprehensive list of master's degrees in the US and Canada with tuition information and duration.

on Nov 21, 2017 in Canada, Master of Science, MS in Analytics, MS in Business Analytics, MS in Data Science, USA
Estimating an Optimal Learning Rate For a Deep Neural Network

This post describes a simple and powerful way to find a reasonable learning rate for your neural network.

on Nov 21, 2017 in Deep Learning, Hyperparameter, Neural Networks
Automated Feature Engineering for Time Series Data

We introduce a general framework for developing time series models, generating features and preprocessing the data, and exploring the potential to automate this process in order to apply advanced machine learning algorithms to almost any time series problem.

on Nov 20, 2017 in Automated Machine Learning, Data Preparation, Feature Engineering, Feature Selection, Time Series
Top 10 Videos on Deep Learning in Python

Playlists, individual tutorials (not part of a playlist) and online courses on Deep Learning (DL) in Python using the Keras, Theano, TensorFlow and PyTorch libraries. Assumes no prior knowledge. These videos cover all skill levels and time constraints!

on Nov 17, 2017 in Deep Learning, Keras, Python, PyTorch, TensorFlow, Theano, Top 10, Tutorials, Videolectures, Youtube
8 Ways to Improve Your Data Science Skills in 2 Years

Two years. Two years is the maximum amount of time you should spend focused on your learning, education and training. That’s exactly why this guide is focused on honing the most beneficial skills in two years.

on Nov 17, 2017 in Data Science, Data Science Skills, Skills, Training
PySpark SQL Cheat Sheet: Big Data in Python

PySpark is a Spark Python API that exposes the Spark programming model to Python - With it, you can speed up analytic applications. With Spark, you can get started with big data processing, as it has built-in modules for streaming, SQL, machine learning and graph processing.

on Nov 16, 2017 in Apache Spark, Big Data, DataCamp, Python, SQL
The 10 Statistical Techniques Data Scientists Need to Master

The author presents 10 statistical techniques which a data scientist needs to master. Build up your toolbox of data science tools by having a look at this great overview post.

on Nov 15, 2017 in Algorithms, Data Science, Data Scientist, Machine Learning, Statistical Learning, Statistics
Best Online Masters in Data Science and Analytics – a comprehensive, unbiased survey

The first comprehensive and objective survey of online Masters in Analytics / Data Science, including rankings, tuition, and duration of the education program.

on Nov 14, 2017 in Master of Science, MS in Analytics, MS in Business Analytics, MS in Data Science, Online Education
Extracting Tweets With R

This article will give you a great, brief overview for extracting Tweets using R.

on Nov 14, 2017 in R, Twitter
Machine Learning Algorithms: Which One to Choose for Your Problem

This article will try to explain basic concepts and give some intuition of using different kinds of machine learning algorithms in different tasks. At the end of the article, you’ll find the structured overview of the main features of described algorithms.

on Nov 14, 2017 in Algorithms, Machine Learning, Reinforcement Learning, Statsbot, Supervised Learning, Unsupervised Learning
The Qualitative Side of Quantitative Research

Kevin and Koen may buy the same brand for the same reasons. On the other hand, they may buy the same brand for different reasons, or buy different brands for the same reasons, or even different brands for different reasons. The brands they purchase and the reasons why may vary by occasion, too.

on Nov 9, 2017 in Qualitative Analytics, Qualitative Research, Quantitative Analytics, Research
TensorFlow: What Parameters to Optimize?

Learning TensorFlow Core API, which is the lowest level API in TensorFlow, is a very good step for starting learning TensorFlow because it let you understand the kernel of the library. Here is a very simple example of TensorFlow Core API in which we create and train a linear regression model.

on Nov 9, 2017 in Neural Networks, Optimization, Python, TensorFlow
Tips for Getting Started with Text Mining in R and Python

This article opens up the world of text mining in a simple and intuitive way and provides great tips to get started with text mining.

on Nov 8, 2017 in Python, R, Text Mining
Interpreting Machine Learning Models: An Overview

This post summarizes the contents of a recent O'Reilly article outlining a number of methods for interpreting machine learning models, beyond the usual go-to measures.

on Nov 7, 2017 in Interpretability, Machine Learning, Modeling, O'Reilly
Blockchain Key Terms, Explained

Need a quick glance over some important definitions associated with the Blockchain? Then consider this article your Blockchain Definitions 101!

on Nov 3, 2017 in Bitcoin, Blockchain, Cryptocurrency, Explained, Hashing, Key Terms
Want to know how Deep Learning works? Here’s a quick guide for everyone

Once you’ve read this article, you will understand the basics of AI and ML. More importantly, you will understand how Deep Learning, the most popular type of ML, works.

on Nov 3, 2017 in Deep Learning, Neural Networks
Process Mining with R: Introduction

In the past years, several niche tools have appeared to mine organizational business processes. In this article, we’ll show you that it is possible to get started with “process mining” using well-known data science programming languages as well.

on Nov 2, 2017 in Data Mining, Data Science, Process Mining, R
3 different types of machine learning

In this extract from “Python Machine Learning” a top data scientist Sebastian Raschka explains 3 main types of machine learning: Supervised, Unsupervised and Reinforcement Learning. Use code PML250KDN to save 50% off the book cost.

on Nov 1, 2017 in Classification, Clustering, Machine Learning, Regression, Reinforcement Learning, Supervised Learning
Conjoint Analysis: A Primer

Conjoint is another of those things everyone talks about but many are confused about…

on Nov 1, 2017 in Statistical Analysis, Statistics
Getting Started with Machine Learning in One Hour!

Here is a machine learning getting started guide which grew out of the author's notes for a one hour talk on the subject. Hopefully you find the path helpful.

on Nov 1, 2017 in Beginners, Machine Learning

2017 Nov Tutorials, Overviews

Latest Posts

Top Posts