2019 Mar

All (48) | Courses, Education (1) | News (2) | Opinions (18) | Tutorials, Overviews (27)

Explaining Random Forest® (with Python Implementation)

We provide an in-depth introduction to Random Forest, with an explanation to how it works, its advantages and disadvantages, important hyperparameters and a full example Python implementation.

on Mar 29, 2019 in Explained, Machine Learning, Python, random forests algorithm
A Beginner’s Guide to Linear Regression in Python with Scikit-Learn

What linear regression is and how it can be implemented for both two variables and multiple variables using Scikit-Learn, which is one of the most popular machine learning libraries for Python.

By Nagesh Singh Chauhan on Mar 29, 2019 in Beginners, Linear Regression, Python, scikit-learn
Interpolation in Autoencoders via an Adversarial Regularizer

Adversarially Constrained Autoencoder Interpolation (ACAI; Berthelot et al., 2018) is a regularization procedure that uses an adversarial strategy to create high-quality interpolations of the learned representations in autoencoders.

on Mar 29, 2019 in Adversarial, AISC, Autoencoder, Machine Learning
D3.js Graph Gallery for Data Visualization

The d3 graph gallery is a collection of 200 simple charts made with d3.js, with reproducible, commented and editable code.

on Mar 28, 2019 in D3.js, Data Visualization, Javascript
7 “Gotchas” for Data Engineers New to Google BigQuery

Here are some things that might take some getting used to when new to Google BigQuery, along with mitigation strategies where I’ve found them.

on Mar 28, 2019 in BigQuery, Data Engineer, Data Engineering, Google
Explainable AI or Halting Faulty Models ahead of Disaster

A brief overview of a new method for explainable AI (XAI), called anchors, introduce its open-source implementation and show how to use it to explain models predicting the survival of Titanic passengers.

on Mar 27, 2019 in AI, Explainable AI, Kaggle, LIME, Titanic, XAI
How to Choose the Right Chart Type

This article presents an infographic for choosing which chart type is most useful in a given scenario. The infographic and chart types are then explored for greater clarity.

on Mar 27, 2019 in Charts, Data Visualization
Data Pipelines, Luigi, Airflow: Everything you need to know

This post focuses on the workflow management system (WMS) Airflow: what it is, what can you do with it, and how it differs from Luigi.

on Mar 27, 2019 in Data Workflow, Pipeline, Python, Workflow
The Four Levels of Analytics Maturity

We outline our four-step model to categorize how successfully a company uses analytics by its ability to show the analytics, uncover underlying trends, and take action based on them.

on Mar 26, 2019 in Analytics, Business, Deployment, Performance, Visualization
Pedestrian Detection in Aerial Images Using RetinaNet

Object Detection in Aerial Images is a challenging and interesting problem. By using Keras to train a RetinaNet model for object detection in aerial images, we can use it to extract valuable information.

on Mar 26, 2019 in AI, Computer Vision, Deep Learning, Keras, Object Detection, Retina Net
The AI Black Box Explanation Problem

Introducing Black Box AI, a system for automated decision making often based on machine learning over big data, which maps a user’s features into a class predicting the behavioural traits of the individuals.

on Mar 25, 2019 in AI, Explainable AI, GDPR
R vs Python for Data Visualization

This article demonstrates creating similar plots in R and Python using two of the most prominent data visualization packages on the market, namely ggplot2 and Seaborn.

on Mar 25, 2019 in Data Visualization, ggplot2, Matplotlib, Python, Python vs R, R, Seaborn
Feature Reduction using Genetic Algorithm with Python

This tutorial discusses how to use the genetic algorithm (GA) for reducing the feature vector extracted from the Fruits360 dataset in Python mainly using NumPy and Sklearn.

on Mar 25, 2019 in Deep Learning, Feature Engineering, Genetic Algorithm, Neural Networks, numpy, Python, scikit-learn
Checklist for Debugging Neural Networks

Check out these tangible steps you can take to identify and fix issues with training, generalization, and optimization for machine learning models.

on Mar 22, 2019 in Checklist, Modeling, Neural Networks, Optimization, Tips, Training
How to Capture Data to Make Business Impact

We take a look at the formula for calculating the efficiency of a data capturing method, before going onto explain the concept of Smart Data.

on Mar 21, 2019 in Analytics, Big Data, Data Science, ROI, Smart Data
Top 8 Data Science Use Cases in Manufacturing

Data science is said to change the manufacturing industry dramatically. Let's take under consideration several data science use cases in manufacturing that have already become common and brought benefits to the manufacturers.

on Mar 21, 2019 in Data Science, Manufacturing, Use Cases
Deep Compression: Optimization Techniques for Inference & Efficiency

We explain deep compression for improved inference efficiency, mobile applications, and regularization as technology cozies up to the physical limits of Moore's law.

on Mar 20, 2019 in Compression, Convolutional Neural Networks, Deep Learning, ICLR, Inference, Optimization, Regularization
Deploy your PyTorch model to Production

This tutorial aims to teach you how to deploy your recently trained model in PyTorch as an API using Python.

on Mar 20, 2019 in Data Science Education, Data Scientist, Deep Learning, Flask, Programming, Python, PyTorch
Mastering Fast Gradient Boosting on Google Colaboratory with free GPU

CatBoost is a fast implementation of GBDT with GPU support out-of-the-box. Google Colaboratory is a very useful tool with free GPU support.

on Mar 19, 2019 in CatBoost, Google Colab, GPU, Gradient Boosting, Machine Learning, Python, Yandex
How to Train a Keras Model 20x Faster with a TPU for Free

This post shows how to train an LSTM Model using Keras and Google CoLaboratory with TPUs to exponentially reduce training time compared to a GPU on your local machine.

on Mar 19, 2019 in Deep Learning, Google Colab, Keras, Python, TensorFlow, TPU
8 Reasons Why You Should Get a Microsoft Azure Certification

With huge and growing popularity of Microsoft Azure, getting that certification will advance your career. Consider these 8 reasons for taking an Azure certification course

on Mar 18, 2019 in Certification, Cloud Computing, Microsoft Azure, Online Education, Simplilearn
Artificial Neural Networks Optimization using Genetic Algorithm with Python

This tutorial explains the usage of the genetic algorithm for optimizing the network weights of an Artificial Neural Network for improved performance.

on Mar 18, 2019 in AI, Algorithms, Deep Learning, Machine Learning, Neural Networks, numpy, Optimization, Python
[eBook] Standardizing the Machine Learning Lifecycle

We explore what makes the machine learning lifecycle so challenging compared to regular software, and share the Databricks approach.

on Mar 15, 2019 in Databricks, ebook, Life Cycle, Machine Learning, MLflow
Top R Packages for Data Cleaning

Data cleaning is one of the most important and time consuming task for data scientists. Here are the top R packages for data cleaning.

on Mar 15, 2019 in Data Cleaning, Data Preparation, Data Science, Machine Learning, R
Building NLP Classifiers Cheaply With Transfer Learning and Weak Supervision

In this blog, I’ll walk you through a personal project in which I cheaply built a classifier to detect anti-semitic tweets, with no public dataset available, by combining weak supervision and transfer learning.

on Mar 15, 2019 in Bias, fast.ai, NLP, Python, Text Classification, Transfer Learning, Twitter, ULMFiT
My favorite mind-blowing Machine Learning/AI breakthroughs

We present some of our favorite breakthroughs in Machine Learning and AI in recent times, complete with papers, video links and brief summaries for each.

on Mar 14, 2019 in AI, AlphaStar, GANs, Generative Adversarial Network, Machine Learning, Machine Translation, Reinforcement Learning, Robots
Cartoon: AI and March Madness

AI has mastered chess, Go, and other games, but can AI master March Madness? KDnuggets Cartoon imagines one scenario when this happens.

on Mar 14, 2019 in AI, Basketball, Cartoon, March Madness, Sports
[PDF] Executive Guide To Machine Learning

The Executive Guide covers the benefits to your business, the build-or-buy process, and gives a practical overview for implementing ML in your organization.

on Mar 13, 2019 in ActiveState, ebook, Machine Learning
Towards Automatic Text Summarization: Extractive Methods

The basic idea looks simple: find the gist, cut off all opinions and detail, and write a couple of perfect sentences, the task inevitably ended up in toil and turmoil. Here is a short overview of traditional approaches that have beaten a path to advanced deep learning techniques.

on Mar 13, 2019 in Bayesian, Deep Learning, Machine Learning, Sciforce, Text Analysis, Text Mining, Topic Modeling
Object Detection with Luminoth

In this article you will learn about Luminoth, an open source computer vision library which sits atop Sonnet and TensorFlow and provides object detection for images and video.

on Mar 13, 2019 in Computer Vision, Image Recognition, Object Detection, Python
AI: Arms Race 2.0

An analysis of the current state of the competition between US, Europe, and China in AI, examining research, patent publications, global datasphere, devices and IoT, people, and more.

on Mar 12, 2019 in AI, China, Deep Learning, Europe, Investment, IoT, Machine Learning, Neural Networks, Startups, Trends, USA
People Tracking using Deep Learning

Read this overview of people tracking and how deep learning-powered computer vision has allowed for phenomenal performance.

on Mar 12, 2019 in Deep Learning, Image Recognition, Object Detection
Who is a typical Data Scientist in 2019?

We investigate what a typical data scientist looks like and see how this differs from this time last year, looking at skill set, programming languages, industry of employment, country of employment, and more.

on Mar 11, 2019 in Career, Data Science Skills, Data Scientist, Industry, MATLAB, Python, R, SQL
The Pareto Principle for Data Scientists

In this article, I’ll share a few ways in which we, as data scientists, can use the power of the Pareto Principle to guide our day-to-day activities.

on Mar 11, 2019 in Data Science, Data Scientist
Beating the Bookies with Machine Learning

We investigate how to use a custom loss function to identify fair odds, including a detailed example using machine learning to bet on the results of a darts match and how this can assist you in beating the bookmaker.

on Mar 8, 2019 in Machine Learning, PyTorch, Sports, Statistics
19 Inspiring Women in AI, Big Data, Data Science, Machine Learning

For the 2019 international women's day, we profile a new set of 19 inspiring women who lead the field in AI, Big Data, Data Science, and Machine Learning fields.

on Mar 8, 2019 in AI, Data Science, Machine Learning, Women
Designing Ethical Algorithms

Ethical algorithm design is becoming a hot topic as machine learning becomes more widespread. But how do you make an algorithm ethical? Here are 5 suggestions to consider.

on Mar 8, 2019 in AI, Algorithms, Bias, Ethics, Machine Learning
Breaking neural networks with adversarial attacks

We develop an intuition behind "adversarial attacks" on deep neural networks, and understand why these attacks are so successful.

on Mar 7, 2019 in Adversarial, Deep Learning, Neural Networks, Privacy
Beyond news contents: the role of social context for fake news detection

Today we’re looking at a more general fake news problem: detecting fake news that is being spread on a social network. This is a summary of a recent paper which demonstrates why we should also look at the social context: the publishers and the users spreading the information!

on Mar 7, 2019 in Fake News, NLP, Social Media
3 Reasons Why AutoML Won’t Replace Data Scientists Yet

We dispel the myth that AutoML is replacing Data Scientists jobs by highlighting three factors in Data Science development that AutoML can’t solve.

on Mar 6, 2019 in Automated Machine Learning, Automation, AutoML, Data Scientist, Feature Engineering, Reinforcement Learning
Another 10 Free Must-Read Books for Machine Learning and Data Science

Here's a third set of 10 free books for machine learning and data science. Have a look to see if something catches your eye, and don't forget to check the previous installments for reading material while you're here.

on Mar 6, 2019 in Books, Data Science, ebook, Free ebook, Machine Learning
Deconstructing BERT, Part 2: Visualizing the Inner Workings of Attention

In this post, the author shows how BERT can mimic a Bag-of-Words model. The visualization tool from Part 1 is extended to probe deeper into the mind of BERT, to expose the neurons that give BERT its shape-shifting superpowers.

on Mar 6, 2019 in Attention, BERT, NLP, Word Embeddings
Neural Networks with Numpy for Absolute Beginners: Introduction

In this tutorial, you will get a brief understanding of what Neural Networks are and how they have been developed. In the end, you will gain a brief intuition as to how the network learns.

on Mar 5, 2019 in Beginners, Neural Networks, numpy, Python
GANs Need Some Attention, Too

Self-Attention Generative Adversarial Networks (SAGAN; Zhang et al., 2018) are convolutional neural networks that use the self-attention paradigm to capture long-range spatial relationships in existing images to better synthesize new images.

on Mar 5, 2019 in AISC, Attention, Deep Learning, GANs, Image Generation, Machine Learning
The Difference Between Data Scientists and Data Engineers

ODSC East 2019 has multiple tracks for both Data Scientists and Data Engineers, including workshops, talks, and training sessions. Save 45% with code KDN45.

on Mar 4, 2019 in Boston, Data Engineer, Data Science Skills, Data Scientist, MA, ODSC
On Building Effective Data Science Teams

We take a look at the qualities that make a successful data team in order to help business leaders and executives create better AI strategies.

on Mar 4, 2019 in CRISP-DM, Data Analyst, Data Engineering, Data Governance, Data Science Team, Machine Learning Engineer
OpenAI’s GPT-2: the model, the hype, and the controversy

OpenAI recently released a very large language model called GPT-2. Controversially, they decided not to release the data or the parameters of their biggest model, citing concerns about potential abuse. Read this researcher's take on the issue.

on Mar 4, 2019 in AI, Ethics, GPT-2, Hype, NLP, OpenAI
Comparing MobileNet Models in TensorFlow

MobileNets are a family of mobile-first computer vision models for TensorFlow, designed to effectively maximize accuracy while being mindful of the restricted resources for an on-device or embedded application.

on Mar 1, 2019 in Computer Vision, Mobile, Neural Networks, TensorFlow

2019 Mar

Latest Posts

Top Posts