A selection of top tips to obtain great results on Kaggle leaderboards, including useful code examples showing how best to use Latitude and Longitude features.
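The article's own code examples aren't reproduced here, but as a minimal stdlib-only sketch of two standard latitude/longitude transformations (function names are our own): the haversine distance to a reference point as a derived feature, and projection onto the unit sphere so models see smooth x/y/z values instead of a longitude that wraps at ±180°.

```python
import math

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two (lat, lon) points."""
    r = 6371.0  # mean Earth radius in km
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dp = math.radians(lat2 - lat1)
    dl = math.radians(lon2 - lon1)
    a = math.sin(dp / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dl / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))

def to_cartesian(lat, lon):
    """Project (lat, lon) onto the unit sphere -> smooth x/y/z features."""
    la, lo = math.radians(lat), math.radians(lon)
    return math.cos(la) * math.cos(lo), math.cos(la) * math.sin(lo), math.sin(la)

# Example derived feature: distance from Berlin to Paris, roughly 878 km
print(haversine_km(52.52, 13.405, 48.857, 2.352))
```

Distance-to-landmark and Cartesian projections are two of the more common ways to make raw coordinates useful to tree-based and linear models alike.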
Looking for papers with code? If so, this GitHub repository, a clearinghouse for research papers and their corresponding implementation code, is definitely worth checking out.
There are two main tasks in speech processing: transforming speech to text, and converting text into human speech. We describe the general aspects of each API and then compare their main features in a table.
An extensive look at the history of machine learning models, using historical data from the number of publications of each type to attempt to answer the question: what is the most popular model?
In support of the explainable AI cause, we present a variety of use cases covering operational needs, regulatory compliance, public trust, and social acceptance.
A brief rundown of methods, packages, and ideas for generating synthetic data for self-driven data science projects, with a deep dive into machine learning methods.
BERT’s key technical innovation is applying the bidirectional training of Transformer, a popular attention model, to language modelling. It has caused a stir in the Machine Learning community by presenting state-of-the-art results in a wide variety of NLP tasks.
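The masked-token objective behind that bidirectional training can be illustrated in a few lines of plain Python (a conceptual sketch only: real BERT masking works on subword pieces, and sometimes swaps in random or unchanged tokens instead of [MASK]):

```python
import random

def mask_tokens(tokens, mask_rate=0.15, seed=0):
    """Replace a random subset of tokens with [MASK]; the model must predict
    them from BOTH left and right context -- the bidirectional objective."""
    rng = random.Random(seed)
    masked, targets = [], {}
    for i, tok in enumerate(tokens):
        if rng.random() < mask_rate:
            masked.append("[MASK]")
            targets[i] = tok  # the label the model is trained to recover
        else:
            masked.append(tok)
    return masked, targets

masked, targets = mask_tokens("the cat sat on the mat".split(), seed=3)
print(masked, targets)
```

Because the prediction target is an interior token rather than the next token, the model is free to attend to context on both sides, which is the stir-causing change relative to left-to-right language models.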
What makes decision trees special in the realm of ML models is really their clarity of information representation. The “knowledge” learned by a decision tree through training is directly formulated into a hierarchical structure.
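That hierarchical "knowledge" can be sketched in plain Python: the learned structure is just nested if/else splits that a human can read directly (the tree, feature names, and thresholds below are invented for illustration):

```python
# A toy tree as it might be learned for loan approval (hypothetical
# feature names and thresholds, for illustration only).
tree = {
    "feature": "income", "threshold": 50_000,
    "left":  {"feature": "debt", "threshold": 10_000,
              "left": {"label": "approve"}, "right": {"label": "deny"}},
    "right": {"label": "approve"},
}

def predict(node, sample):
    """Walk the hierarchy: each split is a readable if/else on one feature."""
    while "label" not in node:
        branch = "left" if sample[node["feature"]] <= node["threshold"] else "right"
        node = node[branch]
    return node["label"]

print(predict(tree, {"income": 42_000, "debt": 5_000}))   # → approve
print(predict(tree, {"income": 42_000, "debt": 20_000}))  # → deny
```

Every prediction corresponds to one root-to-leaf path, which is why the representation is so easy to inspect compared with, say, a neural network's weights.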
A brief introduction to feature engineering, covering coordinate transformation, continuous data, categorical features, missing values, normalization, and more.
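As a minimal, dependency-free sketch of two of the listed steps, one-hot encoding for categorical features and min-max normalization for continuous data (function names are our own):

```python
def one_hot(values):
    """Encode a categorical column as binary indicator features."""
    categories = sorted(set(values))
    return [[1 if v == c else 0 for c in categories] for v in values], categories

def min_max(values):
    """Rescale a continuous column to the [0, 1] range."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]

encoded, cats = one_hot(["red", "blue", "red"])
print(cats, encoded)          # ['blue', 'red'] [[0, 1], [1, 0], [0, 1]]
print(min_max([10, 20, 30]))  # [0.0, 0.5, 1.0]
```

In practice you would fit the category list and min/max on the training split only, then apply them to new data, to avoid leaking information from the test set.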
By following six critical steps to prepare data for both analytics and machine learning initiatives, teams can accelerate their data science projects and deliver an immersive business-consumer experience that speeds up and automates the data-to-insight pipeline.
We explain the key differences between explainability and interpretability and why they're so important for machine learning and AI, before taking a look at several techniques and methods for improving machine learning interpretability.
This article provides an overview of recent trends in machine learning and data science automation tools and addresses how those tools will change data science.
This is a collection of data science, machine learning, analytics, and AI predictions for next year from a number of top industry organizations. See what the insiders feel is on the horizon for 2019!
At Figure Eight, we're big believers in active learning. We think it holds the promise to better models, and that it's just about to go mainstream. In our new eBook, An Introduction to Active Learning, we cover the essentials. Download now!
This tutorial helps explain the central limit theorem, covering populations and samples, sampling distribution, intuition, and contains a useful video so you can continue your learning.
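A quick simulation makes the theorem concrete: sample means drawn from a decidedly non-normal Uniform(0, 1) population cluster around the population mean with standard error σ/√n (a stdlib-only sketch, not the tutorial's own code):

```python
import random, statistics, math

random.seed(42)
n, trials = 30, 2000
# Draw many samples of size n and record each sample's mean.
sample_means = [statistics.mean(random.random() for _ in range(n))
                for _ in range(trials)]

# CLT: means center on the population mean (0.5) with spread
# sigma / sqrt(n), where sigma = sqrt(1/12) for Uniform(0, 1).
print(statistics.mean(sample_means))                       # close to 0.5
print(statistics.stdev(sample_means),
      "vs predicted", math.sqrt(1 / 12) / math.sqrt(n))    # both ~0.053
```

Plotting `sample_means` as a histogram shows the characteristic bell shape emerging even though the underlying population is flat.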
Also 5 Data Science Projects That Will Get You Hired in 2018; Top 20 Python AI and Machine Learning Open Source Projects; Neural network AI is simple. So... Stop pretending you are a genius.
But it’s hard to avoid becoming a generalist if you don’t know which common problem classes you could specialize in in the first place. That’s why I put together a list of the five problem classes that are often lumped together under the “data science” heading.
In this post we summarise some of the key developments in deep learning in the second half of 2018, before briefly discussing the road ahead for the deep learning community.
This article teaches you how to use transfer learning to solve image classification problems. A practical example using Keras and its pre-trained models is given for demonstration purposes.
We discuss several explainability techniques being championed today, including LOCO (leave one column out), permutation importance, and LIME (local interpretable model-agnostic explanations).
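The permutation idea is simple enough to sketch without any library: shuffle one column and see how much the score drops (toy model and data below are invented for illustration; this is not LOCO or LIME, just the permutation technique):

```python
import random

def accuracy(model, X, y):
    return sum(model(row) == label for row, label in zip(X, y)) / len(y)

def permutation_importance(model, X, y, col, seed=0):
    """Shuffle one column and measure the accuracy drop --
    a large drop means the model relies on that feature."""
    base = accuracy(model, X, y)
    shuffled = [row[:] for row in X]
    column = [row[col] for row in shuffled]
    random.Random(seed).shuffle(column)
    for row, v in zip(shuffled, column):
        row[col] = v
    return base - accuracy(model, shuffled, y)

# Toy data: feature 0 determines the label, feature 1 is pure noise.
X = [[i, random.Random(i).random()] for i in range(100)]
y = [1 if row[0] >= 50 else 0 for row in X]

def model(row):           # a stand-in "trained" model
    return 1 if row[0] >= 50 else 0

print(permutation_importance(model, X, y, col=0))  # large drop
print(permutation_importance(model, X, y, col=1))  # 0.0: unused noise column
```

Because it only needs predictions, the same recipe works on any fitted model, which is what makes it (like LOCO and LIME) model-agnostic.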
In this post, I will show you how you can tune the hyperparameters of your existing keras models using Hyperas and run everything in a Google Colab Notebook.
As we bid farewell to one year and look to ring in another, KDnuggets has solicited opinions from numerous Machine Learning and AI experts as to the most important developments of 2018 and their 2019 key trend predictions.
We clarify some important and often-overlooked distinctions between Machine Learning and Data Science, covering education, scalable vs non-scalable jobs, career paths, and more.
An overview of the current situation for data scientists, from its origins and history, to the recent growth in job postings, and looking at what changes the future might bring.
When I first heard about Machine Learning, I couldn't contain my amazement. I couldn't get my mind around the fact that, unlike the conventional software programs I was accustomed to, I wouldn't even have to teach a computer the "how" in detail for every future scenario up front.
We report on the most popular IDEs and Editors, based on our poll. Jupyter is the favorite across all regions and employment types, but there is competition for the no. 2 and no. 3 spots.
In an effort to further refine our internal models, this post will present an overview of Aurélien Géron's Machine Learning Project Checklist, as seen in his bestselling book, "Hands-On Machine Learning with Scikit-Learn & TensorFlow."
We examine typical mistakes in the Data Science process, including poor data visualization, incorrect handling of missing values, wrong transformation of categorical variables, and more. Learn what to avoid!
The aim of this article is to give you a good understanding of existing, traditional model interpretation methods, their limitations and challenges. We will also cover the classic model accuracy vs. model interpretability trade-off and finally take a look at the major strategies for model interpretation.
There are many techniques to detect and optionally remove outliers from a dataset. In this blog post, we show an implementation in KNIME Analytics Platform of four of the most frequently used techniques for outlier detection, both traditional and novel.
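The post implements its techniques in KNIME; as a language-agnostic illustration of one of the classics, here are Tukey's IQR fences in plain Python (our own sketch, not the KNIME workflow):

```python
import statistics

def iqr_outliers(values, k=1.5):
    """Flag points outside Tukey's fences: [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = statistics.quantiles(values, n=4)  # quartile cut points
    iqr = q3 - q1
    lo, hi = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lo or v > hi]

data = [10, 12, 11, 13, 12, 11, 95, 10, 13, 12]
print(iqr_outliers(data))  # [95]
```

The multiplier `k=1.5` is the conventional default; widening it to 3 flags only extreme outliers, which is often the safer choice before deleting anything.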
A demonstration using an analysis of Berlin rental prices, covering how to extract data from the web and clean it, gaining deeper insights, engineering of features using external APIs, and more.
When people want to launch data science careers but haven't made the first move, they're in a scenario that's understandably daunting and full of uncertainty. Here are six steps to get started.
Download this immediately useful book chapter, and learn how to create derived variables, which allow statistical and Data Science models to incorporate human insights.
The best way to create better data science projects that employers want to see is to provide a business impact. This article highlights the process using customer churn prediction in R as a case study.
It’s important to understand why we balance classes, so that we can be sure it’s a valuable investment: class balancing techniques are only really necessary when we actually care about the minority classes.
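As a minimal sketch of one common balancing technique, random oversampling of the minority class (function names and toy data invented for illustration; libraries like imbalanced-learn offer more sophisticated variants):

```python
import random

def oversample(rows, labels, seed=0):
    """Duplicate randomly chosen minority-class rows until every
    class matches the size of the largest one."""
    rng = random.Random(seed)
    by_class = {}
    for row, lab in zip(rows, labels):
        by_class.setdefault(lab, []).append(row)
    target = max(len(group) for group in by_class.values())
    out_rows, out_labels = [], []
    for lab, group in by_class.items():
        extra = [rng.choice(group) for _ in range(target - len(group))]
        out_rows += group + extra
        out_labels += [lab] * target
    return out_rows, out_labels

rows = [[i] for i in range(10)]
labels = [0] * 8 + [1] * 2        # 8:2 imbalance
_, new_labels = oversample(rows, labels)
print(new_labels.count(0), new_labels.count(1))  # 8 8
```

Oversample only the training split, never the evaluation data, or the duplicated rows will inflate your test metrics.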
Review of 2018 and Predictions for 2019 from our panel of experts, including Meta Brown, Tom Davenport, Carla Gentry, Bob E Hayes, Cassie Kozyrkov, Doug Laney, Bill Schmarzo, Kate Strachnyi, Ronald van Loon, Favio Vazquez, and Jen Underwood.
We cover a variety of topics, from machine learning to deep learning, from data visualization to data tools, with comments and explanations from experts in the relevant fields.