2019 Dec

All (54) | News (2) | Opinions (27) | Top Stories, Tweets (1) | Tutorials, Overviews (24)

Towards a Quantitative Measure of Intelligence: Breaking Down One of the Most Important AI Papers of 2019, Part II

AI scientist Francois Chollet proposes a better framework for measuring the intelligence of AI systems.

on Dec 31, 2019 in AI, Francois Chollet, Research
How To “Ultralearn” Data Science: summary, for those in a hurry

For those of you in a hurry and interested in ultralearning (which should be all of you), this recap reviews the approach and summarizes its key elements -- focus, optimization, and deep understanding with experimentation -- geared toward learning Data Science.

on Dec 30, 2019 in Advice, Data Science, Experimentation, Optimization, Ultralearn
How To “Ultralearn” Data Science: deep understanding and experimentation, Part 4

In this fourth and final part of the ultralearning data science series, it's time to take the final steps toward developing a deep understanding of the fundamentals and learning how to experiment -- the two aspects that are the ultimate keys to ultralearning.

on Dec 27, 2019 in Advice, Data Science, Experimentation, Ultralearn
Fighting Overfitting in Deep Learning

This post outlines an attack plan for fighting overfitting in neural networks.

on Dec 27, 2019 in Deep Learning, Keras, Neural Networks, Overfitting, Python, Regularization, Transfer Learning
10 Best and Free Machine Learning Courses, Online

Getting ready to leap into the world of Data Science? Consider these top machine learning courses curated by experts to help you learn and thrive in this exciting field.

on Dec 26, 2019 in Coursera, Data Science Education, Deep Learning, edX, Machine Learning, Online Education
Random Forest® vs Neural Networks for Predicting Customer Churn

Let us see how random forest competes with neural networks for solving a real world business problem.

on Dec 26, 2019 in Churn, Customer Analytics, Neural Networks, random forests algorithm
KDnuggets Cartoon in an English textbook?

KDnuggets is not only for learning about AI, Data Science, and Machine Learning. A KDnuggets cartoon has been included in an English language and culture textbook for French high-school students.

on Dec 24, 2019 in About KDnuggets, Cartoon, France, Self-Driving Car
Market Basket Analysis: A Tutorial

This article is about Market Basket Analysis & the Apriori algorithm that works behind it.

on Dec 24, 2019 in Apriori, Association Rules, Data Mining, Python
What is Data Catalog and Why You Should Care?

Learn why data catalogs could be just the thing you need to meet the challenges of data and metadata management and collaboration.

on Dec 23, 2019 in Compliance, Consistency, Data Catalog, Data Governance, Datasets, Metadata, Reddit
What is a Data Scientist Worth?

What is the Salary of a Data Scientist in 2019? Let's have a look at some data to see how we can answer that question.

on Dec 23, 2019 in Data Science, Salary, StackOverflow
Google’s New Explainable AI Service

Google has started offering a new service for “explainable AI” or XAI, as it is fashionably called. Presently offered tools are modest, but the intent is in the right direction.

on Dec 20, 2019 in AI, Explainability, Explainable AI, Google
The Most In Demand Tech Skills for Data Scientists

By the end of this article you’ll know which technologies are becoming more popular with employers and which are becoming less popular.

on Dec 20, 2019 in Career Advice, Data Science, Data Science Skills, Data Scientist
Alternative Cloud Hosted Data Science Environments

Over the years new alternative providers have risen to provided a solitary data science environment hosted on the cloud for data scientist to analyze, host and share their work.

on Dec 19, 2019 in Big Data, Cloud Computing, Data Science, Jupyter, Saturn Cloud
Interpretability part 3: opening the black box with LIME and SHAP

The third part in a series on leveraging techniques to take a look inside the black box of AI, this guide considers methods that try to explain each prediction instead of establishing a global explanation.

on Dec 19, 2019 in Explainability, Interpretability, LIME, SHAP
5 Ways to Apply Ethics to AI

Here are six more lessons based on real life examples that I think we should all remember as people working in machine learning, whether you’re a researcher, engineer, or a decision-maker.

on Dec 19, 2019 in Algorithms, Bias, Ethics, Goodhart’s Law, Machine Learning, Social Good
Ontotext Platform 3.0 for Enterprise Knowledge Graphs Released

Ontotext Platform 3.0 features significant technology improvements to enable simpler and faster graph navigation, including GraphQL interfaces to make it easier for application developers to access knowledge graphs without tedious development of back-end APIs or complex SPARQL.

on Dec 18, 2019 in GraphDB, Graphs, Knowledge Graph, Platform, SPARQL
The 4 fastest ways NOT to get hired as a data scientist

Ready to try to get hired as a data scientist for the first time? Avoiding these common mistakes won’t guarantee an offer, but not avoiding them is a sure fire way for your application to be tossed into the trash bin.

on Dec 18, 2019 in Advice, Career, Data Scientist, Hiring
Automatic Text Summarization in a Nutshell

Marketing scientist Kevin Gray asks Dr. Anna Farzindar of the University of Southern California about Automatic Text Summarization and the various ways it is used.

on Dec 18, 2019 in NLP, Text Analytics, Text Summarization
The ravages of concept drift in stream learning applications and how to deal with it

Stream data processing has gained progressive momentum with the arriving of new stream applications and big data scenarios. These streams of data evolve generally over time and may be occasionally affected by a change (concept drift). How to handle this change by using detection and adaptation mechanisms is crucial in many real-world systems.

on Dec 18, 2019 in IoT, Learning, Real-time
Top 2019 Stories: Top 10 Technology Trends of 2019; How to select rows and columns in Pandas

Also: Your AI skills are worth less than you think; Another 10 Free Must-See Courses for Machine Learning and Data Science.

on Dec 17, 2019 in Top stories
How To “Ultralearn” Data Science: removing distractions and finding focus, Part 2

This second part in a series about how to "ultralearn" data science will guide you through several techniques to remove those distractions -- because your focus needs more focus.

on Dec 17, 2019 in Beginners, Data Science, Education, Ultralearn
Let’s Build an Intelligent Chatbot

Check out this step by step approach to building an intelligent chatbot in Python.

on Dec 17, 2019 in Chatbot, NLP, NLTK, Python
The Ultimate Guide to Model Retraining

Once you have deployed your machine learning model into production, differences in real-world data will result in model drift. So, retraining and redeploying will likely be required. In other words, deployment should be treated as a continuous process. This guide defines model drift and how to identify it, and includes approaches to enable model training.

on Dec 16, 2019 in Deployment, Machine Learning, Model Drift, Model Performance, Monitoring, Production, Training Data
Microsoft Introduces Icebreaker to Address the Famous Ice-Start Challenge in Machine Learning

The new technique allows the deployment of machine learning models that operate with minimum training data.

on Dec 16, 2019 in Data Preparation, Machine Learning, Microsoft
How To “Ultralearn” Data Science, Part 1

What is "ultralearning" and how can you follow the strategy to become an expert of data science? Start with this first part in a series that will guide you through this self-motivated methodology to help you efficiently master difficult skills.

on Dec 13, 2019 in Beginners, Data Science, Education, Ultralearn
Build Pipelines with Pandas Using pdpipe

We show how to build intuitive and useful pipelines with Pandas DataFrame using a wonderful little library called pdpipe.

on Dec 13, 2019 in Data Preparation, Data Preprocessing, Pandas, Pipeline, Python
Plotnine: Python Alternative to ggplot2

Python's plotting libraries such as matplotlib and seaborn does allow the user to create elegant graphics as well, but lack of a standardized syntax for implementing the grammar of graphics compared to the simple, readable and layering approach of ggplot2 in R makes it more difficult to implement in Python.

on Dec 12, 2019 in Data Science, Data Visualization, Python, R
Python Dictionary Guide: 10 Python Dictionary Methods & Examples

Master Python Dictionaries and their essential functions in 15 minutes with this introductory guide.

on Dec 12, 2019 in Programming, Python
Deploying a pretrained GPT-2 model on AWS

This post attempts to summarize my recent detour into NLP, describing how I exposed a Huggingface pre-trained Language Model (LM) on an AWS-based web application.

on Dec 12, 2019 in AWS, Deployment, GPT-2, Natural Language Generation, NLP
AI, Analytics, Machine Learning, Data Science, Deep Learning Technology Main Developments in 2019 and Key Trends for 2020

We asked leading experts - what are the most important developments of 2019 and 2020 key trends in AI, Analytics, Machine Learning, Data Science, and Deep Learning? This blog focuses mainly on technology and deployment.

on Dec 11, 2019 in 2020 Predictions, AI, Analytics, Bill Schmarzo, Carla Gentry, Data Science, Doug Laney, Jen Underwood, Kate Strachnyi, Machine Learning, Meta Brown, Ronald van Loon, Tom Davenport, Trends
Interpretability: Cracking open the black box, Part 2

The second part in a series on leveraging techniques to take a look inside the black box of AI, this guide considers post-hoc interpretation that is useful when the model is not transparent.

on Dec 11, 2019 in Explainability, Explainable AI, Feature Selection, Interpretability, Python
NeurIPS 2019 Outstanding Paper Awards

NeurIPS 2019 is underway in Vancouver, and the committee has just recently announced this year's Outstanding Paper Awards. Find out what the selections were, along with some additional info on NeurIPS papers, here.

on Dec 11, 2019 in Conference, NeurIPS, Research
Deployment of Machine learning models using Flask

This blog will explain the basics of deploying a machine learning algorithm, focusing on developing a Naïve Bayes model for spam message identification, and using Flask to create an API for that model.

on Dec 10, 2019 in Deployment, Flask, Machine Learning
Scalable graph machine learning: a mountain we can climb?

Graph machine learning is a developing area of research that brings many complexities. One challenge that both fascinates and infuriates those working with graph algorithms is — scalability. We take a close look at scalability for graph machine learning methods covering what it is, what makes it difficult, and an example of a method that tackles it head-on.

on Dec 10, 2019 in Deep Learning, Graph Analytics, Graph Databases, Machine Learning, Scalability
Intro to Grafana: Installation, Configuration, and Building the First Dashboard

One of the biggest highlights of Grafana is the ability to bring several data sources together in one dashboard with adding rows that will host individual panels. Let's look at installing, configuring, and creating our first dashboard using Grafana.

on Dec 10, 2019 in Analytics, BI, Business Analytics, Dashboard
5 Great New Features in Latest Scikit-learn Release

From not sweating missing values, to determining feature importance for any estimator, to support for stacking, and a new plotting API, here are 5 new features of the latest release of Scikit-learn which deserve your attention.

on Dec 10, 2019 in Data Preparation, Data Preprocessing, Ensemble Methods, Feature Selection, Gradient Boosting, K-nearest neighbors, Machine Learning, Missing Values, Python, scikit-learn, Visualization
Moving Predictive Maintenance from Theory to Practice

Here are four common hurdles that need to be overcome before tapping into the benefits of predictive maintenance.

on Dec 9, 2019 in Deployment, Machine Learning, MathWorks, MATLAB, Predictive Maintenance, Simulation
The 4 Hottest Trends in Data Science for 2020

The field of Data Science is growing with new capabilities and reach into every industry. With digital transformations occurring in organizations around the world, 2019 included trends of more companies leveraging more data to make better decisions. Check out these next trends in Data Science expected to take off in 2020.

on Dec 9, 2019 in 2020 Predictions, Automated Data Science, AutoML, Cloud Computing, Data Science, NLP, Privacy, Security, Trends
AI, Analytics, Machine Learning, Data Science, Deep Learning Research Main Developments in 2019 and Key Trends for 2020

As we say goodbye to one year and look forward to another, KDnuggets has once again solicited opinions from numerous research & technology experts as to the most important developments of 2019 and their 2020 key trend predictions.

on Dec 9, 2019 in 2020 Predictions, AI, Ajit Jaokar, Analytics, Andriy Burkov, Anima Anandkumar, Daniel Tunkelang, Data Science, Deep Learning, Machine Learning, Pedro Domingos, Research, Rosaria Silipo, Xavier Amatriain
DeepMind Unveils MuZero, a New Agent that Mastered Chess, Shogi, Atari and Go Without Knowing the Rules

The new model showed great improvements over the previous AlphaZero agent.

on Dec 9, 2019 in Agents, AlphaGo, Atari, Chess, DeepMind, Reinforcement Learning, Video Games
Accuracy Fallacy: The Media’s Coverage of AI Is Bogus

Such as the gross exaggerations Stanford researchers broadcasted about their infamous "AI gaydar" project, there exists a prevalent "accuracy fallacy" in relation to AI from the media. Find out more about how the press constantly misleads the public into believing that machine learning can reliably predict psychosis, heart attacks, sexuality, and much more.

on Dec 6, 2019 in Accuracy, AI, Hype, Media
10 Free Top Notch Machine Learning Courses

Are you interested in studying machine learning over the holidays? This collection of 10 free top notch courses will allow you to do just that, with something for every approach to improving your machine learning skills.

on Dec 6, 2019 in Books, Computer Vision, Courses, Deep Learning, Explainability, Graph Analytics, Interpretability, Machine Learning, NLP, Python
5 Techniques to Prevent Overfitting in Neural Networks

In this article, I will present five techniques to prevent overfitting while training neural networks.

on Dec 6, 2019 in Neural Networks, Overfitting
Why software engineering processes and tools don’t work for machine learning

While AI may be the new electricity significant challenges remain to realize AI potential. Here we examine why data scientists and teams can’t rely on software engineering tools and processes for machine learning.

on Dec 5, 2019 in Agile, Andrew Ng, Comet.ml, Machine Learning, Software Engineering
The Essential Toolbox for Data Cleaning

Increase your confidence to perform data cleaning with a broader perspective of what datasets typically look like, and follow this toolbox of code snipets to make your data cleaning process faster and more efficient.

on Dec 5, 2019 in Data Cleaning, Data Preparation
Enabling the Deep Learning Revolution

Deep learning models are revolutionizing the business and technology world with jaw-dropping performances in one application area after another. Read this post on some of the numerous composite technologies which allow deep learning its complex nonlinearity.

on Dec 5, 2019 in Deep Learning, Gradient Descent, Neural Networks, Optimization
Artificial Friend or Virtual Foe

Is AI making more good than harm?

on Dec 5, 2019 in AI, Machine Learning, Social Good, Sustainability
Explainability: Cracking open the black box, Part 1

What is Explainability in AI and how can we leverage different techniques to open the black box of AI and peek inside? This practical guide offers a review and critique of the various techniques of interpretability.

on Dec 4, 2019 in Explainability, Explainable AI, Interpretability, XAI
The Rise of User-Generated Data Labeling

Let’s say your project is humongous and needs data labeling to be done continuously - while you’re on-the-go, sleeping, or eating. I’m sure you’d appreciate User-generated Data Labeling. I’ve got 6 interesting examples to help you understand this, let’s dive right in!

on Dec 4, 2019 in Data Labeling, Data Preparation, Data Science, User Generated Content
Vega-Lite: A grammar of interactive graphics

Vega and Vega-lite follow in a long line of work that can trace its roots back to Wilkinson’s ‘The Grammar of Graphics.’ Since then VegaLite has come into existence, bringing high-level specification of interactive visualisations to the Vega-Lite world.

on Dec 3, 2019 in Data Visualization, Graphics, Visualization
Data Science Curriculum Roadmap

What follows is a set of broad recommendations, and it will inevitably require a lot of adjustments in each implementation. Given that caveat, here are our curriculum recommendations.

on Dec 3, 2019 in Data Science, Data Science Education
A Non-Technical Reading List for Data Science

The world still cannot be reduced to numbers on a page because human beings are still the ones making all the decisions. So, the best data scientists understand the numbers and the people. Check out these great data science books that will make you a better data scientist without delving into the technical details.

on Dec 2, 2019 in Books, Data Science, Future, Review, Society
Top 7 Data Science Use Cases in Trust and Security

What are trust and safety? What is the role of trust and security in the modern world? Read this overview of 7 data science application use cases in the realm of trust and security.

on Dec 2, 2019 in AI, Data Science, Security, Trust, Use Cases
Google Open Sources MobileNetV3 with New Ideas to Improve Mobile Computer Vision Models

The latest release of MobileNets incorporates AutoML and other novel ideas in mobile deep learning.

on Dec 2, 2019 in Automated Machine Learning, Computer Vision, Google, Mobile, Open Source

2019 Dec

Latest Posts

Top Posts