In this third part of this series of posts the contributions of InfoGAN will be explored, which apply concepts from Information Theory to transform some of the noise terms into latent codes that have systematic, predictable effects on the outcome.
We show how to build a deep neural network that classifies images to many categories with an accuracy of a 90%. This was a very hard problem before the rise of deep networks and especially Convolutional Neural Networks.
We compare survival analysis to other predictive techniques, and provide examples of how it can produce business value, with a focus on Kaplan-Meier and Cox Regression methods which have been underutilized in business analytics.
At the heart of this reproducibility problem is the statistical inference methods used to validate research findings—specifically the concept of “statistical significance.”
This year, the ODSC West was held at the Hyatt Regency San Francisco Airport, from November 2 to 4. I am, attempting here, to give you a snapshot tour of what I experienced.
Sharing one platform has some obvious benefits for Data Science and Data Engineering teams, but technical, language and process challenges often make this a challenge. Learn how one company implemented single cloud platform for R, Python and other workloads – and some of the unexpected benefits they discovered along the way.
If you develop methods for data analysis, you might only be conducting gentle tests of your method on idealized data. This leads to “fragile research,” which breaks when released into the wild. Here, I share 3 ways to make your methods robust.
Linear Regression is an excellent starting point for Machine Learning, but it is a common mistake to focus just on the p-values and R-Squared values while determining validity of model. Here we examine the underlying assumptions of a Linear Regression, which need to be validated before applying the model.
This article explains how Bayes Nets gain remarkable predictive power by their use of conditional probability. This adds to several other salient strengths, making them a preeminent method for prediction and understanding variables’ effects.
Are you interested in what a data scientist does on a typical day of work? Each data science role may be different, but these five individuals provide insight to help those interested in figuring out what a day in the life of a data scientist actually looks like.
Bayes Nets have remarkable properties that make them better than many traditional methods in determining variables’ effects. This article explains the principle advantages.
The demand for professionals that can build financial analytics programs is booming. We foresee two main objectives- to predict market movement for profit, and to protect customer assets of banks.
Open Source is the heart of innovation and rapid evolution of technologies, these days. Here we discuss how to choose open source machine learning tools for different use cases.
Data Scientist is a very broad term and hiring a good fit data scientist for your project is challenging task. Here we discuss this important topic in details.
This article is simply a stream of consciousness on questions and problems I have been thinking and asking myself, and hopefully, it will stimulate some discussion.
This article is for people who are already in the field but are just starting out. My goal is to not only use this post as a reminder to myself about the important things that I have learned, but also to inspire others as they embark onto their DS careers!