2016 Jun

Recursive (not Recurrent!) Neural Networks in TensorFlow

Learn how to implement recursive neural networks in TensorFlow, which can be used to learn tree-like structures, or directed acyclic graphs.

on Jun 30, 2016 in Neural Networks, TensorFlow
Mining Twitter Data with Python Part 5: Data Visualisation Basics

Part 5 of this series takes on data visualization, as we look to make sense of our data and highlight interesting insights.

on Jun 29, 2016 in D3.js, Data Visualization, Python, Social Media, Social Media Analytics, Text Mining, Twitter
The Big Data Ecosystem is Too Damn Big

The Big Data ecosystem is just too damn big! It's complex, redundant, and confusing. There are too many layers in the technology stack, too many standards, and too many engines. Vendors? Too many. What is the user to do?

on Jun 28, 2016 in Analytics, Big Data, Business Analytics
5 More Machine Learning Projects You Can No Longer Overlook

There are a lot of popular machine learning projects out there, but many more that are not. Which of these are actively developed and worth checking out? Here is an offering of 5 such projects.

on Jun 28, 2016 in Computer Vision, Data Preparation, Data Preprocessing, Javascript, Machine Learning, Natural Language Processing, NLP, Overlook, Python
Mining Twitter Data with Python Part 4: Rugby and Term Co-occurrences

Part 4 of this series employs some of the lessons learned thus far to analyze tweets related to rugby matches and term co-occurrences.

on Jun 27, 2016 in Python, Social Media, Social Media Analytics, Text Mining, Twitter
Improving Nudity Detection and NSFW Image Recognition

This post discussed improvements made in a tricky machine learning classification problem: nude and/or NSFW, or not?

on Jun 25, 2016 in Algorithmia, Algorithms, Classification
Regularization in Logistic Regression: Better Fit and Better Generalization?

A discussion on regularization in logistic regression, and how its usage plays into better model fit and generalization.

on Jun 24, 2016 in Cost Function, Logistic Regression, Machine Learning, Regression, Regularization
Top Machine Learning Libraries for Javascript

Javascript may not be the conventional choice for machine learning, but there is no reason it cannot be used for such tasks. Here are the top libraries to facilitate machine learning in Javascript.

on Jun 24, 2016 in Andrej Karpathy, Convolutional Neural Networks, Deep Learning, Javascript, Machine Learning, Neural Networks
Ten Simple Rules for Effective Statistical Practice: An Overview

An overview of 10 simple rules to follow to ensure proper effective statistical data analysis.

on Jun 23, 2016 in Advice, Data Quality, Noise, Replication, Reproducibility, Statistical Analysis
Machine Learning Trends and the Future of Artificial Intelligence

The confluence of data flywheels, the algorithm economy, and cloud-hosted intelligence means every company can now be a data company, every company can now access algorithmic intelligence, and every app can now be an intelligent app.

on Jun 22, 2016 in Algorithmia, Algorithms, Artificial Intelligence, Cloud, Machine Intelligence, Machine Learning
History of Data Mining

Data mining is a subfield of computer science which blends many techniques from statistics, data science, database theory and machine learning. Here are the major milestones and “firsts” in the history of data mining plus how it’s evolved and blended with data science and big data.

on Jun 22, 2016 in About Gregory Piatetsky, Alan Turing, Bayes Theorem, Data Mining, DJ Patil, History, Vladimir Vapnik
New Andrew Ng Machine Learning Book Under Construction, Free Draft Chapters

Check out the details on Andrew Ng's new book on building machine learning systems, and find out how to get your free copy of draft chapters as they are written.

on Jun 20, 2016 in Andrew Ng, Book, Free ebook, Machine Learning
What is Your Data Worth? On LinkedIn, Microsoft, and the Value of User Data

The recent announcement of Microsoft’s acquisition of LinkedIn has raised many questions about how Microsoft will monetize this data. We examine LinkedIn value per user and compare to Google, Facebook, Yahoo, and Twitter.

on Jun 20, 2016 in Business Value, Facebook, Google, LinkedIn, Microsoft, Yahoo
Political Data Science: Analyzing Trump, Clinton, and Sanders Tweets and Sentiment

This post shares some results of political text analytics performed on Twitter data. How negative are the US Presidential candidate tweets? How does the media mention the candidates in tweets? Read on to find out!

on Jun 18, 2016 in Bernie Sanders, Donald Trump, Hillary Clinton, ParseHub, Politics, Sentiment Analysis, Twitter
A Visual Explanation of the Back Propagation Algorithm for Neural Networks

A concise explanation of backpropagation for neural networks is presented in elementary terms, along with explanatory visualization.

on Jun 17, 2016 in Algorithms, Backpropagation, Explanation, Machine Learning, Neural Networks
How open API economy accelerates the growth of big data and analytics

An open API is available on the internet for free. We review the growth of API economy and how organizations have been realizing the potential of open APIs in transforming their business.

on Jun 17, 2016 in API, Big Data Analytics, Open Data
Thinking About Analytics Readiness

This article touches upon an important but under-discussed topic of analytics readiness, including whether and when organizations should engage in analytics.

on Jun 16, 2016 in Analytics, Analytics Strategy, Culture, Strategy
Nutrition & Principal Component Analysis: A Tutorial

A great overview of Principal Component Analysis (PCA), with an example application in the field of nutrition.

on Jun 16, 2016 in Algobeans, Feature Selection, Food, Nutrition, PCA
7 Steps to Mastering SQL for Data Science

Follow these 7 steps to go from SQL data science newbie to seasoned practitioner quickly. No nonsense, just the necessities.

on Jun 16, 2016 in 7 Steps, Data Science, Database, Relational Databases, SQL
Mining Twitter Data with Python Part 1: Collecting Data

Part 1 of a 7 part series focusing on mining Twitter data for a variety of use cases. This first post lays the groundwork, and focuses on data collection.

on Jun 15, 2016 in Python, Social Media, Social Media Analytics, Twitter
10 Data Acquisition Strategies for Startups

An interesting discussion of the myriad methods in which startups may choose to acquire data, often the most overlooked and important aspect of a startup's success (or failure).

on Jun 14, 2016 in Acquisitions, Crowdsourcing, Datasets, Startups
Machine Learning Classic: Parsimonious Binary Classification Trees

Get your hands on a classic technical report outlining a three-step method of construction binary decision trees for multiple classification problems.

on Jun 14, 2016 in Decision Trees, Leo Breiman, Machine Learning, Statistics
How to Select Support Vector Machine Kernels

Support Vector Machine kernel selection can be tricky, and is dataset dependent. Here is some advice on how to proceed in the kernel selection process.

on Jun 13, 2016 in Machine Learning, Support Vector Machines
Apache Spark Key Terms, Explained

An overview of 13 core Apache Spark concepts, presented with focus and clarity in mind. A great beginner's overview of essential Spark terminology.

on Jun 13, 2016 in Apache Spark, Databricks, Dataset, Explained, Key Terms, RDD, Tungsten
AIG & Zurich on Machine Learning in Insurance

Where and how can machine learning be practically applied by insurers? And is it worth it? Read the white paper from insurance experts at AIG and Zurich.

on Jun 10, 2016 in AIG, Insurance, Machine Learning, White Paper
Top NoSQL Database Engines

An overview of the top 5 NoSQL database engines in use today, including examples of key-value, column-oriented, graph, and document paradigms.

on Jun 10, 2016 in Cassandra, Database, HBase, MongoDB, Neo4j, NoSQL
Cloud Computing Key Terms, Explained

A concise overview of 20 core cloud computing ecosystem concepts. The focus here is on the terminology, not The Big Picture.

on Jun 9, 2016 in AWS, Cloud, Cloud Computing, Explained, Key Terms, PaaS, SaaS
5 Best Practices for Big Data Security

Lack of data security can not only result in financial losses, but may also damage the reputation of organizations. Take a look at some of the most important data security best practices that can reduce the risks associated with analyzing a massive amount of data.

on Jun 9, 2016 in Best Practices, Big Data, Security
Where are the Opportunities for Machine Learning Startups?

Machine learning has permeated data-driven businesses, which means almost all businesses. Here are a few areas where it’s possible that big corporations haven’t already eaten everybody’s lunch.

on Jun 8, 2016 in Machine Learning, Startup
Data Science of Variable Selection: A Review

There are as many approaches to selecting features as there are statisticians since every statistician and their sibling has a POV or a paper on the subject. This is an overview of some of these approaches.

on Jun 7, 2016 in Algorithms, Big Data, Feature Selection, Statistics
Big Data Business Model Maturity Index and the Internet of Things (IoT)

This post explores how organizations could use the Big Data Business Model Maturity Index (BDBMMI) to exploit the Internet of Things (IoT).

on Jun 7, 2016 in Big Data, Internet of Things, IoT, Maturity Model
R, Python Duel As Top Analytics, Data Science software – KDnuggets 2016 Software Poll Results

R remains the leading tool, with 49% share, but Python grows faster and almost catches up to R. RapidMiner remains the most popular general Data Science platform. Big Data tools used by almost 40%, and Deep Learning usage doubles.

on Jun 6, 2016 in Data Mining Software, Data Science Platform, Poll, Python, Python vs R, R, RapidMiner, SQL
Ethics in Machine Learning – Summary

Still worried about the AI apocalypse? Here we are discussion about the constraints and ethics for the machine learning algorithms to prevent it.

on Jun 6, 2016 in AI, Ethics, Machine Learning, MLconf, Seattle, WA
What is the Difference Between Deep Learning and “Regular” Machine Learning?

Another concise explanation of a machine learning concept by Sebastian Raschka. This time, Sebastian explains the difference between Deep Learning and "regular" machine learning.

on Jun 3, 2016 in Convolutional Neural Networks, Deep Learning
Udacity Nanodegree Programs: Machine Learning, Data Analyst, and more

Develop new skills. Be in demand. Accelerate your career with the credential that fast-tracks you to career success.

on Jun 1, 2016 in Machine Learning, Online Education, Udacity

2016 Jun

Latest Posts

Top Posts