How to Remove Duplicates in Large Datasets
Dealing with huge datasets can be tricky, especially the data cleaning process. One of such processing is de-duplication, find out how you can solve this using the statistical techniques.
on Apr 27, 2016 in CleverTap, Data Cleaning, Data Preparation
Top 10 IPython Notebook Tutorials for Data Science and Machine Learning
A list of 10 useful Github repositories made up of IPython (Jupyter) notebooks, focused on teaching data science and machine learning. Python is the clear target here, but general principles are transferable.
on Apr 22, 2016 in Data Science, Deep Learning, GitHub, IPython, Machine Learning, Python, Sebastian Raschka, TensorFlow
Comprehensive Guide to Learning Python for Data Analysis and Data Science
Want to make a career change to Data Science using python? Well learning anything on your own can be a challenge & a little guidance could be a great help, that is exactly what this article will provide you with.
on Apr 20, 2016 in Data Analysis, Data Science Education, DataCamp, Python
Deep Learning for Chatbots, Part 1 – Introduction
The first in a series of tutorial posts on using Deep Learning for chatbots, this covers some of the techniques being used to build conversational agents, and goes from the current state of affairs through to what is and is not possible.
on Apr 19, 2016 in Chatbot, Deep Learning, Siri
Association Rules and the Apriori Algorithm: A Tutorial
A great and clearly-presented tutorial on the concepts of association rules and the Apriori algorithm, and their roles in market basket analysis.
on Apr 14, 2016 in Algobeans, Annalyn Ng, Apriori, Association Rules
Regression & Correlation for Military Promotion: A Tutorial
A clear and well-written tutorial covering the concepts of regression and correlation, focusing on military commander promotion as a use case.
on Apr 13, 2016 in Algobeans, Correlation, Military, Regression
Basics of GPU Computing for Data Scientists
With the rise of neural network in data science, the demand for computationally extensive machines lead to GPUs. Learn how you can get started with GPUs & algorithms which could leverage them.
on Apr 7, 2016 in Algorithms, CUDA, Data Science, GPU, NVIDIA
Deep Learning for Internet of Things Using H2O
H2O is feature-rich open source machine learning platform known for its R and Spark integration and it’s ease of use. This is an overview of using H2O deep learning for data science with the Internet of Things.
on Apr 6, 2016 in Deep Learning, H2O, Internet of Things, IoT, R
|