- The Best Tool for Data Blending is KNIME - Jan 13, 2021.
These are the lessons and best practices I learned in many years of experience in data blending, and the software that became my most important tool in my day-to-day work.
Tags: Data Exploration, Data Management, ETL, Knime
- 14 Data Science projects to improve your skills - Dec 1, 2020.
There's a lot of data out there and so many data science techniques to master or review. Check out these great project ideas from easy to advanced difficulty levels to develop new skills and strengthen your portfolio.
Tags: Data Exploration, Data Science, Data Science Skills, Data Visualization, Prediction, Project

Top Python Libraries for Data Science, Data Visualization & Machine Learning - Nov 2, 2020.
This article compiles the 38 top Python libraries for data science, data visualization & machine learning, as best determined by KDnuggets staff.
Tags: Automated Machine Learning, AutoML, Data Exploration, Data Processing, Data Science, Data Visualization, Explainability, Machine Learning, Python
- Statistical and Visual Exploratory Data Analysis with One Line of Code - Sep 21, 2020.
If EDA is not executed correctly, it can cause us to start modeling with “unclean” data. See how to use Pandas Profiling to perform EDA with a single line of code.
Tags: Data Exploration, Data Visualization, Pandas, Python
- Bring your Pandas Dataframes to life with D-Tale - Aug 13, 2020.
Bring your Pandas dataframes to life with D-Tale. D-Tale is an open-source solution for which you can visualize, analyze and learn how to code Pandas data structures. In this tutorial you'll learn how to open the grid, build columns, create charts and view code exports.
Tags: Data Exploration, Data Science, Data Visualization, Pandas, Python
First Steps of a Data Science Project - Jul 29, 2020.
Many data science projects are launched with good intentions, but fail to deliver because the correct process is not understood. To achieve good performance and results in this work, the first steps must include clearly defining goals and outcomes, collecting data, and preparing and exploring the data. This is all about solving problems, which requires a systematic process.
Tags: Beginners, Data Exploration, Data Preparation, Data Science
Exploratory Data Analysis on Steroids - Jul 6, 2020.
This is a central aspect of Data Science, which sometimes gets overlooked. The first step of anything you do should be to know your data: understand it, get familiar with it. This concept gets even more important as you increase your data volume: imagine trying to parse through thousands or millions of registers and make sense out of them.
Tags: Data Analysis, Data Exploration, Data Preparation, Pandas, Python
- Why Python is One of the Most Preferred Languages for Data Science? - Jan 3, 2020.
Why do most data scientists love Python? Learn more about how so many well-developed Python packages can help you accomplish your crucial data science tasks.
Tags: Data Exploration, Data Science, Programming Languages, Python
- Exploratory Data Analysis Using Python - Aug 7, 2019.
In this tutorial, you’ll use Python and Pandas to explore a dataset and create visual distributions, identify and eliminate outliers, and uncover correlations between two datasets.
Tags: ActiveState, Data Analysis, Data Exploration, Pandas, Python
- Five Command Line Tools for Data Science - Jul 31, 2019.
You can do more data science than you think from the terminal.
Tags: Data Exploration, Data Science, Data Science Tools
Fantastic Four of Data Science Project Preparation - Jul 26, 2019.
This article takes a closer look at the four fantastic things we should keep in mind when approaching every new data science project.
Tags: Comic, Data Exploration, Data Preparation, Data Science, Domain Knowledge
- KDnuggets™ News 19:n19, May 15: Data Scientist – Best Job of the Year!; How (not) to use Machine Learning for time series forecasting - May 15, 2019.
"Please, explain." Interpretability of machine learning models; How to fix an Unbalanced Dataset; Data Science Poem; Customer Churn Prediction Using Machine Learning; A Complete Exploratory Data Analysis and Visualization for Text
Tags: Churn, Data Exploration, Data Science, Data Scientist, Interpretability, Machine Learning, Text Analytics, Time Series, Unbalanced
- Most impactful AI trends of 2018: The rise of ML Engineering - Mar 1, 2019.
As both research and applied teams are doubling down on their engineering and infrastructure needs, the nascent field of ML Engineering will build upon 2018’s foundation and truly blossom in 2019.
Tags: AI, Data Exploration, Machine Learning, Platform, Representation, Scalability, Trends
- Airbnb Rental Listings Dataset Mining - Jan 28, 2019.
An Exploratory Analysis of Airbnb’s Data to understand the rental landscape in New York City.
Tags: AirBnB, Data Exploration, Data Visualization, New York City, R, Real Estate
- Beginner Data Visualization & Exploration Using Pandas - Oct 22, 2018.
This tutorial will offer a beginner guide into how to get around with Pandas for data wrangling and visualization.
Pages: 1 2
Tags: Data Exploration, Data Visualization, Pandas, Python
Top 12 Essential Command Line Tools for Data Scientists - Mar 21, 2018.
This post is a short introductory overview of 12 Unix-like operating system command line tools of value to data science tasks, and the data scientists who perform them.
Tags: Data Exploration, Data Science, Data Science Tools
- Applied Data Science: Solving a Predictive Maintenance Business Problem Part 2 - Feb 20, 2018.
In this post we will discuss further on how exploratory analysis can be used for getting insights for feature engineering.
Tags: Data Analysis, Data Exploration, Data Science, Feature Engineering
Data Science at the Command Line: Exploring Data - Feb 14, 2018.
See what's available in the freely-available book "Data Science at the Command Line" by digging into data exploration in the terminal.
Tags: Data Exploration, Data Science, Data Science Tools
- Next Generation Data Manipulation with R and dplyr - Aug 31, 2017.
The idea behind the dplyr package is to do one thing at a time. dplyr has separate functions for every task which make its implementation crisp and easy to understand.
Tags: Data Cleaning, Data Exploration, R, R Packages
- Exploratory Data Analysis in Python - Jul 7, 2017.
We view EDA very much like a tree: there is a basic series of steps you perform every time you perform EDA (the main trunk of the tree) but at each step, observations will lead you down other avenues (branches) of exploration by raising questions you want to answer or hypotheses you want to test.
Tags: Data Analysis, Data Exploration, Data Preparation, Jupyter, Python, SVDS
- 5 Machine Learning Projects You Can No Longer Overlook, May - May 10, 2017.
In this month's installment of Machine Learning Projects You Can No Longer Overlook, we find some data preparation and exploration tools, a (the?) reinforcement learning "framework," a new automated machine learning library, and yet another distributed deep learning library.
Tags: Automated Machine Learning, Data Exploration, Deep Learning, Distributed Systems, Machine Learning, Overlook, Pandas, Reinforcement Learning
- The Value of Exploratory Data Analysis - Apr 20, 2017.
In this post, we will give a high level overview of what exploratory data analysis (EDA) typically entails and then describe three of the major ways EDA is critical to successfully model and interpret its results.
Tags: Data Analysis, Data Exploration, Data Visualization, SVDS
5 Machine Learning Projects You Can No Longer Overlook, April - Apr 13, 2017.
It's about that time again... 5 more machine learning or machine learning-related projects you may not yet have heard of, but may want to consider checking out. Find tools for data exploration, topic modeling, high-level APIs, and feature selection herein.
Tags: Data Exploration, Deep Learning, Java, Machine Learning, Neural Networks, Overlook, Python, Scala, scikit-learn, Topic Modeling
- Top Data Scientist Daniel Tunkelang on Data Recycling - Nov 22, 2016.
Respected Data Scientist Daniel Tunkelang shares some insight into data recycling, using data from other contexts to bootstrap your initial statistical models until you can collect live data.
Tags: Advice, Daniel Tunkelang, Data Exploration, Data Scientist
- 5 Steps for Advanced Data Analysis using Visualization - Oct 28, 2016.
In most of the scientific researches, due to large amount of experiment data, statistical analysis is typically done by technical experts in computing and statistics. Unfortunately, these experts are not the experts of underlying research; which may cause gaps in analysis. If actual researchers are given easy to use tools and methods to handle and analyse data, it will enrich the research outcome for sure.
Tags: Bioinformatics, Clustering, Data Exploration, Data Visualization, Noise, PCA, Qlucore, Statistical Analysis
- Emory University: Tenure-Track Faculty Position in Data Exploration - Sep 26, 2016.
We are particularly interested in applicants with expertise in interactive data exploration, broadly construed, which includes data mining, analytics, visualization, human-computer interaction, and summarization.
Tags: Atlanta, Data Exploration, Emory University, Faculty, GA
- Caravel: Airbnb’s data exploration platform - Apr 13, 2016.
For data exploration, discovery, and collaborative analytics, AirBnB have built and open sourced, a data exploration and dashboarding platform named Caravel. It allows data exploration through rich visualizations while performing fast and intuitive “slicing and dicing” of your dataset.
Tags: AirBnB, Data Exploration, Data Science Tools
- Change in Perspective with Process Mining - Feb 9, 2016.
Process mining is focused on the analysis of processes, and is an excellent tool in particular for the exploratory analysis of process-related data. Understand how effectively use it as an exploratory analysis tool, which can rapidly and flexibly take different perspectives on your processes.
Pages: 1 2 3
Tags: Data Exploration, Data Science, Process Mining
- Improve your processes with statistical models - Jan 7, 2016.
Through real-world case studies, this technical primer will help you: find best practices to interactively explore the patterns in your data, build useful statistical models, and visually interact with these models.
Tags: Data Exploration, Data Visualization, JMP
- Beyond One-Hot: an exploration of categorical variables - Dec 8, 2015.
Coding categorical variables into numbers, by assign an integer to each category ordinal coding of the machine learning algorithms. Here, we explore different ways of converting a categorical variable and their effects on the dimensionality of data.
Tags: Data Exploration, Machine Learning, Python, Will McGinnis
- Business Analytics Webinars: Practical Training, 7 Live Sessions, Dec 10 - Dec 3, 2015.
Stay ahead of the curve in business analytics with our Dec 10 Webinar Marathon, and watch industry experts deliver 7 back-to-back sessions on hot topics. Register now.
Tags: Business Analytics, Dashboard, Data Exploration, Excel, PASS
- Improve your processes with statistical models - Nov 3, 2015.
Get technical primer with best practices to interactively explore the patterns in your data, build useful statistical models of these patterns, and visually interact with these models.
Tags: Data Exploration, Data Visualization, JMP
- INFORMS Courses: Essential Practice Skills, Data Exploration and Visualization, November, Baltimore - Oct 5, 2015.
Two INFORMS courses teach Essential Practice Skills for High-Impact Analytics Projects (Nov 18-19) and Data Exploration & Visualization (Nov 10-11). Both courses are given at Johns Hopkins University, Baltimore, MD.
Tags: Baltimore, Best Practices, Data Exploration, Data Visualization, Freakalytics, INFORMS, MD, Skills
- Webcast: Tech expert Phil Simon on exploring data - Jun 17, 2015.
Phil Simon, award-winning author, talks about how data visualization can help improve data quality, promoting the exploratory mindset, telling good stories with data, and more. On demand webcast.
Tags: Data Exploration, Data Quality, Data Visualization, JMP
- Statwing, Modern Data Analysis Software - Jan 30, 2014.
Every decision maker in the organization needs to be capable of analyzing data, but most tools require a lot of mundane and time-consuming data cleaning. Statwing solves that problem and lets you focus on data analysis.
Tags: Automating, Data Exploration, General Social Survey, Statwing