- Explainable Forecasting and Nowcasting with State-of-the-art Deep Neural Networks and Dynamic Factor Model - Dec 27, 2021.
Review this detailed tutorial with code and revisit a decades-old problem using a democratized and interpretable AI framework: how precisely can we anticipate the future and understand its causal factors?
Data Exploration, Explainable AI, Feature Engineering, Forecasting
- Speeding up data understanding by interactive exploration - Aug 19, 2021.
A key success factor of data science projects is to understand the data well. This blog explains why coding can be inefficient for this and how you can improve.
Communication, Data Exploration, Data Visualization, Visplore
- How Visualization is Transforming Exploratory Data Analysis - Aug 4, 2021.
Data analysts are dealing with bigger datasets than ever before, making interrogation difficult. Visualized Exploratory Data Analysis, supported by advanced parallel computing, promises an answer.
Data Analysis, Data Exploration, Data Visualization, Geospatial
- A Lightning Fast Look at Single Line Exploratory Data Analysis - Jul 8, 2021.
Here's a very quick look at how you can perform EDA with a single line of code using D-Tale.
Data Analysis, Data Exploration, Data Science, Data Visualization
- What makes a song popular? Analyzing Top Songs on Spotify - Apr 16, 2021.
With so many great (and not-so-great) songs out there, it can be hard to find those that match your musical preferences. Follow along with this ML model-building project to explore the extensive song data available on Spotify and design a recommendation engine that could help you discover your next favorite artist!
Beatles, Data Analysis, Data Exploration, Feature Selection, Music, Spotify
- How to frame the right questions to be answered using data - Mar 18, 2021.
Understanding your data is a key first step before going too far into any data science project. But you can't fully understand your data until you know the right questions to ask of it.
Advice, Data Analysis, Data Exploration, Data Science, Data Visualization
- Know your data much faster with the new Sweetviz Python library - Mar 12, 2021.
One of the latest exploratory data analysis libraries is Sweetviz, a new open-source Python library built for finding out data types, missing information, distributions of values, correlations, etc. Find out more about the library and how to use it here.
Data Analysis, Data Exploration, Data Visualization, Python
- 11 Essential Code Blocks for Complete EDA (Exploratory Data Analysis) - Mar 5, 2021.
This article is a practical guide to exploring the data in any data science project and gaining valuable insights.
Data Analysis, Data Exploration, Data Visualization, Pandas, Python
- Pandas Profiling: One-Line Magical Code for EDA - Feb 24, 2021.
EDA can be automated using a Python library called Pandas Profiling. Let’s explore Pandas Profiling to do EDA in a very short time and with just a single line of code.
Data Analysis, Data Exploration, Data Science, Pandas, Python
- Powerful Exploratory Data Analysis in just two lines of code - Feb 22, 2021.
EDA is a fundamental early process for any Data Science investigation. Typical approaches for visualization and exploration are powerful, but can be cumbersome for getting to the heart of your data. Now, you can get to know your data much faster with only a few lines of code... and it might even be fun!
Data Analysis, Data Exploration, Data Visualization, Python
- The Best Tool for Data Blending is KNIME - Jan 13, 2021.
These are the lessons and best practices I learned over many years of experience with data blending, and the software that became the most important tool in my day-to-day work.
Data Exploration, Data Management, ETL, Knime
- 14 Data Science projects to improve your skills - Dec 1, 2020.
There's a lot of data out there and so many data science techniques to master or review. Check out these great project ideas from easy to advanced difficulty levels to develop new skills and strengthen your portfolio.
Data Exploration, Data Science, Data Science Skills, Data Visualization, Prediction, Project
- Statistical and Visual Exploratory Data Analysis with One Line of Code - Sep 21, 2020.
If EDA is not executed correctly, it can cause us to start modeling with “unclean” data. See how to use Pandas Profiling to perform EDA with a single line of code.
Data Exploration, Data Visualization, Pandas, Python
- Bring your Pandas Dataframes to life with D-Tale - Aug 13, 2020.
Bring your Pandas dataframes to life with D-Tale. D-Tale is an open-source solution with which you can visualize and analyze Pandas data structures and learn how to code them. In this tutorial you'll learn how to open the grid, build columns, create charts and view code exports.
Data Exploration, Data Science, Data Visualization, Pandas, Python
- First Steps of a Data Science Project - Jul 29, 2020.
Many data science projects are launched with good intentions, but fail to deliver because the correct process is not understood. To achieve good performance and results in this work, the first steps must include clearly defining goals and outcomes, collecting data, and preparing and exploring the data. This is all about solving problems, which requires a systematic process.
Beginners, Data Exploration, Data Preparation, Data Science
- Exploratory Data Analysis on Steroids - Jul 6, 2020.
Exploratory data analysis is a central aspect of Data Science that sometimes gets overlooked. The first step of anything you do should be to know your data: understand it, get familiar with it. This becomes even more important as your data volume grows: imagine trying to parse through thousands or millions of records and make sense of them.
Data Analysis, Data Exploration, Data Preparation, Pandas, Python
- Why Python is One of the Most Preferred Languages for Data Science? - Jan 3, 2020.
Why do most data scientists love Python? Learn more about how so many well-developed Python packages can help you accomplish your crucial data science tasks.
Data Exploration, Data Science, Programming Languages, Python
- Exploratory Data Analysis Using Python - Aug 7, 2019.
In this tutorial, you’ll use Python and Pandas to explore a dataset and create visual distributions, identify and eliminate outliers, and uncover correlations between two datasets.
ActiveState, Data Analysis, Data Exploration, Pandas, Python
- Five Command Line Tools for Data Science - Jul 31, 2019.
You can do more data science than you think from the terminal.
Data Exploration, Data Science, Data Science Tools
- Fantastic Four of Data Science Project Preparation - Jul 26, 2019.
This article takes a closer look at the four fantastic things we should keep in mind when approaching every new data science project.
Comic, Data Exploration, Data Preparation, Data Science, Domain Knowledge
- Airbnb Rental Listings Dataset Mining - Jan 28, 2019.
An Exploratory Analysis of Airbnb’s Data to understand the rental landscape in New York City.
AirBnB, Data Exploration, Data Visualization, New York City, R, Real Estate
- Top 12 Essential Command Line Tools for Data Scientists - Mar 21, 2018.
This post is a short introductory overview of 12 Unix-like operating system command line tools of value to data science tasks, and the data scientists who perform them.
Data Exploration, Data Science, Data Science Tools
- Applied Data Science: Solving a Predictive Maintenance Business Problem Part 2 - Feb 20, 2018.
In this post we will further discuss how exploratory analysis can be used to gain insights for feature engineering.
Data Analysis, Data Exploration, Data Science, Feature Engineering
- Data Science at the Command Line: Exploring Data - Feb 14, 2018.
See what's in the freely available book "Data Science at the Command Line" by digging into data exploration in the terminal.
Data Exploration, Data Science, Data Science Tools
- Next Generation Data Manipulation with R and dplyr - Aug 31, 2017.
The idea behind the dplyr package is to do one thing at a time. dplyr has a separate function for each task, which makes its implementation crisp and easy to understand.
Data Cleaning, Data Exploration, R, R Packages
- Exploratory Data Analysis in Python - Jul 7, 2017.
We view EDA very much like a tree: there is a basic series of steps you perform every time you perform EDA (the main trunk of the tree) but at each step, observations will lead you down other avenues (branches) of exploration by raising questions you want to answer or hypotheses you want to test.
Data Analysis, Data Exploration, Data Preparation, Jupyter, Python, SVDS
- 5 Machine Learning Projects You Can No Longer Overlook, May - May 10, 2017.
In this month's installment of Machine Learning Projects You Can No Longer Overlook, we find some data preparation and exploration tools, a (the?) reinforcement learning "framework," a new automated machine learning library, and yet another distributed deep learning library.
Automated Machine Learning, Data Exploration, Deep Learning, Distributed Systems, Machine Learning, Overlook, Pandas, Reinforcement Learning
- The Value of Exploratory Data Analysis - Apr 20, 2017.
In this post, we will give a high-level overview of what exploratory data analysis (EDA) typically entails and then describe three of the major ways EDA is critical to successfully modeling data and interpreting the results.
Data Analysis, Data Exploration, Data Visualization, SVDS
- 5 Machine Learning Projects You Can No Longer Overlook, April - Apr 13, 2017.
It's about that time again... 5 more machine learning or machine learning-related projects you may not yet have heard of, but may want to consider checking out. Find tools for data exploration, topic modeling, high-level APIs, and feature selection herein.
Data Exploration, Deep Learning, Java, Machine Learning, Neural Networks, Overlook, Python, Scala, scikit-learn, Topic Modeling
- Beyond One-Hot: an exploration of categorical variables - Dec 8, 2015.
Coding categorical variables as numbers, for example by assigning an integer to each category (ordinal coding), is one way to prepare them for machine learning algorithms. Here, we explore different ways of converting a categorical variable and their effects on the dimensionality of data.
Data Exploration, Machine Learning, Python, Will McGinnis
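
As a rough illustration of the last entry's point about dimensionality, the sketch below contrasts ordinal coding with one-hot coding on a toy column. It is not taken from the linked article; the data and all variable names are hypothetical, and it uses only the Python standard library.

```python
# Hypothetical toy column; the category values are illustrative only.
colors = ["red", "green", "blue", "green", "red"]

# Fix a stable category order so the encodings are reproducible.
categories = sorted(set(colors))

# Ordinal coding: map each category to an integer, giving one numeric column.
ordinal_index = {cat: i for i, cat in enumerate(categories)}
ordinal = [ordinal_index[c] for c in colors]

# One-hot coding: one 0/1 column per category, so the width grows
# with the number of distinct categories.
one_hot = [[1 if c == cat else 0 for cat in categories] for c in colors]

print("ordinal width:", 1)
print("one-hot width:", len(one_hot[0]))
```

Ordinal coding keeps a single column but implies an ordering among categories that may not exist, while one-hot coding avoids that at the cost of one column per category.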