- Find the Best-Matching Distribution for Your Data Effortlessly - Oct 22, 2021.
How to find the best-matching statistical distributions for your data points — in an automated and easy way. And, then how to extend the utility further.
Distribution, Python, Statistics, Synthetic Data
- Advanced Statistical Concepts in Data Science - Sep 30, 2021.
The article contains some of the most commonly used advanced statistical concepts along with their Python implementation.
Career Advice, Data Science, Distribution, Probability, Statistics
- Don’t Touch a Dataset Without Asking These 10 Questions - Sep 20, 2021.
Selecting the right dataset is critical for the success of your AI project.
Datasets, Distribution, Outliers, Privacy, Standardization
- Comprehensive Guide to the Normal Distribution - Jan 18, 2021.
Drop in for some tips on how this fundamental statistics concept can improve your data science.
Distribution, Normal Distribution, Python, SciPy, Statistics
- Essential Math for Data Science: The Poisson Distribution - Dec 29, 2020.
The Poisson distribution, named after the French mathematician Denis Simon Poisson, is a discrete distribution function describing the probability that an event will occur a certain number of times in a fixed time (or space) interval.
Data Science, Distribution, Mathematics, Poisson Distribution
- Fast and Intuitive Statistical Modeling with Pomegranate - Dec 21, 2020.
Pomegranate is a delicious fruit. It can also be a super useful Python library for statistical analysis. We will show how in this article.
Distribution, Markov Chains, Probability, Python, Statistical Modeling
- Before Probability Distributions - Jul 16, 2020.
Why do we use probability distributions, and why do they matter?
Distribution, Probability, Statistics
- Overview of data distributions - Jun 10, 2020.
With so many types of data distributions to consider in data science, how do you choose the right one to model your data? This guide will overview the most important distributions you should be familiar with in your work.
Binomial, Distribution, Normal Distribution, Poisson Distribution, Probability, Statistics
- Looking Normal(ly Distributed) - May 20, 2020.
This article investigates when some probability distributions look normal "enough" for a statistical test.
Data Visualization, Distribution, Normal Distribution, Probability, Statistics
- Probability Distributions in Data Science - Feb 26, 2020.
Some machine learning models are designed to work best under some distribution assumptions. Therefore, knowing with which distributions we are working with can help us to identify which models are best to use.
Data Science, Distribution, Normal Distribution, Probability
- Beta Distribution: What, When & How - Sep 25, 2019.
This article covers the beta distribution, and explains it using baseball batting averages.
Distribution, Probability, Statistics
- What is Poisson Distribution? - Aug 14, 2019.
An solid overview of the Poisson distribution, starting from why it is needed, how it stacks up to binomial distribution, deriving its formula mathematically, and more.
Distribution, Poisson Distribution, Probability, Statistics
- 5 Probability Distributions Every Data Scientist Should Know - Jul 4, 2019.
Having an understanding of probability distributions should be a priority for data scientists. Make sure you know what you should by reviewing this post on the subject.
Data Science, Data Scientist, Distribution, Normal Distribution, Probability
- What to do when your training and testing data come from different distributions - Jan 4, 2019.
However, sometimes only a limited amount of data from the target distribution can be collected. It may not be sufficient to build the needed train/dev/test sets. What to do in such a case? Let us discuss some ideas!
Distribution, Machine Learning, Training Data
- The Long Tail of Medical Data - Nov 12, 2018.
This article discusses some issues related to medical data, relating specifically to power law distributions and computer aided diagnosis. Read on to see machine learning's place in the puzzle.
Distribution, Healthcare, Medical, Zipf's Law
- The Intuitions Behind Bayesian Optimization with Gaussian Processes - Oct 19, 2018.
Bayesian Optimization adds a Bayesian methodology to the iterative optimizer paradigm by incorporating a prior model on the space of possible target functions. This article introduces the basic concepts and intuitions behind Bayesian Optimization with Gaussian Processes.
Bayesian, Distribution, Hyperparameter, Machine Learning, Optimization
- Why Data Scientists Love Gaussian - Jun 26, 2018.
Gaussian distribution model, often identified with its iconic bell shaped curve, also referred as Normal distribution, is so popular mainly because of three reasons.
Distribution, Probability, Statistics
- Packaging and Distributing Your Python Project to PyPI for Installation Using pip - Jun 11, 2018.
This tutorial will explain the steps required to package your Python projects, distribute them in distribution formats using steptools, upload them into the Python Package Index (PyPI) repository using twine, and finally installation using Python installers such as pip and conda.
Pages: 1 2
Distribution, Project, Python
- Error Analysis to your Rescue – Lessons from Andrew Ng, part 3 - Jan 29, 2018.
The last entry in a series of posts about Andrew Ng's lessons on strategies to follow when fixing errors in your algorithm
Andrew Ng, Bias, Distribution, Machine Learning, Variance
- Data Science Primer: Basic Concepts for Beginners - Aug 11, 2017.
This collection of concise introductory data science tutorials cover topics including the difference between data mining and statistics, supervised vs. unsupervised learning, and the types of patterns we can mine from data.
Bias, Data Mining, Data Science, Distribution, Ensemble Methods, Statistics
- Data Science Basics: Power Laws and Distributions - Dec 21, 2016.
Power laws and other relationships between observable phenomena may not seem like they are of any interest to data science, at least not to newcomers to the field, but this post provides an overview and suggests how they may be.
Beginners, Data Science, Distribution, Zipf's Law
- Central Limit Theorem for Data Science – Part 2 - Aug 16, 2016.
This post continues an explanation of Central Limit Theorem started in a previous post, with additional details... and beer.
Beer, Centrality, Distribution, Statistics
- Central Limit Theorem for Data Science - Aug 12, 2016.
This post is an introductory explanation of the Central Limit Theorem, and why it is (or should be) of importance to data scientists.
Centrality, Distribution, Statistics
- What Statistics Topics are Needed for Excelling at Data Science? - Aug 2, 2016.
Here is a list of skills and statistical concepts suggested for excelling at data science, roughly in order of increasing complexity.
Bayesian, Distribution, Machine Learning, Markov Chains, Probability, Regression, Statistics