-
Why data analysts should choose stories over statistics
Join the Crunch Data Conference in Budapest, Oct 16-18, with stellar speakers from companies like Facebook, Netflix and LinkedIn. Use the discount code ‘KDNuggets’ to save $100 off your conference ticket.
-
Natural Language in Python using spaCy: An Introduction
This article provides a brief introduction to working with natural language (sometimes called “text analytics”) in Python using spaCy and related libraries.
-
Customer Segmentation for R Users
This article shows you how to separate your customers into distinct groups based on their purchase behavior. For the R enthusiasts out there, I demonstrate what you can do with r/stats, ggradar, ggplot2, animation, and factoextra.
-
Beta Distribution: What, When & How
This article covers the beta distribution, and explains it using baseball batting averages.
-
Data Quality Assessment Is Not All Roses. What Challenges Should You Be Aware Of?
Of all data quality characteristics, we consider consistency and accuracy to be the most difficult ones to measure. Here, we describe the challenges that you may encounter and the ways to overcome them.
-
A Gentle Introduction to PyTorch 1.2
This comprehensive tutorial aims to introduce the fundamentals of PyTorch building blocks for training neural networks.
-
Applying Data Science to Cybersecurity Network Attacks & Events
Check out this detailed tutorial on applying data science to the cybersecurity domain, written by an individual with backgrounds in both fields.
-
5 Beginner Friendly Steps to Learn Machine Learning and Data Science with Python
“I want to learn machine learning and artificial intelligence, where do I start?” Here.
-
Reddit Post Classification
This article covers the implementation of a data scraping and natural language processing project which had two parts: scrape as many posts from Reddit’s API as allowed, and then use classification models to predict the origin of the posts.
-
BERT, RoBERTa, DistilBERT, XLNet: Which one to use?
Lately, varying improvements over BERT have been shown — and here I will contrast the main similarities and differences so you can choose which one to use in your research or application.