September 26-30: SIAM Conference on Mathematics of Data Science (Hybrid)
Join researchers, practitioners, educators, and students from around the world working in industry, government, laboratories, and academia for this thought-provoking conference.
By SIAM on Aug 10, 2022
The Difference Between L1 and L2 Regularization
Two types of regularized regression models are discussed here: Ridge Regression (L2 Regularization), and Lasso Regression (L1 Regularization)
By Benjamin O. Tayo on Aug 10, 2022
The Evolution From Artificial Intelligence to Machine Learning to Data Science
By the end of this article, you should be able to distinguish between these concepts.
By Nahla Davies on Aug 10, 2022
KDnuggets News, August 10: Free AI for Beginners Course • Most In-demand Artificial Intelligence Skills To Learn In 2022
Free AI for Beginners Course • Most In-demand Artificial Intelligence Skills To Learn In 2022 • Getting Started with SQL Cheatsheet • 3 Free Statistics Courses for Data Science • The Complete Collection of Data Science Projects – Part 1
By KDnuggets on Aug 10, 2022
3 Benefits to A/B Testing (+ Where to Get Started)
Let’s look at 3 concrete benefits that demonstrate why A/B testing is worth your time and effort. Then learn more about Ronny’s upcoming course, "Accelerating Innovation with A/B Testing."
By Sphere on Aug 9, 2022
6 Ways Businesses Can Benefit From Machine Learning
Machine learning is gaining popularity rapidly in the business world. Discover the ways that your business can benefit from machine learning.
By Rajalekshmy KR on Aug 9, 2022
3 Free Statistics Courses for Data Science
Statistics is one of the most in-demand data science skills. Master it for free with these online courses.
By Natassha Selvaraj on Aug 9, 2022
Top Posts August 1-7: Most In-demand Artificial Intelligence Skills To Learn In 2022
Most In-demand Artificial Intelligence Skills To Learn In 2022 • The 5 Hardest Things to Do in SQL • 10 Most Used Tableau Functions • Decision Trees vs Random Forests, Explained • Decision Tree Algorithm, Explained
By KDnuggets on Aug 8, 2022
Best Instagram Accounts to Follow for Data Science, Machine Learning & AI
I have put this blog together to help you figure out what Instagram accounts you should follow to get the best Data Science, Machine Learning, and Artificial Intelligence content.
By Nisha Arya on Aug 8, 2022
The Complete Collection of Data Science Projects – Part 1
The first part covers the list of Programming, Web scraping, Data Analytics, SQL, Business Intelligence, and Time Series projects.
By Abid Ali Awan on Aug 8, 2022
Free AI for Beginners Course
Microsoft has put together an AI course for beginners, consisting of a 12 week, 24 lesson curriculum, available for free to all.
By Matthew Mayo on Aug 8, 2022
Machine Learning Is Not Like Your Brain Part 6: The Importance of Precise Synapse Weights and the Ability to Set Them Quickly
In Part Six, I’ll show how limitations in synapses are even more of a problem. Precise synapse weights and the ability to set them quickly to a specific value are crucial to ML and biological neurons offer neither.
By Charles Simon on Aug 5, 2022
Where Does Data Come From?
In this article, we will go over the top five ways to collect or receive data, whether to help optimize an AI-driven machine or simply forecast future consumer demand.
By Nahla Davies on Aug 5, 2022
Why Emily Ekdahl chose co:rise to level up her job performance as a machine learning engineer
Find out what one of the first learners to complete the co:rise Machine Learning Foundations track said about her experience in the track and what she’s tackling next when she recently talked to Julia Stiglitz, co:rise co-founder and CEO.
By co:rise on Aug 4, 2022
Most In-demand Artificial Intelligence Skills To Learn In 2022
Artificial Intelligence (AI) is the process of programming a computer that can reason and learn like a human being and make decisions for itself.
By Kanchanapally Swapnil Raju on Aug 4, 2022
How to Deal with Categorical Data for Machine Learning
Check out this guide to implementing different types of encoding for categorical data, including a cheat sheet on when to use what type.
By Shelvi Garg on Aug 4, 2022
What are the Assumptions of XGBoost?
In this article, you will learn: how boosting relates to XGBoost; the features of XGBoost; how it reduces the loss function value and overfitting.
By Nisha Arya on Aug 4, 2022
Getting Started with SQL Cheatsheet
Want to get started with SQL? Check out the latest cheatsheet from KDnuggets to get up to speed on the basics of one of the most popular, useful, and in-demand languages in the world of data science.
By Matthew Mayo on Aug 3, 2022
A community developing a Hugging Face for customer data modeling
A year ago, Objectiv started a community of 50 companies to develop a Hugging Face like open-source project for customer data modeling. They key objective: enable building data models on one team/company’s dataset, and then run them seamlessly on another.
By Objectiv on Aug 3, 2022
Full Stack Everything? Organizational Intersections Between Data Science, Dev & Tech
Breakthrough value is found when teams collaborate at their intersections to come up with innovative solutions.
By Stan Pugsley on Aug 3, 2022
KDnuggets News, August 3: 10 Most Used Tableau Functions • Is Domain Knowledge Important for Machine Learning?
10 Most Used Tableau Functions • Is Domain Knowledge Important for Machine Learning? • ETL vs ELT: Data Integration Showdown • Free MLOps Crash Course for Beginners • 90% of Today’s Code is Written to Prevent Failure, and That’s a Problem
By KDnuggets on Aug 3, 2022
Preparing for a Data Analyst Interview
The interview process for the job can sometimes be a bit daunting. However, with the right knowledge and preparation, you can make sure you ace the interview and land your dream job. Read this summary of DataCamp’s full article on how to prepare for a data analyst interview, presenting some of the key points.
By DataCamp on Aug 2, 2022
Trust in AI is Priceless
Many machine learning models fail to deliver. Sadly, it’s often due to a lack of focus on data quality.
By Edouard d'Archimbaud on Aug 2, 2022
Decision Trees vs Random Forests, Explained
A simple, non-math heavy explanation of two popular tree-based machine learning models.
By Natassha Selvaraj on Aug 2, 2022
Top Posts July 25-31: The 5 Hardest Things to Do in SQL
The 5 Hardest Things to Do in SQL • Free Python Automation Course • Machine Learning Algorithms Explained in Less Than 1 Minute Each • Decision Tree Algorithm, Explained • The AIoT Revolution: How AI and IoT Are Transforming Our World
By KDnuggets on Aug 1, 2022
ETL vs ELT: Data Integration Showdown
Extract-Transform-Load vs Extract-Load-Transform: Data integration methods used to transfer data from one source to a data warehouse. Their aims are similar, but see how they differ.
By Nisha Arya on Aug 1, 2022
10 Most Used Tableau Functions
Learn about the most used string, number, date, logical, and aggregation Tableau functions.
By Abid Ali Awan on Aug 1, 2022
Free MLOps Crash Course for Beginners
Interest in, and demand for, MLOps is growing exponentially. What, exactly, is it? Why is it important? Where should you turn next to learn more? Check out this crash course to find the answers to these questions and more.
By Matthew Mayo on Aug 1, 2022
Online Training and Workshops with Nvidia
Learn about the Nvidia Self-Paced Online Training from their Deep Learning Institute.
By Abid Ali Awan on Jul 29, 2022
How ML Model Explainability Accelerates the AI Adoption Journey for Financial Services
Explainability and good model governance reduce risk and create the framework for ethical and transparent AI in financial services that eliminates bias.
By Yuktesh Kashyap on Jul 29, 2022
Be prepared to manage the threat with an MS in Cybersecurity from Bay Path University
Bay Path’s Master’s in Cybersecurity prepares students to step into the workforce and assume immediate responsibility for the management and oversight of such systems.
By Bay Path on Jul 28, 2022
How do I do that in Python?
This book from Manning is full of techniques and best practices for writing readable and maintainable Python code, with careful cross-referencing that reveals how the same concept can be used in different contexts.
By Manning on Jul 28, 2022
What is Text Classification?
We will define text classification, how it works, some of its most known algorithms, and provide data sets that might help start your text classification journey.
By Kevin Vu on Jul 28, 2022
90% of Today’s Code is Written to Prevent Failure, and That’s a Problem
Trying to anticipate and defend against these failures is the constant uphill battle that today’s engineers are up against. But it doesn’t have to be.
By Jeremiah Lowin on Jul 28, 2022
K-nearest Neighbors in Scikit-learn
Learn about the k-nearest neighbours algorithm, one of the most prominent workhorse machine learning algorithms there is, and how to implement it using Scikit-learn in Python.
By Nisha Arya on Jul 28, 2022
Why Upskilling in Data Vis Matters (& How to Get Started)
How do you condense the information you collect and present it to decision-makers in a clear, concise, and memorable way? This August, Noah Iliinsky will be opening up an intimate cohort and presenting an online course, Effective and Efficient Data Visualization.
By Sphere on Jul 27, 2022
Best Practices for Creating Domain-Specific AI Models
Here are some best practices and techniques for domain-specific model adaptation that worked for us time and again.
By Cathy Feng on Jul 27, 2022
Is Domain Knowledge Important for Machine Learning?
If you incorporate domain knowledge into your architecture and your model, it can make it a lot easier to explain the results, both to yourself and to an outside viewer. Every bit of domain knowledge can serve as a stepping stone through the black box of a machine learning model.
By Nate Rosidi on Jul 27, 2022
KDnuggets News, July 27: The AIoT Revolution: How AI and IoT Are Transforming Our World • Introduction to Hill Climbing Algorithm
Calculus for Data Science • Real-time Translations with AI • Using Numpy's argmax() • Using the apply() Method with Pandas DataFrames • An Introduction to Hill Climbing Algorithm in AI
By KDnuggets on Jul 27, 2022
Detecting Data Drift for Ensuring Production ML Model Quality Using Eurybia
This article will focus on a step-by-step data drift study using Eurybia an open-source python library
By Thomas Bouche on Jul 26, 2022
The 5 Hardest Things to Do in SQL
The 5 hardest things Josh Berry, a 15 year analytics professional, experienced while switching from Python to SQL. Offering examples, SQL code, and a resource to customize the SQL to your own project.
By Josh Berry on Jul 26, 2022
Does the Random Forest Algorithm Need Normalization?
Normalization is a good technique to use when your data consists of being scaled and your choice of machine learning algorithm does not have the ability to make assumptions on the distribution of your data.
By Nisha Arya on Jul 25, 2022
Using Scikit-learn’s Imputer
Learn about Scikit-learn’s SimpleImputer, IterativeImputer, KNNImputer, and machine learning pipelines.
By Abid Ali Awan on Jul 25, 2022
Top Posts July 18-24: Free Python Automation Course
Free Python Automation Course • Machine Learning Algorithms Explained in Less Than 1 Minute Each • Parallel Processing Large File in Python • 12 Most Challenging Data Science Interview Questions • Decision Tree Algorithm, Explained
By KDnuggets on Jul 25, 2022
Practical Deep Learning from fast.ai is Back!
Looking for a great course to go from machine learning zero to hero quickly? fast.ai has released the latest version of Practical Deep Learning For Coders. And it won't cost you a thing.
By Matthew Mayo on Jul 25, 2022
The AIoT Revolution: How AI and IoT Are Transforming Our World
The AIoT has the potential to transform industries and society, and it is already starting to have an impact. This article will explore the principles of AIoT, its benefits, and its current use.
By Nahla Davies on Jul 22, 2022
The Difficulty of Estimating the Carbon Footprint of Machine Learning
Is machine learning killing the planet? Probably not, but let's make sure it doesn't.
By Juha Kiili on Jul 22, 2022
Benefits Of Becoming A Data-First Enterprise
Data is everywhere but only data is not sufficient to reap the benefits that come with it. It needs to be organized to enable the organizations to make more informed business decisions. In this article, we will learn what are the various benefits of being a data-first enterprise and using the data in developing a business intelligence solution.
By Vidhi Chugh on Jul 22, 2022
An Introduction to Hill Climbing Algorithm in AI
Hill climbing is basically a search technique or informed search technique having different weights based on real numbers assigned to different nodes, branches, and goals in a path.
By Neeraj Agarwal on Jul 21, 2022
Using the apply() Method with Pandas Dataframes
Explore ways in which you can use apply () method to do different activities in a DataFrame.
By Priya Sengar on Jul 21, 2022
Using Numpy’s argmax()
A simple overview of using an often-misunderstood yet useful function in Python: Numpy's argmax(). Read the what, the how, and the why of argmax() here.
By Matthew Mayo on Jul 21, 2022
KDnuggets Top Posts for June 2022: 21 Cheat Sheets for Data Science Interviews
14 Essential Git Commands for Data Scientists • Statistics and Probability for Data Science • 20 Basic Linux Commands for Data Science Beginners • 3 Ways Understanding Bayes Theorem Will Improve Your Data Science • Learn MLOps with This Free Course • Primary Supervised Learning Algorithms Used in Machine Learning • Data Preparation with SQL Cheatsheet
By KDnuggets on Jul 20, 2022
Real-time Translations with AI
Language is now less of a barrier than it was in earlier days and the concept of real-time translation is no longer a fantasy with AI. Learn more!
By Neeraj Agarwal on Jul 20, 2022
Calculus for Data Science
In this article, we discuss the importance of calculus in data science and machine learning.
By Benjamin O. Tayo on Jul 20, 2022
KDnuggets News, July 20: Machine Learning Algorithms Explained in Less Than 1 Minute Each; Parallel Processing Large File in Python
Machine Learning Algorithms Explained in Less Than 1 Minute Each; Parallel Processing Large File in Python; Free Python Automation Course; How Does Logistic Regression Work?; 12 Most Challenging Data Science Interview Questions
By KDnuggets on Jul 20, 2022
5 Project Ideas to Stay Up-To-Date as a Data Scientist
The skills you have need maintenance and occasional updates. Doing an interesting data science project is what will keep you from getting rusty.
By Nate Rosidi on Jul 19, 2022
The Evolution of Apache Druid
And so true to the origins of its name, Apache Druid is shapeshifting - with the addition of a new multi-stage query engine.
By Gian Merlino on Jul 19, 2022
Hone Your Data Skills With Free Access to DataCamp
DataCamp has launched their Free Week, running now through to 11.59 pm ET on 24 July. For this whole week, anyone, anywhere, and anytime can have unlimited access to their site. Try it out now!
By DataCamp on Jul 19, 2022
Top Posts July 11-17: Machine Learning Algorithms Explained in Less Than 1 Minute Each
Also: Linear Algebra for Data Science; 10 Modern Data Engineering Tools; Parallel Processing Large File in Python; How Does Logistic Regression Work?
By KDnuggets on Jul 18, 2022
When Would Ensemble Techniques be a Good Choice?
When would ensemble techniques be a good choice? When you want to improve the performance of machine learning models - it’s that simple.
By Nisha Arya on Jul 18, 2022
12 Most Challenging Data Science Interview Questions
The simple but tricky data science questions that most people struggle to answer.
By Abid Ali Awan on Jul 18, 2022
Free Python Automation Course
Who wants to do boring stuff? Learn to automate the mundane with Python thanks to this free course. Set it and forget it!
By Matthew Mayo on Jul 18, 2022
How Does Logistic Regression Work?
Logistic regression is a machine learning classification algorithm that is used to predict the probability of certain classes based on some dependent variables
By Sonia Jessica on Jul 15, 2022
Why SQL Will Remain the Data Scientist’s Best Friend
Machine learning, big data analytics or AI may steal the headlines, but if you want to hone a smart, strategic skill that can elevate your career, look no further than SQL.
By Jorge Torres on Jul 15, 2022
The 5 Best Places To Host Your Data Science Portfolio
How can you showcase your data scientist skills and abilities? The answer to this question is online platforms where you can publish your portfolio and seize opportunities.
By Nahla Davies on Jul 15, 2022
Machine Learning Is Not Like Your Brain Part 5: Biological Neurons Can’t Do Summation of Inputs
See why biological neurons can’t do the most fundamental process of the artificial perceptron, the summation of inputs.
By Charles Simon on Jul 14, 2022
MLOps: The Key To Pushing AI Into The Mainstream
In this blog, we will aim at discussing the reasons that make MLOps an essential aspect of pushing AI mainstream. Besides, we will highlight the capabilities of MLOps as a catalyst for AI implementation.
By Kanika Vatsyayan on Jul 14, 2022
Free Artificial Intelligence And Deep Learning Crash Course
Deep learning forms the backbone of modern day artificial intelligence. Learn more about the important aspects of this connection with this freely available course.
By Matthew Mayo on Jul 14, 2022
Learn from Northwestern Data Science experts
Build statistical and analytical expertise as well as the management and leadership skills necessary to implement high-level, data-driven decisions in Northwestern’s online Master of Science in Data Science program.
By Northwestern on Jul 13, 2022
Parallel Processing Large File in Python
Learn various techniques to reduce data processing time by using multiprocessing, joblib, and tqdm concurrent.
By Abid Ali Awan on Jul 13, 2022
Machine Learning Algorithms Explained in Less Than 1 Minute Each
Learn about some of the most well known machine learning algorithms in less than a minute each.
By Nisha Arya on Jul 13, 2022
KDnuggets News, July 13: Linear Algebra for Data Science; 10 Modern Data Engineering Tools
Linear Algebra for Data Science; 10 Modern Data Engineering Tools; Python String Processing Cheatsheet; Simple Salary Guide for Tech Experts 2022; 16 Essential DVC Commands for Data Science
By KDnuggets on Jul 13, 2022
3 things you didn’t know about the SAS Academy for Data Science
The SAS Academy for Data Science is one of many paths to becoming a data scientist. It is designed for those who have a background in programming and mathematics, who want to upskill as part of a career change or those who want to gain the hands-on practical skills that can advance your professional growth and experience with SAS and data science.
By SAS on Jul 12, 2022
How to Convert an RGB Image to Grayscale
This post is about working with a mixture of color and grayscale images and needing to transform them into a uniform format - all grayscale. We'll be working in Python using the Pillow, Numpy, and Matplotlib packages.
By Brandon Rohrer on Jul 12, 2022
Data Preparation and Raw Data in Machine Learning
In this article, I will describe the data preparation techniques for machine learning.
By Neeraj Agarwal on Jul 12, 2022